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NEISSERIA MENINGITIDIS ANTIGENS AND COMPOSITIONS 

This application is a continuation-in-part of the following U.S. Provisional Patent 
applications, from each of which priority is claimed, and eacH of which is incorporated by 
reference in its entirety: 60/083,758 (filed May 1, 1998); 60/094,869 (filed July 31, 1998); 
60/098,994 (filed September 2, 1998); 60/099,062 (filed September 2, 1998); 60/103,749 
(filed October 9, 1 998); 60/103,794 (filed October 9, 1998); 60/103,796 (filed October 9, 
1998); and 60/121,528 (filed February 25, 1999). 

This invention relates to antigens from the bacterial species: Neisseria meningitidis 
and Neisseria gonorrhoeae. 



BACKGROUND 

Neisseria meningitidis is a non-motile, gram negative diplococcus human pathogen. It 
colonizes the pharynx, causing meningitis and, occasionally, septicaemia in the absence of 
meningitis. It is closely related to N gonorrhoea, although one feature that clearly 
differentiates meningococcus from gonococcus is the presence of a polysaccharide capsule 
that is present in all pathogenic meningococci. 

N. meningitidis causes both endemic and epidemic disease. In the United States the 
attack rate is 0.6-1 per 100,000 persons per year, and it can be much greater during outbreaks, 
(see Lieberman et al (1996) Safety and Immunogenicity of a Serogroups A/C Neisseria 
meningitidis Oligosaccharide-Protein Conjugate Vaccine in Young Children. JAMA 
275(19):1499-1503; Schuchat et al (1997) Bacterial Meningitis in the United States in 1995. 
NEnglJMed 337(14):970-976). In developing countries, endemic disease rates are much 
higher and during epidemics incidence rates can reach 500 cases per 100,000 persons per 
year. Mortality is extremely high, at 10-20% in the United States, and much higher in 
developing countries. Following the introduction of the conjugate vaccine against 
Haemophilus influenzae, N meningitidis is the major cause of bacterial meningitis at all ages 
in the United States (Schuchat et al (1997) supra). 

Based on the organism's capsular polysaccharide, 12 serogroups of N. meningitidis 
have been identified. Group A is the pathogen most often implicated in epidemic disease in 
sub-Saharan Africa. Serogroups B and C are responsible for the vast majority of cases in the 
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United States and in most developed countries. Serogroups W135 and Y are responsible for 
the rest of the cases in the United States and developed countries. The meningococcal vaccine 
currently in use is a tetravalent polysaccharide vaccine composed of serogroups A, C, Y and 
W135. Although efficacious in adolescents and adults, it induces a poor immune response and 
short duration of protection, and cannot be used in infants [eg. Morbidity and Mortality 
weekly report, Vol.46, No. RR-5 (1997)]. This is because polysaccharides are T-cell 
independent antigens that induce a weak immune response that cannot be boosted by repeated 
immunization. Following the success of the vaccination against K influenzae, conjugate 
vaccines against serogroups A and C have been developed and are at the final stage of clinical 
testing (Zollinger WD 4< New and Improved Vaccines Against Meningococcal Disease". In: 
New Generation Vaccines, supra, pp. 469-488; Lieberman et al (1996) supra; Costantino et al 
(1992) Development and phase I clinical testing of a conjugate vaccine against 
meningococcus A and C. Vaccine 10:691-698). 

Meningococcus B (menB) remains a problem, however. This serotype currently is 
responsible for approximately 50% of total meningitis in the United States, Europe, and South 
America. The polysaccharide approach cannot be used because the menB capsular 
polysaccharide is a polymer of a(2-8>-linked iV-acetyl neuraminic acid that is also present in 
mammalian tissue. This results in tolerance to the antigen; indeed, if an immune response 
were elicited, it would be anti-self, and therefore undesirable. In order to avoid induction of 
autoimmunity and to induce a protective immune response, the capsular polysaccharide has, 
for instance, been chemically modified substituting the //-acetyl groups with iV-propionyl 
groups, leaving the specific antigenicity unaltered (Romero & Outschoorn (1994) Current 
status of Meningococcal group B vaccine candidates: capsular or non-capsular? Clin 
Microbiol Rev 7(4):559«575). 

Alternative approaches to menB vaccines have used complex mixtures of outer 
membrane proteins (OMPs), containing either the OMPs alone, or OMPs enriched in porins, 
or deleted of the class 4 OMPs that are believed to induce antibodies that block bactericidal 
activity. This approach produces vaccines that are not well characterized. They are able to 
protect against the homologous strain, but are not effective at large where there are many 
antigenic variants of the outer membrane proteins. To overcome the antigenic variability, 
multivalent vaccines containing up to nine different porins have been constructed (eg. 
Poolman JT (1992) Development of a meningococcal vaccine. Infect. Agents Dis. 4: 13-28). 
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Additional proteins to be used in outer membrane vaccines have been the opa and opc 
proteins, but none of these approaches have been able to overcome the antigenic variability 
(eg. Ala'Aldeen & Borriello (1996) The meningococcal transferrin-binding proteins 1 and 2 
are both surface exposed and generate bactericidal antibodies capable of killing homologous 
and heterologous strains. Vaccine 14(l):49-53). 

A certain amount of sequence data is available for meningococcal and gonoccocal 
genes and proteins (e.g. EP-A-0467714, W096/29412), but this is by no means complete. 
Other men B proteins may include those listed in WO 97/28273, WO 96/29412, WO 
95/03413, US 5,439,808, and US 5,879,686. 

The provision of further sequences could provide an opportunity to identify secreted 
or surface-exposed proteins that are presumed targets for the immune system and which are 
not antigenically variable. For instance, some of the identified proteins could be components 
of efficacious vaccines against meningococcus B, some could be components of vaccines 
against all meningococcal serotypes, and others could be components of vaccines against all 
pathogenic Neisseriae including Neisseria meningitidis or Neisseria gonorrhoeae. Those 
sequences specific to N. meningitidis or N gonorrhoeae that are more highly conserved are 
further preferred sequences. 

It is thus an object of the invention is to provide Neisserial DNA sequences which 
encode proteins that are antigenic or immunogenic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the products of protein expression and purification of the predicted 
ORF 919 as cloned and expressed in E. coli. 

Fig. 2 illustrates the products of protein expression and purification of the predicted 
ORF 279 as cloned and expressed in E. coli. 

Fig. 3 illustrates the products of protein expression and purification of the predicted 
ORF 576-1 as cloned and expressed in E. coli. 

Fig. 4 illustrates the products of protein expression and purification of the predicted 
ORF 519-1 as cloned and expressed in E. coli. 

Fig. 5 illustrates the products of protein expression and purification of the predicted 
ORF 121-1 as cloned and expressed in E. coli. 
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Fig. 6 illustrates the products of protein expression and purification of the predicted 
ORF 128-1 as cloned and expressed in E. coll 

Fig. 7 illustrates the products of protein expression and purification of the predicted 
ORF 206 as cloned and expressed in E. coli. 

Fig. 8 illustrates the products of protein expression and purification of the predicted 
ORF 287 as cloned and expressed in E. coli. 

Fig. 9 illustrates the products of protein expression and purification of the predicted 
ORF 406 as cloned and expressed in E. coli. 

Fig. 10 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 919 as cloned and expressed in E. coli. 

Fig. 11 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 279 as cloned and expressed in E. coli. 

Fig. 12 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 576-1 as cloned and expressed in E. coll 

Fig. 13 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 519-1 as cloned and expressed in E. coll 

Fig. 14 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 121-1 as cloned and expressed in E. coll 

Fig. 15 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 128-1 as cloned and expressed in E. coli. 

Fig. 16 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 206 as cloned and expressed in E. coll 

Fig. 17 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 287 as cloned and expressed in E. coll 

Fig. 1 8 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 406 as cloned and expressed in E. coli. 

Fig. 19 shows an alignment comparison of amino acid sequences for ORF 225 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 
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Fig. 20 shows an alignment comparison of amino acid sequences for ORF 235 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 21 shows an alignment comparison of amino acid sequences for ORF 287 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 22 shows an alignment comparison of amino acid sequences for ORF 519 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 23 shows an alignment comparison of amino acid sequences for ORF 919 for 
several strains of Neisseria, Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

THE INVENTION 

The invention provides proteins comprising the TV. meningitidis amino acid sequences 
and N. gonorrhoeae amino acid sequences disclosed in the examples. 

It also provides proteins comprising sequences homologous (i.e., those having 
sequence identity) to the N meningitidis amino acid sequences disclosed in the examples. 
Depending on the particular sequence, the degree of homology (sequence identity) is 
preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). These proteins 
include mutants and allelic variants of the sequences disclosed in the examples. Typically, 
50% identity or more between two proteins is considered to be an indication of functional 
equivalence. Identity between proteins is preferably determined by the Smith- Waterman 
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homology search algorithm as implemented in MPSRCH program (Oxford Molecular) using 
an affine gap search with parameters:gap penalty 12, gap extension penalty 1. 

The invention further provides proteins comprising fragments of the N. meningitidis 
amino acid sequences and N. gonorrhoeae amino acid sequences disclosed in the examples. 
The fragments should comprise at least n consecutive amino acids from the sequences and, 
depending on the particular sequence, n is 7 or more (eg. 8, 10, 12, 14, 16, 18, 20 or more). 
Preferably the fragments comprise an epitope from the sequence. 

The proteins of the invention can, of course, be prepared by various means (eg. 
recombinant expression, purification from cell culture, chemical synthesis etc.) and in various 
forms (eg. native, fusions etc.). They are preferably prepared in substantially pure or isolated 
form (ie. substantially free from other N. meningitidis orN. gonorrhoeae host cell proteins) 

According to a further aspect, the invention provides antibodies which bind to these 
proteins. These may be polyclonal or monoclonal and may be produced by any suitable 
means. 

According to a further aspect, the invention provides nucleic acid comprising the 
N. meningitidis nucleotide sequences and N. gonorrhoeae nucleotide sequences disclosed in 
the examples. 

According to a further aspect, the invention comprises nucleic acids having sequence 
identity of greater than 50% (e.g., 60%, 70%, 80%, 90%, 95%, 99% or more) to the nucleic 
acid sequences herein. Sequence identity is determined as above-discussed. 

According to a further aspect, the invention comprises nucleic acid that hybridizes to 
the sequences provided herein. Conditions for hybridization are set forth herein. 

Nucleic acid comprising fragments of these sequences are also provided. These should 
comprise at least n consecutive nucleotides from the TV. meningitidis sequences or N. 
gonorrhoeae sequences and depending on the particular sequence, n is 10 or more (eg 12, 14, 
15, 18, 20, 25, 30, 35, 40 or more). 

According to a further aspect, the invention provides nucleic acid encoding the 
proteins and protein fragments of the invention. 

It should also be appreciated that the invention provides nucleic acid comprising 
sequences complementary to those described above (eg. for antisense or probing purposes). 

Nucleic acid according to the invention can, of course, be prepared in many ways (eg. 
by chemical synthesis, in part or in whole, from genomic or cDNA libraries, from the 
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organism itself etc.) and can take various forms (eg. single stranded, double stranded, vectors, 
probes etc.). 

In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, 
such as those containing modified backbones, and also protein nucleic acids (PNA) etc. 

According to a further aspect, the invention provides vectors comprising nucleotide 
sequences of the invention (eg. expression vectors) and host cells transformed with such 
vectors. 

According to a further aspect, the invention provides compositions comprising protein, 
antibody, and/or nucleic acid according to the invention. These compositions may be suitable 
as vaccines, for instance, or as diagnostic reagents or as immunogenic compositions. 

The invention also provides nucleic acid, protein, or antibody according to the 
invention for use as medicaments (eg. as vaccines) or as diagnostic reagents. It also provides 
the use of nucleic acid, protein, or antibody according to the invention in the manufacture of 
(I) a medicament for treating or preventing infection due to Neisserial bacteria (ii) a 
diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised 
against Neisserial bacteria or (iii) for raising antibodies. Said Neisserial bacteria may be any 
species or strain (such as N. gonorrhoeae) but are preferably N. meningitidis, especially strain 
B or strain C. 

The invention also provides a method of treating a patient, comprising administering 
to the patient a therapeutically effective amount of nucleic acid, protein, and/or antibody 
according to the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of 
culturing a host cell according to the invention under conditions which induce protein 
expression. 

A process for detecting polynucleotides of the invention is provided, comprising the 
steps of: (a) contacting a nucleic probe according to the invention with a biological sample 
under hybridizing conditions to form duplexes; and (b) detecting said duplexes. 

A process for detecting proteins of the invention is provided, comprising the steps of: 
(a) contacting an antibody according to the invention with a biological sample under 
conditions suitable for the formation of an antibody-antigen complexes; and (b) detecting 
said complexes. 
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Having now generally described the inventiion, the same will be more readily 
understood through reference to the following examples which are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

Methodology - Summary of standard procedures and techniques. 
General 

This invention provides Neisseria meningitidis menB nucleotide sequences, amino 
acid sequences encoded therein. With these disclosed sequences, nucleic acid probe assays 
and expression cassettes and vectors can be produced. The expression vectors can be 
transformed into host cells to produce proteins. The purified or isolated polypeptides (which 
may also be chemically synthesized) can be used to produce antibodies to detect menB 
proteins. Also, the host cells or extracts can be utilized for biological assays to isolate 
agonists or antagonists, hi addition, with these sequences one can search to identify open 
reading frames and identify amino acid sequences. The proteins may also be used in 
immunogenic compositions, antigenic compositions and as vaccine components. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, which are within the skill of the art. Such techniques are explained fully in the 
literature e.g., Sambrook Molecular Cloning; A Laboratory Manual Second Edition (1989); 
DNA Cloning, Volumes J and ii (D.N Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 
ed, 1984); Nucleic Acid Hybridization (B.D. Hames & SJ. Higgins eds. 1984); Transcription 
and Translation (B.D. Hames & SJ. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney 
ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide to 
Molecular Cloning (1984); the Methods in Enzymology series (Academic Press, Inc.), 
especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987), 
Immunochemical Methods in Cell and Molecular Biology (Academic Press, London); Scopes, 
(1987) Protein Purification: Principles and Practice, Second Edition (Springer- Verlag, 
N.Y.), and Handbook of Experimental Immunology, Volumes I-IV (DM. Weir and C.C. 
Blackwell eds 1986). 

Standard abbreviations for nucleotides and amino acids are used in this specification. 
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All publications, patents, and patent applications cited herein are incorporated in full 
by reference. 

Expression systems 

The Neisseria menB nucleotide sequences can be expressed in a variety of different 
expression systems; for example those used with mammalian cells, plant cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammalian promoter is any 
DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3 1 ) transcription of a coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiating region, which is usually placed proximal to the 5' 
end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream 
of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to 
begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. 
An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation (Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian 
Cells." In Molecular Cloning: A Laboratory Manual, ,2nd ed.). 

Mammalian viral genes are often highly expressed and have a broad host range; 
therefore sequences encoding mammalian viral genes provide particularly useful promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In 
addition, sequences derived from non-viral genes, such as the murine metallothionein gene, 
also provide useful promoter sequences. Expression may be either constitutive or regulated 
(inducible). Depending on the promoter selected, many promotes may be inducible using 
known substrates, such as the use of the mouse mammary tumor virus (MMTV) promoter 
with the glucocorticoid responsive element (GRE) that is induced by glucocorticoid in 
hormone-responsive transformed cells (see for example, U.S. Patent 5,783,681). 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will usually increase expression levels. An enhancer is a 
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regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to 
homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of more 
than 1000 nucleotides from the promoter (Maniatis et al. (1987) Science 236:1237; Alberts et 
al. (1989) Molecular Biology of the Cell, 2nd cd.). Enhancer elements derived from viruses 
may be particularly useful, because they usually have a broader host range. Examples include 
the SV40 early gene enhancer (Dijkema et al (1985) EMBOJ. 4:761) and the 
enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus 
(Gorman et al. (1982b) Proc. Natl Acad. ScL 79:6777) and from human cytomegalovirus 
(Boshart et al. (1985) Cell 47:521). Additionally, some enhancers are regulatable and become 
active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and 
Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) Science 236:1237). 

A DNA molecule may be expressed intracellularly in mammalian cells. A promoter 
sequence may be directly linked with the DNA molecule, in which case the first amino acid at 
the N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign protein in mammalian cells. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually 
encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of 
the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence 
that provides for secretion of a foreign protein in mammalian cells. 

Usually, transcription termination and polyadenylation sequences recognized by 
mammalian cells are regulatory regions located 3' to the translation stop codon and thus, 
together with the promoter elements, flank the coding sequence. The 3 1 terminus of the 
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation 
(Birnstiel et al. (1985) Cell 47:349; Proudfoot and Whitelaw (1988) "Termination and 3' end 
processing of eukaryotic RNA. In Transcription and splicing (ed. B.D. Hames and D.M. 
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Glover); Proudfoot (1989) Trends Biochem, ScL 14: 105). These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. 
Examples of transcription terrninator/polyadenylation signals include those derived from 
SV40 (Sambrook et al (1989) "Expression of cloned genes in cultured mammalian cells." In 
Molecular Cloning: A Laboratory Manual). 

Usually, the above described components, comprising a promoter, polyadenylation 
signal, and transcription termination sequence are put together into expression constructs. 
Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may 
also be included in an expression construct, if desired. Expression constructs are often 
maintained in a repiicon, such as an extrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication 
systems include those derived from animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication systems of papovaviruses, such as 
SV40 (Gluzman (1981) Cell 23:175) or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr 
virus. Additionally, the repiicon may have two replication systems, thus allowing it to be 
maintained, for example, in mammalian cells for expression and in a prokaryotic host for 
cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include 
pMT2 (Kaufinan et al. (1989) MoL Cell Biol 9:946) and pHEBO (Shimizu et al. (1986) Mol 
Cell Biol 5:1074). 

The transformation procedure used depends upon the host to be transformed. Methods 
for introduction of heterologous polynucleotides into mammalian cells are known in the art 
and include dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fusion, electroporation, encapsulation of the 
poiynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell lines available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby 
hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma 
cells (e.g., Hep G2), and a number of other cell lines. 
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ii. Plant Cellular Expression Systems 

There are many plant cell culture and whole plant genetic expression systems known 
in the art. Exemplary plant cellular genetic expression systems include those described in 
patents, such as: U.S. 5,693,506; US 5,659,122; and US 5,608,143. Additional examples of 
genetic expression in plant cell culture has been described by Zenk, Phytochemistry 30:3861- 
3863 (1991). Descriptions of plant protein signal peptides may be found in addition to the 
references described above in Vaulcombe et ah, MoL Gen. Genet. 209:33-40 (1987); Chandler 
et al., Plant Molecular Biology 3:407-418 (1984); Rogers, J. Biol. Chem. 260:3731-3738 
(1985); Rothstein et al., Gene 55:353-356 (1987); Whittier et al., Nucleic Acids Research 
15:2515-2535 (1987); Wirsel et ah, Molecular Microbiology 3:3-14 (1989); Yu et al., Gene 
122:247-253 (1992). A description of the regulation of plant gene expression by the 
phytohormone, gibberellic acid and secreted enzymes induced by gibberellic acid can be 
found in R.L. Jones and J. MacMillhx Gibberellins: in: Advanced Plant Physiology,. Malcolm 
B. Wilkins, ed., 1984 Pitman Publishing Limited, London, pp. 21-52. References that 
describe other metabolically-regulated genes: Sheen, Plant Cell, 2:1027-1038(1990); Maas et 
al., EMBOJ. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl Acad. ScL 84:1337-1339 
(1987) 

Typically, using techniques known in the art, a desired polynucleotide sequence is 
inserted into an expression cassette comprising genetic regulatory elements designed for 
operation in plants. The expression cassette is inserted into a desired expression vector with 
companion sequences upstream and downstream from the expression cassette suitable for 
expression in a plant host. The companion sequences will be of plasmid or viral origin and 
provide necessary characteristics to the vector to permit the vectors to move DNA from an 
original cloning host, such as bacteria, to the desired plant host. The basic bacterial/plant 
vector construct will preferably provide a broad host range prokaryote replication origin; a 
prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not 
readily amenable to detection, the construct will preferably also have a selectable marker gene 
suitable for determining if a plant cell has been transformed. A general review of suitable 
markers, for example for the members of the grass family, is found in Wilmink and Dons, 
1993, Plant MoL Biol Reptr, 1 1(2): 165-185. 
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Sequences suitable for permitting integration of the heterologous sequence into the 
plant genome are also recommended. These might include transposon sequences and the like 
for homologous recombination as well as Ti sequences which permit random insertion of a 
heterologous expression cassette into a plant genome. Suitable prokaryote selectable markers 
include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known in the art. 

The nucleic acid molecules of the subject invention may be included into an 
expression cassette for expression of the protein(s) of interest. Usually, there will be only one 
expression cassette, although two or more are feasible. The recombinant expression cassette 
will contain in addition to the heterologous protein encoding sequence the following 
elements, a promoter region, plant 5' untranslated sequences, initiation codon depending upon 
whether or not the structural gene comes equipped with one, and a transcription and 
translation termination sequence. Unique restriction enzyme sites at the 5' and 3' ends of the 
cassette allow for easy insertion into a pre-existing vector. 

A heterologous coding sequence may be for any protein relating to the present 
invention. The sequence encoding the protein of interest will encode a signal peptide which 
allows processing and translocation of the protein, as appropriate, and will usually lack any 
sequence which might result in the binding of the desired protein of the invention to a 
membrane. Since, for the most part, the transcriptional initiation region will be for a gene 
which is expressed and translocated during germination, by employing the signal peptide 
which provides for translocation, one may also provide for translocation of the protein of 
interest. In this way, the protein(s) of interest will be translocated from the cells in which they 
are expressed and may be efficiently harvested. Typically secretion in seeds are across the 
aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required 
that the protein be secreted from the cells in which the protein is produced, this facilitates the 
isolation and purification of the recombinant protein. 

Since the ultimate expression of the desired gene product will be in a eucaryotic cell it 
is desirable to determine whether any portion of the cloned gene contains sequences which 
will be processed out as introns by the host's splicosome machinery. If so, site-directed 
mutagenesis of the "intron" region may be conducted to prevent losing a portion of the genetic 
message as a false intron code, Reed and Maniatis, Cell 41 :95-105, 1985. 



WO 99/57280 



14 



PCT/US99/09346 



The vector can be microinjected directly into plant cells by use of micropipettes to 
mechanically transfer the recombinant DN A. Crossway, Mol. Gen. Genet, 202:179-185, 
1985. The genetic material may also be transferred into the plant cell by using polyethylene 
glycol, Krens, et al., Nature, 296, 72-74, 1982. Another method of introduction of nucleic 
acid segments is high velocity ballistic penetration by small particles with the nucleic acid 
either within the matrix of small beads or particles, or on the surface, Klein, et al., Nature, 
327, 70-73, 1987 and Knudsen and Muller, 1991, Planta, 185:330-336 teaching particle 
bombardment of barley endosperm to create transgenic barley. Yet another method of 
introduction would be fusion of protoplasts with other entities, either minicells, cells, 
lysosomes or other fusible lipid-surfaced bodies, Fraley, et al., Proa Natl Acad. ScL USA, 79, 
1859-1863,1982. 

The vector may also be introduced into the plant cells by electroporation. (Fromm et 
al., Proa Natl Acad. Sci. USA 82:5824, 1985). In this technique, plant protoplasts are 
electroporated in the presence of plasmids containing the gene construct. Electrical impulses 
of high field strength reversibly permeabilize biomembranes allowing the introduction of the 
plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus. 

All plants from which protoplasts can be isolated and cultured to give whole 
regenerated plants can be transformed by the present invention so that whole plants are 
recovered which contain the transferred gene. It is known that practically all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of 
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable 
plants include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, 
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersion, 
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Hererocallis, Nemesia, Pelargonium, Panicum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea* Triticum, 
Sorghum, and Datura. 

Means for regeneration vary from species to species of plants, but generally a 
suspension of transformed protoplasts containing copies of the heterologous gene is first 
provided. Callus tissue is formed and shoots may be induced from callus and subsequently 
rooted. Alternatively, embryo formation can be induced from the protoplast suspension. 
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These embryos germinate as natural embryos to form plants. The culture media will generally 
contain various amino acids and hormones, such as auxin and cytokinins. It is also 
advantageous to add glutamic acid and proline to the medium, especially for such species as 
corn and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If these three 
variables are controlled, then regeneration is fully reproducible and repeatable. 

In some plant cell culture systems, the desired protein of the invention may be 
excreted or alternatively, the protein may be extracted from the whole plant. Where the 
desired protein of the invention is secreted into the medium, it may be collected. 
Alternatively, the embryos and embryoless-half seeds or other plant tissue may be 
mechanically disrupted to release any secreted protein between cells and tissues. The mixture 
may be suspended in a buffer solution to retrieve soluble proteins. Conventional protein 
isolation and purification methods will be then used to purify the recombinant protein. 
Parameters of time, temperature pH, oxygen, and volumes will be adjusted through routine 
methods to optimize expression and recovery of heterologous protein. 

hi. Baculo virus Systems 

The polynucleotide encoding the protein can also be inserted into a suitable insect 
expression vector, and is operably linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. Generally, the components of 
the expression system include a transfer vector, usually a bacterial plasmid, which contains 
both a fragment of the baculovirus genome, and a convenient restriction site for insertion of 
the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specific fragment in the transfer vector (this allows for the 
homologous recombination of the heterologous gene in to the baculovirus genome); and 
appropriate insect host cells and growth media. 

After inserting the DNA sequence encoding the protein into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the vector 
and viral genome are allowed to recombine. The packaged recombinant virus is expressed 
and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, inter 
alia, Invitrogen, San Diego CA ("MaxBac" kit). These techniques are generally known to 
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those skilled in the art and fully described in Summers and Smith, Texas Agricultural 
Experiment Station Bulletin No. 1555 (1987) (hereinafter "Summers and Smith"). 

Prior to inserting the DNA sequence encoding the protein into the baculovirus 
genome, the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are usually assembled into an 
intermediate transplacement construct (transfer vector). This construct may contain a single 
gene and operably linked regulatory elements; multiple genes, each with its owned set of 
operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance 
in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to 
be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes into 
AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters the polyhedrin start codon from 
ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream from the 
ATT; see Luckow and Summers, Virology (1989) 77:31. 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. 
(1988) Ann. Rev. Microbiol, 42:117) and a prokaryotic ampicillin-resistance (amp) gene and 
origin of replication for selection and propagation in E. coli. 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus 
promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5* to 3') transcription of a coding sequence (e.g., structural gene) 
into mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5* end of the coding sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription initiation site. A baculovirus 
transfer vector may also have a second domain called an enhancer, which, if present, is 
usually distal to the structural gene. Expression may be either regulated or constitutive. 

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide 
particularly useful promoter sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et al., (1986) "The Regulation of Baculovirus 
Gene Expression, 1 ' in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler); EPO 
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Publ. Nos. 127 839 and 155 476; and the gene encoding the plO protein, Vlak et al., (1988), J. 
Gen. Virol 69:765. 

DNA encoding suitable signal sequences can be derived from genes for secreted insect 
or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) 
Gene, 73:409). Alternatively, since the signals for mammalian cell posttranslational 
modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) 
appear to be recognized by insect cells, and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, 
leaders of non-insect origin, such as those derived from genes encoding human (alpha) a- 
interferon, Maeda et al., (1985), Nature 315:592; human gastrin-releasing peptide, Lebacq- 
Verheyden et aL, (1988), Molec. Cell. Biol. 5:3129; human IL-2, Smith et al., (1985) Proa 
Nat'l Acad. Sci. USA, 52:8404; mouse IL-3, (Miyajima et al., (1987) Gene 55:273; and human 
glucocerebrosidase, Martin et al. (1988) DNA, 7:99, can also be used to provide for secretion 
in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if it is 
expressed with the proper regulatory sequences, it can be secreted. Good intracellular 
expression of nonfused foreign proteins usually requires heterologous genes that ideally have 
a short leader sequence containing suitable translation initiation signals preceding an ATG 
start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein 
by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally secreted 
can be secreted from the insect cell by creating chimeric DNA molecules that encode a fusion 
protein comprised of a leader sequence fragment that provides for secretion of the foreign 
protein in insects. The leader sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic 
reticulum. 

After insertion of the DNA sequence and/or the gene encoding the expression product 
precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of 
the transfer vector and the genomic DNA of wild type baculovirus - usually by co- 
transfection. The promoter and transcription termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus virus are known in the art. (See 
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Summers and Smith supra; Ju et al. (1987); Smith et al., Mol Cell Biol. (1983) 3:2156; and 
Luckow and Summers (1989)). For example, the insertion can be into a gene such as the 
polyhedrin gene, by homologous double crossover recombination; insertion can also be into a 
restriction enzyme site engineered into the desired baculovirus gene. Miller et al., (1989), 
Bioessays 4:91 . The DNA sequence, when cloned in place of the polyhedrin gene in the 
expression vector, is flanked both 5' and 3' by polyhedrin-specific sequences and is positioned 
downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the 
native virus, is produced at very high levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain 
embedded particles. These occlusion bodies, up to 15 in size, are highly refractilc, giving 
them a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus 
from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, the plaques are screened under the 
light microscope for the presence (indicative of wild-type virus) or absence (indicative of 
recombinant virus) of occlusion bodies. Current Protocols in Microbiology Vol. 2 (Ausubel 
et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, supra\ Miller et al. (1989). 

Recombinant baculovirus expression vectors have been developed for infection into 
several insect cells. For example, recombinant baculoviruses have been developed for, inter 
alia: Aedes aegypti ,Autographa californica* Bombyx mori, Drosophila melanogaster, 
Spodopterafrugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al., 
(1985) Virol Jtf:153; Wright (1986) Nature 327:718; Smith et al., (1983) Mol Cell Biol 
3:2156; and see generally, Fraser, et al (1989) In Vitro Cell Dev. Biol 25:225). 

Cells and cell culture media are commercially available for both direct and fusion 
expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See. e.g., Summers and Smith supra. 
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The modified insect cells may then be grown in an appropriate nutrient medium, 
which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to high 
density, and expression induced. Alternatively, where expression is constitutive, the product 
will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
nutrients. The product may be purified by such techniques as chromatography, e.g., HPLC, 
affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient 
centrifugation; solvent extraction, or the like. As appropriate, the product may be further 
purified, as required, so as to remove substantially any insect proteins which are also secreted 
in the medium or result from lysis of insect cells, so as to provide a product which is at least 
substantially free of host debris, e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host cells derived from the 
transformants are incubated under conditions which allow expression of the recombinant 
protein encoding sequence. These conditions will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to those of ordinary skill in the art, based 
upon what is known in the art. 

iv. Bacterial Systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA 
sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed proximal to the 5' end of the coding 
sequence. This transcription initiation region usually includes an RNA polymerase binding 
site and a transcription initiation site. A bacterial promoter may also have a second domain 
called an operator, that may overlap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negative regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby inhibit transcription of a specific gene. 
Constitutive expression may occur in the absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved by a gene activator protein binding 
sequence, which, if present is usually proximal (5') to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite activator protein (CAP), which helps 
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initiate transcription of the lac operon in Escherichia coli (E. coli) (Raibaud et aL (1984) 
Annu. Rev. Genet. ] 8:173). Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, 
such as galactose, lactose (lac) (Chang et aL (1977) Nature 795:1056), and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp) (Goeddel et aL (1980) Nuc. Acids Res. 5:4057; Yelverton et aL (1981) NucL 
Acids Res. 9:731; U.S. Patent 4,738,921; EPO Publ. Nos. 036 776 and 121 775). The beta- 
lactamase (bla) promoter system (Weissmann (1981) "The cloning of interferon and other 
mistakes." In Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et aL (1 981) 
Nature 292:128) and T5 (U.S. Patent 4,689,406) promoter systems also provide useful 
promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as bacterial 
promoters. For example, transcription activation sequences of one bacterial or bacteriophage 
promoter may be joined with the operon sequences of another bacterial or bacteriophage 
promoter, creating a synthetic hybrid promoter (U.S. Patent 4,551,433). For example, the tac 
promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon 
sequences that is regulated by the lac repressor (Amann et aL (1983) Gene 25:167; de Boer et 
aL (1983) Proc. Natl. Acad. ScL 50:21). Furthermore, a bacterial promoter can include 
naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial 
RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial 
origin can also be coupled with a compatible RNA polymerase to produce high levels of 
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter 
system is an example of a coupled promoter system (Studier et aL (1986) J. Mol. BioL 
189:1 13; Tabor et aL (1985) Proc Natl. Acad. Sci. 52:1074). In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. 
No. 267 851). 

In addition to a functioning promoter sequence, an efficient ribosome binding site is 
also useful for the expression of foreign genes in prokaryotes. In E. coli, the ribosome binding 
site is called the Shine-Dalgamo (SD) sequence and includes an initiation codon (ATG) and a 
sequence 3-9 nucleotides in length located 3-1 1 nucleotides upstream of the initiation codon 
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(Shine et al (1975) Nature 254:34). The SD sequence is thought to promote binding of 
mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of 
E. coli 16S rRNA (Steitz et al (1 979) "Genetic signals and nucleotide sequences in 
messenger RNA." In Biological Regulation and Development: Gene Expression (ed. R.F. 
Goldberger)). To express eukaiyotic genes and prokaryotic genes with weak ribosome- 
binding site, it is often necessary to optimize the distance between the SD sequence and the 
ATG of the eukaryotic gene (Sambrook et al. (1989) "Expression of cloned genes in 
Escherichia coli." In Molecular Cloning: A Laboratory Manual). 

A DNA molecule may be expressed intracellularly. A promoter sequence may be 
directly linked with the DNA molecule, in which case the first amino acid at the N-terminus 
will always be a methionine, which is encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen 
bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal 
peptidase (EPO Publ. No. 219 237). 

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein, is 
fused to the 5 1 end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the bacteriophage lambda cell 
gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The resulting 
fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the 
bacteriophage protein from the foreign gene (Nagai et al. (1984) Nature 30P:81O). Fusion 
proteins can also be made with sequences from the lacZ (Jia et al. (1987) Gene 60:197), trpE 
(Allen et al. (1987) J. Biotechnol. 5:93; Makoff et al. (1989) Gen. Microbiol. 135: 1 1), and 
Chey (EPO Publ. No. 324 647) genes. The DNA sequence at the junction of the two amino 
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion 
protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site 
for a processing enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin 
from the foreign protein. Through this method, native foreign protein can be isolated (Miller 
et al. (1989) Bio/Technology 7:698). 

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric 
DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment 
that provides for secretion of the foreign protein in bacteria (U.S. Patent 4,336,336). The 
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signal sequence fragment usually encodes a signal peptide comprised of hydrophobic amino 
acids which direct the secretion of the protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing 
sites, which can be cleaved either in vivo or in vitro encoded between the signal peptide 
fragment and the foreign gene. 

DNA encoding suitable signal sequences can be derived from genes for secreted 
bacterial proteins, such as the E. coli outer membrane protein gene (ompA) (Masui et ah 
(1983), in: Experimental Manipulation of Gene Expression', Ghrayeb et aL (1 984) EMBO J. 
3:2437) and the£. coli alkaline phosphatase signal sequence iphoA) (Oka et ah (1985) Proc. 
Natl Acad. Set 52:7212). As an additional example, the signal sequence of the alpha-amylase 
gene from various Bacillus strains can be used to secrete heterologous proteins from B. 
subtilis (Palva et ah (1982) Proc. Nath Acad. Set USA 79:5582; EPO Publ. No. 244 042). 

Usually, transcription termination sequences recognized by bacteria are regulatory 
regions located V to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Transcription termination sequences 
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop 
structures that aid in terminating transcription. Examples include transcription termination 
sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Usually, the above described components, comprising a promoter, signal sequence (if 
desired), coding sequence of interest, and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often maintained in a replicon, such as 
an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
bacteria The replicon will have a replication system, thus allowing it to be maintained in a 
prokaryotic host either for expression or for cloning and amplification. In addition, a replicon 
may be either a high or low copy number plasmid. A high copy number plasmid will 
generally have a copy number ranging from about 5 to about 200, and usually about 10 to 
about 150. A host containing a high copy number plasmid will preferably contain at least 
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number 
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vector may be selected, depending upon the effect of the vector and the foreign protein on the 
host. 

Alternatively, the expression constructs can be integrated into the bacterial genome 
with an integrating vector. Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the vector to integrate. Integrations 
appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various 
Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating 
vectors may also be comprised of bacteriophage or transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin 
(neomycin), and tetracycline (Davies et al (197S) Annu. Rev. Microbiol 52:469). Selectable 
markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and 
leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are usually comprised of a selectable market 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For example, 
expression vectors have been developed for, inter alia, the following bacteria: Bacillus 
subtilis (Palva et al (1982) Proc. Natl Acad. Sci. USA 7P:5582; EPO Publ. Nos. 036 259 and 
063 953; PCT Publ. No. WO 84/04541), Escherichia coli (Shimatake et al (1981) Nature 
292:128; Amann et al (1985) Gene 40:183; Studier et al. (1986) J. Mol Biol 189A 13: EPO 
Publ. Nos. 036 776, 136 829 and 136 907), Streptococcus cremoris (Powell et al (1988) Appl 
Environ. Microbiol 54:655); Streptococcus lividans (Powell et al (1988) Appl Environ. 
Microbiol 54:655\ Streptomyces lividans (U.S. Patent 4,745,056). 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, 
and usually include either the transformation of bacteria treated with CaCl 2 or other agents, 
such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by 
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electroporation. Transformation procedures usually vary with the bacterial species to be 
transformed. (See e.g., use of Bacillus: Masson et al (1989) FEMS Microbiol Lett. 60:273; 
Palva et al. (1982) Proa Natl. Acad. Sci. USA 7P:5582; EPO Publ. Nos. 036 259 and 063 953; 
PCT Publ. No. WO 84/04541; use of Campylobacter: Miller et al. (1988) Proc. Natl. Acad. 
Sci. 55:856; and Wang et al (1 990) J. Bacteriol 1 72:949; use of Escherichia coli: Cohen et 
al (1973) Proc. Natl. Acad. Sci. 6P:2110; Dower et al. (1988) Nucleic Acids Res. 16:6127; 
Kushner (1978) "An improved method for transformation of Escherichia coli with ColEl- 
derived plasmids. In Genetic Engineering: Proceedings of the International Symposium on 
Genetic Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et al. (1970) Mol. Biol. 
JJ:159; Taketo (1988) Biochim. Biophys. Acta 949'3\8\ use of Lactobacillus: Chassy et al 
(1987) FEMS Microbiol. Lett. 44:173; use of Pseudomonas: Fiedler et al (1988) Anal 
Biochem 1 70:38; use of Staphylococcus: Augustin et al (1990) FEMS Microbiol Lett. 
66:203; use of Streptococcus: Barany etal. (1980)7. Bacteriol 144:698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation, in: Streptococcal Genetics (ed. J. 
Ferretti and R. Curtiss III); Perry et al. (198\) Infect. Immun. 52:1295; Powell et al (1988) 
Appl Environ. Microbiol 54:655; Somkuti et al (1987) Proc. 4th Evr. Cong. Biotechnology 
7:412. 

v. Yeast Expression 

Yeast expression systems are also known to one of ordinary skill in the art. A yeast 
promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 5 1 
end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast 
promoter may also have a second domain called an upstream activator sequence (UAS), 
which, if present, is usually distal to the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby either enhancing or reducing 
transcription. 

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences 
encoding enzymes in the metabolic pathway provide particularly useful promoter sequences. 
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Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, 
glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase 
(GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and 
pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PH05 gene, encoding acid 
phosphatase, also provides useful promoter sequences (Myanohara et al. (1983) Proc. Natl 
Acad. Set USA 80:1). 

In addition, synthetic promoters which do not occur in nature also function as yeast 
promoters. For example, UAS sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to 
the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other 
examples of hybrid promoters include promoters which consist of the regulatory sequences of 
either fheADH2, GAL4, GAL 10, OR PHOS genes, combined with the transcriptional 
activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 1 64 556). 
Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin 
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of 
such promoters include, inter alia, (Cohen etal (1980) /Voc. Natl Acad. Sci. USA 77:1078; 
Henikoff etal (1981) Nature 255:835; Hollenberg etal (1981) Curr. Topics Microbiol 
Immunol 96:119; Hollenberg et al (1979) "The Expression of Bacterial Antibiotic Resistance 
Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical Environmental and 
Commercial Importance (eds. K.N. Timmis and A. Puhler); Mercerau-Puigalon et al (1980) 
Gene 77:163; Panthier et al (1980) Curr. Genet. 2:109;). 

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may 
be directly linked with the DNA molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, plant, baculovirus, and bacterial expression systems. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the yeast or human 
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superoxide dismutase (SOD) gene, can be linked at the 5' terminus of a foreign gene and 
expressed in yeast The DNA sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. See e.g., EPO Publ. No. 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that 
preferably retains a site for a processing enzyme (e.g. ubiquitin-specific processing protease) 
to cleave the ubiquitin from the foreign protein. Through this method, therefore, native 
foreign protein can be isolated (e.g., WO88/024066). 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provide for secretion in yeast of the foreign protein. Preferably, there 
are processing sites encoded between the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted yeast 
proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62:096,086) and the A-factor gene (U.S. Patent 4,588,684). Alternatively, leaders of non- 
yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Publ. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast 
alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types 
of alpha-factor fragments that can be employed include the full-length pre-pro alpha factor 
leader (about 83 amino acid residues) as well as truncated alpha- factor leaders (usually about 
25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO PubL 
No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for 
secretion include hybrid alpha-factor leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alphafactor. (See e.g., PCT Publ. No. WO 89/02463.) 

Usually, transcription termination sequences recognized by yeast are regulatory- 
regions located 3 1 to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
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sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 

Usually, the above described components, comprising a promoter, leader (if desired), 
coding sequence of interest, and transcription termination sequence, are put together into 
expression constructs. Expression constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a prokaryotic host for cloning and 
amplification. Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et aL 
(1979) Gene 5:17-24), pCl/1 (Brake et al (1984) Proc. Natl. Acad. Sci USA 57:4642-4646), 
and YRpl7 (Stinchcomb et al, (1982) J. Mol Biol 1 55:157). In addition, a replicon may be 
either a high or low copy number plasmid. A high copy number plasmid will generally have a 
copy number ranging from about 5 to about 200, and usually about 1 0 to about 1 50. A host 
containing a high copy number plasmid will preferably have at least about 10, and more 
preferably at least about 20. Enter a high or low copy number vector may be selected, 
depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake et 
al. y supra. 

Alternatively, the expression constructs can be integrated into the yeast genome with 
an integrating vector. Integrating vectors usually contain at least one sequence homologous to 
a yeast chromosome that allows the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. Integrations appear to result from 
recombinations between homologous DNA in the vector and the yeast chromosome (Orr- 
Weaver et al. (1983) Methods in Enzymol 707:228-245). An integrating vector may be 
directed to a specific locus in yeast by selecting the appropriate homologous sequence for 
inclusion in the vector. See Orr- Weaver et al, supra. One or more expression construct may 
integrate, possibly affecting levels of recombinant protein produced (Rine et al (1983) Proc. 
Natl Acad. Sci. USA 50:6750). The chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results in the integration of the entire vector, or 
two segments homologous to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the stable integration of only the 
expression construct. 
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Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as ADE2, HIS4, LEU2, TRP1, and ALG7, and the G418 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G418, respectively. In addition, a suitable 
selectable marker may also provide yeast with the ability to grow in the presence of toxic 
compounds, such as metal. For example, the presence of CUP1 allows yeast to grow in the 
presence of copper ions (Butt et al (1987) Microbiol, Rev. 57:351). 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For example, 
expression vectors and methods of introducing exogenous DNA into yeast hosts have been 
developed for, inter alia, the following yeasts: Candida albicans (Kurtz, et al. (1986) Mol 
Cell Biol 6:142); Candida maltosa (Kunze, et al (1985) J. Basic Microbiol 25:141); 
Hansenulapolymorpha (Gleeson, et al (1986) J. Gen. Microbiol 732:3459; Roggenkamp et 
al. (1986) Mol Gen. Genet. 202:302); Kluyveromyces fragilis (Das, et al (1984) /. Bacteriol 
755:1 165); Kluyveromyces lactis (De Louvencourt et al. (1983) /. Bacteriol 154:737; Van 
den Berg et al (1990) Bio/Technology <?:135); Pichia guillerimondii (Kunze et al (1985) J. 
Basic Microbiol 25:141); Pichia pastoris (Cregg, et al (1985) Mol Cell Biol 5:3376; U.S. 
Patent Nos. 4,837,148 and 4,929,555); Saccharomyces cerevisiae (Hinnen et al. (1978) Proc. 
Natl Acad. Sci. USA 75:1929; Ito et al (1983)7. Bacteriol 153:163); Schizosaccharomyces 
pombe (Beach and Nurse (1981) Nature 300:706); and Yarrowia lipolytica (Davidow, et al. 
(1985) Curr. Genet. 70:380471 Gaillardin, etal (1985) Curr. Genet 70:49). 

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, 
and usually include either the transformation of spheroplasts or of inlact yeast cells treated 
with alkali cations. Transformation procedures usually vary with the yeast species to be 
transformed. See e.g., [Kurtz et al. (1986) Mol. Cell Biol 5:142; Kunze*/ a/. (1985) J. Basic 
Microbiol 25:141; Candida]; [Gleeson etal (1986)7. Gen. Microbiol 752:3459; 
Roggenkamp et al (1986) Mol Gen. Genet 202:302; Hansenula]; [Das et al (1984) J. 
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BacterioL J 58:1 165; De Louvencourt et al, (1983) J, Bacteriol. 154:1 165; Van den Berg et al 
(1990) Bio/Technology 5:135; Kluyveromyces]; [Cregg et al (1985) Mol Cell Biol 5:3376; 
Kunze et al (1985) 1 Basic Microbiol 25:141; U.S. Patent Nos. 4,837,148 and 4,929,555; 
Pichia]; [Hinnen et al (1978) Proc. Natl Acad. Set USA 75;1929; lto et al (1983) J. 
Bacteriol 753:163 Saccharomyces]; [Beach and Nurse (1981) Nature 300:706; 
Schizosaccharomyces]; [Davidow et al (1985) Cum Genet. 70:39; Gaillardin et al (1985) 
Curr. Genet. 70:49; Yarrowia]. 

Definitions 

A composition containing X is "substantially free of Y when at least 85% by weight 
of the total X+Y in the composition is X. Preferably, X comprises at least about 90% by 
weight of the total of X+Y in the composition, more preferably at least about 95% or even 
99% by weight. 

A "conserved" Neisseria amino acid fragment or protein is one that is present in a 
particular Neisserial protein in at least x% of Neisseria. The value of x may be 50% or more, 
e.g., 66%, 75%, 80%, 90%, 95% or even 1 00% (i.e. the amino acid is found in the protein in 
question in all Neisseria). In order to determine whether an animo acid is "conserved" in a 
particular Neisserial protein, it is necessary to compare that amino acid residue in the 
sequences of the protein in question from a plurality of different Neisseria (a reference 
population). The reference population may include a number of different Neisseria species 
or may include a single species. The reference population may include a number of different 
serogroups of a particular species or a single serogroup. A preferred reference population 
consists of the 5 most common Neisseria strains. 

The term "heterologous" refers to two biological components that are not found 
together in nature. The components may be host cells, genes, or regulatory regions, such as 
promoters. Although the heterologous components are not found together in nature, they can 
function together, as when a promoter heterologous to a gene is operably linked to the gene. 
Another example is where a Neisserial sequence is heterologous to a mouse host cell. 

"Epitope" means antigenic determinant, and may elicit a cellular and/or humoral 
response. 

Conditions for "high stringency" are 65 degrees C in 0.1 xSSC 0.5% SDS solution. 
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An "origin of replication" is a polynucleotide sequence that initiates and regulates 
replication of polynucleotides, such as an expression vector. The origin of replication behaves 
as an autonomous unit of polynucleotide replication within a cell, capable of replication under 
its own control. An origin of replication may be needed for a vector to replicate in a particular 
host cell. With certain origins of replication, an expression vector can be reproduced at a high 
copy number in the presence of the appropriate proteins within the cell. Examples of origins 
are the autonomously replicating sequences, which are effective in yeast; and the viral 
T-antigen, effective in COS-7 cells. 

A "mutant" sequence is defined as a DNA, RN A or amino acid sequence differing 
from but having homology with the native or disclosed sequence. Depending on the 
particular sequence, the degree of homology (sequence identity) between the native or 
disclosed sequence and the mutant sequence is preferably greater than 50% (e.g., 60%, 70%, 
80%, 90%, 95%, 99% or more) which is calculated as described above. As used herein, an 
"allelic variant" of a nucleic acid molecule, or region, for which nucleic acid sequence is 
provided herein is a nucleic acid molecule, or region, that occurs at essentially the same locus 
in the genome of another or second isolate, and that, due to natural variation caused by, for 
example, mutation or recombination, has a similar but not identical nucleic acid sequence. A 
coding region allelic variant typically encodes a protein having similar activity to that of the 
protein encoded by the gene to which it is being compared. An allelic variant can also 
comprise an alteration in the 5 f or 3' untranslated regions of the gene, such as in regulatory 
control regions, (see, for example, U.S. Patent 5,753,235). 

Antibodies 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
composed of at least one antibody combining site. An "antibody combining site" is the 
three-dimensional binding space with an internal surface shape and charge distribution 
complementary to the features of an epitope of an antigen, which allows a binding of the 
antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid 
antibodies, chimeric antibodies, humanized antibodies, altered antibodies, univalent 
antibodies, Fab proteins, and single domain antibodies. 

Antibodies against the proteins of the invention are useful for affinity 
chromatography, immunoassays, and distinguishing/identifying Neisseria menB proteins. 
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Antibodies elicited against the proteins of the present invention bind to antigenic polypeptides 
or proteins or protein fragments that are present and specifically associated with strains of 
Neisseria meningitidis menB. In some instances, these antigens may be associated with 
specific strains, such as those antigens specific for the menB strains. The antibodies of the 
invention may be immobilized to a matrix and utilized in an immunoassay or on an affinity 
chromatography column, to enable the detection and/or separation of polypeptides, proteins or 
protein fragments or cells comprising such polypeptides, proteins or protein fragments. 
Alternatively, such polypeptides, proteins or protein fragments may be immobilized so as to 
detect antibodies bindably specific thereto. 

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the protein is first used to immunize a suitable 
animal, preferably a mouse, rat, rabbit or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of 
labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing 
or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 ng/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may alternatively generate antibodies by 
in vitro immunization using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. Polyclonal antisera is obtained by 
bleeding the immunized animal into a glass or plastic container, incubating the blood at 25°C 
for one hour, followed by incubating at 4°C for 2-18 hours. The serum is recovered by 
centrifiigation (e.g., l,000g for 10 minutes). About 20-50 ml per bleed may be obtained from 
rabbits. 

Monoclonal antibodies are prepared using the standard method of Kohler & Milstein 
{Nature (1 975) 256:495-96), or a modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract serum, 
the spleen (and optionally several large lymph nodes) is removed and dissociated into single 
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent 
cells) by applying a cell suspension to a plate or well coated with the protein antigen. B-cells 
that express membrane-bound immunoglobulin specific for the antigen bind to the plate, and 
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are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fiise with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are assayed for the production of 
antibodies which bind specifically to the immunizing antigen (and which do not bind to 
unrelated antigens). The selected MAb-secrcting hybridomas are then cultured either in vitro 
(e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive 
atoms (particularly 32 P and 125 I), electron-dense reagents, enzymes, and ligands having 
specific binding partners. Enzymes are typically detected by their activity. For example, 
horseradish peroxidase is usually detected by its ability to convert 
3,3\5,5'-tetramethylbenzidine (TMB) to a blue pigment, quantifiable with a 
spectrophotometer. "Specific binding partner" refers to a protein capable of binding a ligand 
molecule with high specificity, as for example in the case of an antigen and a monoclonal 
antibody specific therefor. Other specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the numerous receptor-ligand couples known in the art. It 
should be understood that the above description is not meant to categorize the various labels 
into distinct classes, as the same label may serve in several different modes. For example, 125 I 
may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or 
as antigen for a MAb. Further, one may combine various labels for desired effect. For 
example, MAbs and avidin also require labels in the practice of this invention: thus, one 
might label a MAb with biotin, and detect its presence with avidin labeled with 125 I, or with 
an anti-biotin MAb labeled with HRP. Other permutations and possibilities will be readily 
apparent to those of ordinary skill in the art, and are considered as equivalents within the 
scope of the instant invention. 

Antigens, immunogens, polypeptides, proteins or protein fragments of the present 
invention elicit formation of specific binding partner antibodies. These antigens, 
immunogens, polypeptides, proteins or protein fragments of the present invention comprise 
immunogenic compositions of the present invention. Such immunogenic compositions may 
further comprise or include adjuvants, carriers, or other compositions that promote or enhance 
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or stabilize the antigens, polypeptides, proteins or protein fragments of the present invention. 
Such adjuvants and carriers will be readily apparent to those of ordinary skill in the art. 

Pharmaceutical Compositions 

Pharmaceutical compositions can comprise (include) either polypeptides, antibodies, 
or nucleic acid of the invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature, when given to a patient that is febrile. The 
precise effective amount for a subject will depend upon the subject's size and health, the 
nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be determined by routine 
experimentation and is within the judgment of the clinician. 

For purposes of the present invention, an effective dose will be from about 0.01 mg/ 
kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to 
which it is administered. 

A pharmaceutical composition can also contain a pharmaceutical^ acceptable carrier. 
The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a 
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
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organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutical ly acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutical^ acceptable carriers in therapeutic compositions may contain liquids 
such as water, saline, glycerol and ethanol. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
Typically, the therapeutic compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Liposomes are included within the definition of a 
pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be 
treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneal^, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Other 
modes of administration include oral and pulmonary administration, suppositories, and 
transdermal and transcutaneous applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Vaccines 

Vaccines according to the invention may either be prophylactic (i.e., to prevent 
infection) or therapeutic (i.e., to treat disease after infection). 

Such vaccines comprise immunizing antigen(s) or immunogen(s), immunogenic 
polypeptide, protein(s) or protein fragments, or nucleic acids (e.g., ribonucleic acid or 
deoxyribonucleic acid), usually in combination with "pharmaceutically acceptable carriers,'* 
which include any carrier that does not itself induce the production of antibodies harmful to 
the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 
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liposomes), and inactive virus particles. Such carriers are well known to those of ordinary 
skill in the art. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the immunogen or antigen may be conjugated to a bacterial 
toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. pylori, etc. pathogens. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, 
aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific 
immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall 
components), such as for example (a) MF59 (PCT Publ. No. WO 90/14837), containing 5% 
Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of 
MTP-PE (see below), although not required) formulated into submicron particles using a 
microfluidizer such as Model 1 10Y microfluidizer (Microfluidics, Newton, MA), (b) SAF, 
containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP 
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi lmmunochem, Hamilton, 
MT) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall 
components from the group consisting of monophosphorylipid A (MPL), trehalose 
dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™); 

(3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be 
used or particles generated therefrom such as ISCOMs (immunostimulating complexes); 

(4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IF A); 

(5) cytokines, such as interleukins (e.g., IL-1, EL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), 
interferons (e.g., gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 
necrosis factor (TNF), etc; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such 
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), particularly 
LT-K63, LT-R72, CT-S109, PT-K9/G129; see, e.g., WO 93/13302 and WO 92/19265; and 
(7) other substances that act as immunostimulating agents to enhance the effectiveness of the 
composition. Alum and MF59 are preferred. 

As mentioned above, muramyl peptides include, but are not limited to, N-acetyl- 
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-noimuramyl-L-alanyl-D- 
isoglutamine (nor-MDP), N-acetylmuramyl-L-alany l-D-isoglutaminy I-L-alanine-2-( 1 '-2'- 
dipalmitoyl-s/?-glycero-3-huydroxyphosphoryloxy)-ethylamine (MTP-PE), etc. 
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The vaccine compositions comprising immunogenic compositions (e.g., which may 
include the antigen, pharmaceutical^ acceptable carrier, and adjuvant) typically will contain 
diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such 
as wetting or emulsifying agents, pH buffering substances, and the like, may be present in 
such vehicles. Alternatively, vaccine compositions comprising immunogenic compositions 
may comprise an antigen, polypeptide, protein, protein fragment or nucleic acid in a 
pharmaceutical^ acceptable carrier. 

More specifically, vaccines comprising immunogenic compositions comprise an 
immunologically effective amount of the immunogenic polypeptides, as well as any other of 
the above-mentioned components, as needed. By "immunologically effective amount", it is 
meant that the administration of that amount to an individual, either in a single dose or as part 
of a series, is effective for treatment or prevention. This amount varies depending upon the 
health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g., nonhuman primate, primate, etc.), the capacity of the 
individuals immune system to synthesize antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 
relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the vaccine compositions or immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or 
suspension in, liquid vehicles prior to injection may also be prepared. The preparation also 
may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed 
above under pharmaceutical^ acceptable carriers. 

The immunogenic compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary formulations, suppositories, and 
transdermal and transcutaneous applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other 
immunoregulatory agents. 

As an alternative to protein-based vaccines, DNA vaccination may be employed (e.g., 
Robinson & Torres (1997) Seminars in Immunology 9:271-283; Donnelly et al (1997) Annu 
Rev Immunol 15:617-648). 
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Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs, including a coding sequence of a 
therapeutic of the invention, to be delivered to the mammal for expression in the mammal, can 
be administered either locally or systemically. These constructs can utilize viral or non- viral 
vector approaches in in vivo or ex vivo modality. Expression of such coding sequence can be 
induced using endogenous mammalian or heterologous promoters. Expression of the coding 
sequence in vivo can be either constitutive or regulated. 

The invention includes gene delivery vehicles capable of expressing the contemplated 
nucleic acid sequences. The gene delivery vehicle is preferably a viral vector and, more 
preferably, a retroviral, adenoviral, adeno-associated viral (AAV), herpes viral, or alphavirus 
vector. The viral vector can also be an astrovirus, coronavirus, orthomyxovirus, papovavirus, 
paramyxovirus, parvovirus, picornavirus, poxvirus, or togavirus viral vector. See generally, 
Jolly (1994) Cancer Gene Therapy 1 :51-64; Kimura (1994) Human Gene Therapy 5:845-852, 
Connelly (1995) Human Gene Therapy 6:1 85-193; and Kaplitt (1994) Nature Genetics 
6:148-153. 

Retroviral vectors are well known in the art, including B, C and D type retroviruses, 
xenotropic retroviruses (for example, NZB-X1, NZB-X2 and NZB9-1 (see O'Neill (1985) J. 
Virol 53:160) polytropic retroviruses e.g., MCF and MCF-MLV (see Kelly (1983) J. Virol 
45:291), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold 
Spring Harbor Laboratory, 1985. 

Portions of the retroviral gene therapy vector may be derived from different 
retroviruses. For example, retrovector LTRs may be derived from a Murine Sarcoma Virus, a 
tRNA binding site from a Rous Sarcoma Virus, a packaging signal from a Murine Leukemia 
Virus, and an origin of second strand synthesis from an Avian Leukosis Virus. 

These recombinant retroviral vectors may be used to generate transduction competent 
retroviral vector particles by introducing them into appropriate packaging cell lines (see US 
patent 5,591,624). Retrovirus vectors can be constructed for site-specific integration into host 
cell DNA by incorporation of a chimeric integrase enzyme into the retroviral particle (see 
W096/37626). It is preferable that the recombinant viral vector is a replication defective 
recombinant virus. 



WO 99/57280 



38 



PCT/US99/09346 



Packaging cell lines suitable for use with the above-described retrovirus vectors are 
well known in the art, are readily prepared (see WO95/30763 and WO92/05266), and can be 
used to create producer cell lines (also termed vector cell lines or "VCLs") for the production 
of recombinant vector particles. Preferably, the packaging cell lines are made from human 
parent cells (e.g., HT1080 cells) or mink parent cell lines, which eliminates inactivation in 
human serum. 

Preferred retroviruses for the construction of retroviral gene therapy vectors include 
Avian Leukosis Virus, Bovine Leukemia, Virus, Murine Leukemia Virus, Mink-Cell 
Focus-Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis Virus and Rous 
Sarcoma Virus. Particularly preferred Murine Leukemia Viruses include 4070A and 1504A 
(Hartley and Rowe (1976) J Virol 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC 
No. VR-245), Graffi, Gross (ATCC Nol VR-590), Kirsten, Harvey Sarcoma Virus and 
Rauscher (ATCC No. VR-998) and Moloney Murine Leukemia Virus (ATCC No. VR-190). 
Such retroviruses may be obtained from depositories or collections such as the American 
Type Culture Collection ("ATCC") in Rockville, Maryland or isolated from known sources 
using commonly available techniques. 

Exemplary known retroviral gene therapy vectors employable in this invention include 
those described in patent applications GB2200651, EP04 15731, EP0345242, EP0334301, 
WO89/02468; WO89/05349, WO89/09271, WO90/02806, WO90/07936, WO94/03622, 
W093/25698, W093/25234, WO93/11230, WO93/10218, WO91/02805, WO91/02825, 
WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US 
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 
53:962-967; Ram (1993) Cancer Res 53 (1993) 83-88; Takamiya(1992) JNeurosci Res 
33:493-503; Baba (1993) JNeurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) 
Proc Natl Acad Sci 81 :6349; and Miller (1990) Human Gene Therapy 1 . 

Human adenoviral gene therapy vectors are also known in the art and employable in 
this invention. See, for example, Berkner (1988) Bioiechniques 6:616 and Rosenfeld (1991) 
Science 252:431, and WO93/07283, WO93/06223, and WO93/07282. Exemplary known 
adenoviral gene therapy vectors employable in this invention include those described in the 
above referenced documents and in W094/12649, WO93/03769, W093/19191, 
W094/28938, W095/1 1984, WO95/00655, WO95/27071, W095/29993, W095/34671, 
WO96/05320, WO94/08026, WO94/11506, WO93/06223, W094/24299, WO95/14102, 
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W095/24297, WO95/02697, W094/28152, W094/24299, WO95/09241, WO95/25807, 
WO95/05835, W094/18922 and WO95/09654. Alternatively, administration of DNA linked 
to killed adenovirus as described in Curiel (1992) Hum, Gene Then 3:147-154 may be 
employed. The gene delivery vehicles of the invention also include adenovirus associated 
virus (AAV) vectors. Leading and preferred examples of such vectors for use in this invention 
are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred AAV 
vectors comprise the two AAV inverted terminal repeats in which the native D-sequences are 
modified by substitution of nucleotides, such that at least 5 native nucleotides and up to 18 
native nucleotides, preferably at least 10 native nucleotides up to 18 native nucleotides, most 
preferably 10 native nucleotides are retained and the remaining nucleotides of the D-sequence 
are deleted or replaced with non-native nucleotides. The native D-sequences of the AAV 
inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted 
terminal repeat (i.e., there is one sequence at each end) which are not involved in HP 
formation. The non-native replacement nucleotide may be any nucleotide other than the 
nucleotide found in the native D-sequence in the same position. Other employable exemplary 
AAV vectors are pWP-19, pWN-1 , both of which are disclosed in Nahreini (1993) Gene 
124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987) J. 
Virol 61 :3096). Another exemplary AAV vector is the Double-D ITR vector. Construction of 
the Double-D ITR vector is disclosed in US Patent 5,478,745. Still other vectors are those 
disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent 5,139,941, Chartejee US 
Patent 5,474,935, and Kotin W094/288157. Yet a further example of an AAV vector 
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and 
albumin promoter and directs expression predominantly in the liver. Its structure and 
construction are disclosed in Su (1996) Human Gene Therapy 7:463-470. Additional AAV 
gene therapy vectors are described in US 5,354,678, US 5,173,414, US 5,139,941, and US 
5,252,479. 

The gene therapy vectors comprising sequences of the invention also include herpes 
vectors. Leading and preferred examples are herpes simplex virus vectors containing a 
sequence encoding a thymidine kinase polypeptide such as those disclosed in US 5,288,641 
and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include 
HFEM/ICP6-LacZ disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller 
(1988) Science 241 :1 667-1 669 and in WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ 
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described in Fink (l992)Human Gene Therapy 3:11-19 and HSV 7134, 2 RH 105 and GAL4 
described in EP 0453242 (Breakefield), and those deposited with the ATCC as accession 
numbers ATCC VR-977 and ATCC VR-260. 

Also contemplated are alpha virus gene therapy vectors that can be employed in this 
invention. Preferred alpha virus vectors are Sindbis viruses vectors. Togaviruses, Semliki 
Forest virus (ATCC VR-67; ATCC VR-1247), Middleberg virus (ATCC VR-370), Ross 
River virus (ATCC VR-373; ATCC VR-1246), Venezuelan equine encephalitis vims (ATCC 
VR923; ATCC VR-1250; ATCC VR-1249; ATCC VR-532), and those described in US 
patents 5,091,309, 5,217,879, and WO92/10578. More particularly, those alpha virus vectors 
described in U.S. Serial No. 08/405,627, filed March 15, 1995,W094/21792, WO92/10578, 
WO95/07994, US 5,091,309 and US 5,217,879 arc employable. Such alpha viruses may be 
obtained from depositories or collections such as the ATCC in Rockviile, Maryland or 
isolated from known sources using commonly available techniques. Preferably, alphavirus 
vectors with reduced cytotoxicity are used (see USSN 08/679640). 

DNA vector systems such as eukarytic layered expression systems are also useful for 
expressing the nucleic acids of the invention. SeeWO95/07994 for a detailed description of 
eukaryotic layered expression systems. Preferably, the eukaryotic layered expression systems 
of the invention are derived from alphavirus vectors and most preferably from Sindbis viral 
vectors. 

Other viral vectors suitable for use in the present invention include those derived from 
poliovirus, for example ATCC VR-58 and those described in Evans, Nature 339 (1989) 385 
and Sabin (1973) 7. Biol. Standardization 1:115; rhinovirus, for example ATCC VR-1 1 10 and 
those described in Arnold (1990) J Cell Biochem L401; pox viruses such as canary pox virus 
or vaccinia virus, for example ATCC VR-1 1 1 and ATCC VR-2010 and those described in 
Fisher-Hoch (1989) Proc Natl Acad Sci 86:317; Flexner (1989) Ann NYAcadSci 569:86, 
Flexner (1990) Vaccine 8:17; in US 4,603,112 and US 4,769,330 and WO89/01973; SV40 
virus, for example ATCC VR-305 and those described in Mulligan (1979) Nature 277:108 
and Madzak (1992) J Gen Virol 73:1533; influenza vims, for example ATCC VR-797 and 
recombinant influenza viruses made employing reverse genetics techniques as described in 
US 5,166,057 and in Enami (1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) 
J Virol 65:271 1-2713 and Luytjes (1989) Cell 59:1 10, (see also McMichael (1983) NEJMed 
309:1 3, and Yap (1978) Nature 273:238 and Nature (1979) 277:108); human 
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immunodeficiency virus as described in EP-0386882 and in Buchschacher (1992) J. Virol 
66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP- 
0440219; Aura virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 
and ATCC VR-1240; Cabassou virus, for example ATCC VR-922; Chikungunya virus, for 
example ATCC VR-64 and ATCC VR-1241 ; Fort Morgan Virus, for example ATCC 
VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus, 
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for 
example ATCC VR-580 and ATCC VR-1244; Ndumu virus, for example ATCC VR-371 ; 
Pixuna virus, for example ATCC VR-372 and ATCC VR-1245; Tonate virus, for example 
ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for example ATCC 
VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC 
VR-375; O'Nyong virus, Eastern encephalitis virus, for example ATCC VR-65 and ATCC 
VR-1242; Western encephalitis virus, for example ATCC VR-70, ATCC VR-1251, ATCC 
VR-622 and ATCC VR-1252; and coronavirus, for example ATCC VR-740 and those 
described in Hamre ( 1 966) Proc Soc Exp Biol Med 121:190. 

Delivery of the compositions of this invention into cells is not limited to the above 
mentioned viral vectors. Other delivery methods and media may be employed such as, for 
example, nucleic acid expression vectors, polycationic condensed DNA linked or unlinked to 
killed adenovirus alone, for example see US Serial No. 08/366,787 : filed December 30, 1994 
and Curiel (1992) Hum Gene Ther 3:147-154 ligand linked DNA, for example see Wu (1989) 
J Biol Chem 264:16985-16987, eucaryotic cell delivery vehicles cells, for example see US 
Serial No.08/240,030, filed May 9, 1994, and US Serial No. 08/404,796, deposition of 
photopolymerized hydrogel materials, hand-held gene transfer particle gun, as described in 
US Patent 5,149,655, ionizing radiation as described in US5,206,152 and in W092/1 1033, 
nucleic charge neutralization or fiision with cell membranes. Additional approaches are 
described in Philip (1994) Mol Cell Biol 14:2411-2418 and in Woffendin (1994) Proc Natl 
AcadSci 91:1581-1585. 

Particle mediated gene transfer may be employed, for example see US Serial No. 
60/023,867. Briefly, the sequence can be inserted into conventional vectors that contain 
conventional control sequences for high level expression, and then incubated with synthetic 
gene transfer molecules such as polymeric DNA-binding cations like polylysine, protamine, 
and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & 
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Wu (1987) 1 Biol Chem. 262:4429-4432, insulin as described in Hucked (1990) Biochem 
Pharmacol 40:253-263, galactose as described in Plank (1992) Bioconjugate Chem 
3:533-539, lactose or transferrin. 

Naked DNA may also be employed to transform a host cell. Exemplary naked DNA 
introduction methods are described in WO 90/1 1092 and US 5,580,859. Uptake efficiency 
may be improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method may be improved 
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. 

Liposomes that can act as gene delivery vehicles are described in U.S. 5,422,120, 
W095/13796, W094/23697, W09 1/14445 and EP-524,968. As described in USSN. 
60/023,867, on non-viral delivery, the nucleic acid sequences encoding a polypeptide can be 
inserted into conventional vectors that contain conventional control sequences for high level 
expression, and then be incubated with synthetic gene transfer molecules such as polymeric 
DNA-binding cations like polylysine, protamine, and albumin, linked to cell targeting ligands 
such as asialoorosomucoid, insulin, galactose, lactose, or transferrin. Other delivery systems 
include the use of liposomes to encapsulate DNA comprising the gene under the control of a 
variety of tissue-specific or ubiquitously-active promoters. Further non- viral delivery suitable 
for use includes mechanical delivery systems such as the approach described in WofTendin et 
al (1994) Proc. Nail Acad. ScL USA 91(24):1 1581-1 1585. Moreover, the coding sequence 
and the product of expression of such can be delivered through deposition of 
photopolymerized hydrogel materials. Other conventional methods for gene delivery that can 
be used for delivery of the coding sequence include, for example, use of hand-held gene 
transfer particle gun, as described in U.S. 5,149,655; use of ionizing radiation for activating 
transferred gene, as described in U.S. 5,206,152 and W092/1 1033. 

Exemplary liposome and polycationic gene delivery vehicles are those described in 
US 5,422,120 and 4,762,915; inWO 95/13796; W094/23697; and W09 1/14445; in EP- 
0524968; and in Stryer, Biochemistry, pages 236-240 (1975) W.H. Freeman, San Francisco; 
Szoka (1980) Biochem Biophys Acta 600:1 ; Bayer (1 979) Biochem Biophys Acta 550:464; 
Rivnay (1987) Meth Enzymol 149:1 19; Wang (1987) Proc Natl Acad Sci 84:7851; Plant 
(1989) Anal Biochem 176:420. 
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A polynucleotide composition can comprises therapeutically effective amount of a 
gene therapy vehicle, as the term is defined above. For purposes of the present invention, an 
effective dose will be from about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg 
of the DNA constructs in the individual to which it is administered. 

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be 
administered (1) directly to the subject; (2) delivered ex vivo, to cells derived from the 
subject; or (3) in vitro for expression of recombinant proteins. The subjects to be treated can 
be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or lesion. 
Other modes of administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject 
are known in the art and described in eg. W093/14778. Examples of cells useful in ex vivo 
applications include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by the following procedures, for example, dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection 
of the DNA into nuclei, all well known in the art. 

Polynucleotide and polypeptide pharmaceutical compositions 

In addition to the pharmaceutically acceptable carriers and salts described above, the 
following additional agents can be used with polynucleotide and/or polypeptide compositions. 
A.Polypeptides 

One example are polypeptides which include, without limitation: asioloorosomucoid 
(ASOR); transferrin; asialoglycoproteins; antibodies; antibody fragments; ferritin; 
interleukins; interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 
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granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating factor 
(M-CSF), stem cell factor and erythropoietin. Viral antigens, such as envelope proteins, can 
also be used. Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 

B . Hormones, Vitamins, Etc. 

Other groups that can be included are, for example: hormones, steroids, androgens, 
estrogens, thyroid hormone, or vitamins, folic acid. 

C. Polyalkylenes, Polysaccharides, etc. 

Also, polyalkylene glycol can be included with the desired polynucleotides or 
polypeptides. In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di-, or polysaccarides can be included. In a preferred embodiment of this 
aspect, the polysaccharide is dextran or DEAE-dextran. Also, chitosan and 
poly(lactide-co-glycolide) 

D. Lipids, and Liposomes 

The desired polynucleotide or polypeptide can also be encapsulated in lipids or 
packaged in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to 
stably bind or entrap and retain nucleic acid. The ratio of condensed polynucleotide or 
polypeptide to lipid preparation can vary but will generally be around 1:1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight (1991) Biochim. Biophys. Acta. 1097:1-17; 
Straubinger (1983) Meth. EnzymoL 101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively 
charged), anionic (negatively charged) and neutral preparations. Cationic liposomes have 
been shown to mediate intracellular delivery of plasmid DNA (Feigner (1987) Proc. Natl 
Acad. Sci. USA 84:7413-7416); mRNA (Malone (1989) Proc. Natl. Acad, Scu USA 
86:6077-6081); and purified transcription factors (Debs (1990)7. Biol. Chem. 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, 
N[l-2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, Feigner 
supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and 
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DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from readily available 
materials using techniques well known in the art. See, eg. Szoka (1978) Proc. Natl Acad. ScL 
USA 75:4194-4198; WO90/1 1092 for a description of the synthesis of DOTAP 
(1 ,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, AL), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See eg. Straubinger (1983) Meth. 
Immunol 101:512-527; Szoka (1978) Proc. Natl Acad. ScL USA 75:4194-4198; 
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell 17:77); Deamer 
& Bangham (1976) Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. 
Commun. 76:836; Fraley (1979) Proc. Natl Acad. Sci. USA 76:3348); Enoch & Strittmatter 
(1979) Proc. Natl. Acad. ScL USA 76:145; Fraley (1980) J. Biol Chem. (1980)255:10431; 
Szoka & Papahadjopoulos (1978) Proc. Natl Acad. ScL USA 75:145; and Schaefer-Ridder 
(1982) Science 215:166. 
E.Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide or polypeptide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, IDL, LDL, 
and VLDL. Mutants, fragments, or fusions of these proteins can also be used. Also, 
modifications of naturally occurring lipoproteins can be used, such as acetylated LDL. These 
lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein 
receptors. Preferably, if lipoproteins are including with the polynucleotide to be delivered, no 
other targeting ligand is included in the composition. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. The protein 
portion are known as apoproteins. At the present, apoproteins A, B, C, D, and E have been 
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isolated and identified. At least two of these contain several proteins, designated by Roman 
numerals, AI, AH, A1V; CI, CII, CHI. 

A lipoprotein can comprise more than one apoprotein. For example, naturally 
occurring chylomicrons comprises of A, B, C, and E, over time these lipoproteins lose A and 
acquire C and E apoproteins. VLDL comprises A, B, C, and E apoproteins, LDL comprises 
apoprotein B; and HDL comprises apoproteins A, C, and E. 

The amino acid of these apoproteins are known and are described in, for example, 
Breslow (19S5) Annu Rev. Biochem 54:699; Law (1986) Adv. Exp Med. Biol 151:162; Chen 
(1986)75/0/ Chem 261:12918; Kane (1980) Proc Natl Acad Sci USA 77:2465; and Utermann 
(1984) Hum Genet 65:232. 

Lipoproteins contain a variety of lipids including, triglycerides, cholesterol (free and 
esters), and phopholipids. The composition of the lipids varies in naturally occurring 
lipoproteins. For example, chylomicrons comprise mainly triglycerides. A more detailed 
description of the lipid content of naturally occurring lipoproteins can be found, for example, 
in Meth. Enzymol. 128 (1986). The composition of the lipids are chosen to aid in 
conformation of the apoprotein for receptor binding activity. The composition of lipids can 
also be chosen to facilitate hydrophobic interaction and association with the polynucleotide 
binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by ultracentrifiigation, for 
instance. Such methods are described in Meth. Enzymol. (supra)] Pitas (1980) J. Biochem. 
255:5454-5460 and Mahey (1979) J Clin. Invest 64:743-750. 

Lipoproteins can also be produced by in vitro or recombinant methods by expression 
of the apoprotein genes in a desired host cell. See, for example, Atkinson (1 986) Annu Rev 
Biophys Chem 15:403 and Radding (1958) Biochim Biophys Acta 30: 443. 

Lipoproteins can also be purchased from commercial suppliers, such as Biomedical 
Techniologies, Inc., Stoughton, Massachusetts, USA. 

Further description of lipoproteins can be found in Zuckermann et al., PCT. Appln. 
No. US97/14465. 
F.Polycationic Agents 

Polycationic agents can be included, with or without lipoprotein, in a composition 
with the desired polynucleotide or polypeptide to be delivered. 
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Polycationic agents, typically, exhibit a net positive charge at physiological relevant 
pH and are capable of neutralizing the electrical charge of nucleic acids to facilitate delivery 
to a desired location. These agents have both in vitro, ex vivo, and in vivo applications. 
Polycationic agents can be used to deliver nucleic acids to a living subject either 
intramuscularly, subcutaneously, etc. 

The following are examples of useful polypeptides as polycationic agents: polylysine, 
polyarginine, polyornithine, and protamine. Other examples include histones, protamines, 
human serum albumin, DNA binding proteins, non-histone chromosomal proteins, coat 
proteins from DNA viruses, such as (XI 74, transcriptional factors also contain domains that 
bind DNA and therefore may be useful as nucleic aid condensing agents. Briefly, 
transcriptional factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, 
Oct-1 , Oct-2, CREP, and TFIID contain basic domains that bind DNA sequences. 

Organic polycationic agents include: spermine, spermidine, and purtrescine. 

The dimensions and of the physical properties of a polycationic agent can be 
extrapolated from the list above, to construct other polypeptide polycationic agents or to 
produce synthetic polycationic agents. 

Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, DEAE-dextran, 
polybrene. LipofectinD, and lipofectAMINED are monomers that form polycationic 
complexes when combined with polynucleotides or polypeptides. 

Immunodiagnostic Assays 

Neisserial antigens of the invention can be used in immunoassays to detect antibody 
levels (or, conversely, anti-Neisserial antibodies can be used to detect antigen levels). 
Immunoassays based on well defined, recombinant antigens can be developed to replace 
invasive diagnostics methods. Antibodies to Neisserial proteins within biological samples, 
including for example, blood or serum samples, can be detected. Design of the immunoassays 
is subject to a great deal of variation, and a variety of these are known in the art. Protocols for 
the immunoassay may be based, for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, use solid supports, or may be by 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. 
Assays which amplify the signals from the probe are also known; examples of which are 
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assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such 
as ELISA assays. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are 
constructed by packaging the appropriate materials, including the compositions of the 
invention, in suitable containers, along with the remaining reagents and materials (for 
example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well as 
suitable set of assay instructions. 

Nucleic Acid Hybridisation 

"Hybridization" refers to the association of two nucleic acid sequences to one another 
by hydrogen bonding. Typically, one sequence will be fixed to a solid support and the other 
will be free in solution. Then, the two sequences will be placed in contact with one another 
under conditions that favor hydrogen bonding. Factors that affect this bonding include: the 
type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to 
block the non-specific attachment of the liquid phase sequence to the solid support 
(Denhardt's reagent or BLOTTO); concentration of the sequences; use of compounds to 
increase the rate of association of sequences (dextran sulfate or polyethylene glycol); and the 
stringency of the washing conditions following hybridization. See Sambrook et at [supra] 
Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of 
very similar sequences over sequences that differ. For example, the combination of 
temperature and salt concentration should be chosen that is approximately 1 20 to 200 DC 
below the calculated Tm of the hybrid under study. The temperature and salt conditions can 
often be determined empirically in prehminary experiments in which samples of genomic 
DNA immobilized on filters are hybridized to the sequence of interest and then washed under 
conditions of different stringencies. See Sambrook et al at page 9.50. 

Variables to consider when performing, for example, a Southern blot are (1) the 
complexity of the DNA being blotted and (2) the homology between the probe and the 
sequences being detected. The total amount of the fragment(s) to be studied can vary a 
magnitude of 10, from 0. 1 to 1 ug for a plasmid or phage digest to 1 0' 9 to 1 0" 8 g for a single 
copy gene in a highly complex eukaryotic genome. For lower complexity polynucleotides, 
substantially shorter blotting, hybridization, and exposure times, a smaller amount of starting 
polynucleotides, and lower specific activity of probes can be used. For example, a single-copy 
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yeast gene can be detected with an exposure time of only 1 hour starting with 1 jig of yeast 
DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 10 8 cpm/^g. For a 
single-copy mammalian gene a conservative approach would start with 1 0 [ig of DNA, blot 
overnight, and hybridize overnight in the presence of 10% dextran sulfate using a probe of 
greater than 10 8 cpm/^g, resulting in an exposure time of -24 hours. 

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid 
between the probe and the fragment of interest, and consequently, the appropriate conditions 
for hybridization and washing. In many cases the probe is not 100% homologous to the 
fragment. Other commonly encountered variables include the length and total G+C content of 
the hybridizing sequences and the ionic strength and formamide content of the hybridization 
buffer. The effects of all of these factors can be approximated by a single equation: 

Tm= 81 + 16.6(log ]0 Ci) + 0.4[%(G + C)]-0.6(%formamide) - 600/«-1.5(%mismatch). 

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid 
in base pairs (slightly modified from Meinkoth & Wahl (1984) Anal Biochem. 138: 267-284). 

In designing a hybridization experiment, some factors affecting nucleic acid 
hybridization can be conveniently altered. The temperature of the hybridization and washes 
and the salt concentration during the washes are the simplest to adjust. As the temperature of 
the hybridization increases (ie. stringency), it becomes less likely for hybridization to occur 
between strands that are nonhomologous, and as a result, background decreases. If the 
radiolabeled probe is not completely homologous with the immobilized fragment (as is 
frequently the case in gene family and interspecies hybridization experiments), the 
hybridization temperature must be reduced, and background will increase. The temperature of 
the washes affects the intensity of the hybridizing band and the degree of background in a 
similar manner. The stringency of the washes is also increased with decreasing salt 
concentrations. 

In general, convenient hybridization temperatures in the presence of 50% formamide 
are 42DC for a probe with is 95% to 100% homologous to the target fragment, 37DC for 90% 
to 95% homology, and 32DC for 85% to 90% homology. For lower homologies, formamide 
content should be lowered and temperature adjusted accordingly, using the equation above. If 
the homology between the probe and the target fragment are not known, the simplest 
approach is to start with both hybridization and wash conditions which are nonstringent. If 
non-specific bands or high background are observed after autoradiography, the filter can be 



WO 99/57280 



50 



PCT7US99/09346 



washed at high stringency and reexposed. If the time required for exposure makes this 
approach impractical, several hybridization and/or washing stringencies should be tested in 
parallel. 

Nucleic Acid Probe Assays 

Methods such as PCR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes according to the invention can determine the presence of cDNA or 
mRNA. A probe is said to "hybridize" with a sequence of the invention if it can form a duplex 
or double stranded complex, which is stable enough to be detected. 

The nucleic acid probes will hybridize to the Neisserial nucleotide sequences of the 
invention (including both sense and antisense strands). Though many different nucleotide 
sequences will encode the amino acid sequence, the native Neisserial sequence is preferred 
because it is the actual sequence present in cells. mRNA represents a coding sequence and so 
a probe should be complementary to the coding sequence; single-stranded cDNA is 
complementary to mRNA, and so a cDNA probe should be complementary to the non-coding 
sequence. 

The probe sequence need not be identical to the Neisserial sequence (or its 
complement) — some variation in the sequence and length can lead to increased assay 
sensitivity if the nucleic acid probe can form a duplex with target nucleotides, which can be 
detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the 
formed duplex. Additional Neisserial sequence may also be helpful as a label to detect the 
formed duplex. For example, a non-complementary nucleotide sequence may be attached to 
the 5 ! end of the probe, with the remainder of the probe sequence being complementary to a 
Neisserial sequence. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the probe, provided that the probe sequence has sufficient complementarity 
with the a Neisserial sequence in order to hybridize therewith and thereby form a duplex 
which can be detected. 

The exact length and sequence of the probe will depend on the hybridization 
conditions, such as temperature, salt condition and the like. For example, for diagnostic 
applications, depending on the complexity of the analyte sequence, the nucleic acid probe 
typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30 
nucleotides, although it may be shorter than this. Short primers generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. 
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Probes may be produced by synthetic procedures, such as the triester method of 
Matteucci etal. [J. Am. Chem. Soc. (1981) 103:3185], or according to Urdea et al. [Proc. 
Natl Acad. Sci. USA (1983) 80: 7461], or using commercially available automated 
oligonucleotide synthesizers. 

The chemical nature of the probe can be selected according to preference. For certain 
applications, DNA or RNA are appropriate. For other applications, modifications may be 
incorporated eg. backbone modifications, such as phosphorothioates or methylphosphonates, 
can be used to increase in vivo half-life, alter RNA affinity, increase nuclease resistance etc. 
[eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH 
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997) 
TIBTECH 15:224-229; Buchardt et al (1993) TIBTECH 1 1 :384-386]. 

One example of a nucleotide hybridization assay is described by Urdea et al in 
international patent application WO92/02526 [see also US patent 5,124,246], 

Alternatively, the polymerase chain reaction (PCR) is another well-known means for 
detecting small amounts of target nucleic acids. The assay is described in: Mullis et al. [Meth. 
Enzymol (1 987) 1 55: 335-350]; US patent 4,683,195; and US patent 4,683,202. Two 
"primer" nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can comprise sequence that does not hybridize to the sequence of the 
amplification target (or its complement) to aid with duplex stability or, for example, to 
incorporate a convenient restriction site. Typically, such sequence will flank the desired 
Neisserial sequence. 

A thermostable polymerase creates copies of target nucleic acids from the primers 
using the original target nucleic acids as a template. After a threshold amount of target nucleic 
acids are generated by the polymerase, they can be detected by more traditional methods, such 
as Southern blots. When using the Southern blot method, the labelled probe will hybridize to 
the Neisserial sequence (or its complement). 

Also, mRNA or cDNA can be detected by traditional blotting techniques described in 
Sambrook et al [supra]. mRNA, or cDNA generated from mRNA using a polymerase 
enzyme, can be purified and separated using gel electrophoresis. The nucleic acids on the gel 
are then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a 
labelled probe and then washed to remove any unhybridized probe. Next, the duplexes 
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containing the labeled probe are detected. Typically, the probe is labelled with a radioactive 
moiety. 

EXAMPLES 

The examples describe nucleic acid sequences which have been identified in 
N. meningitidis, and N. gonorrhoeae along with their respective and putative translation 
products. Not all of the nucleic acid sequences are complete ie. they encode less than the full- 
length wild-type protein. 

The examples are generally in the following format: 

• a nucleotide sequence which has been identified in N. meningitidis 

• the putative translation product of said N. meningitidis sequence 

• a computer analysis of said translation product based on database comparisons 

• a corresponding nucleotide sequence identified from N. gonorrhoeae 

• the putative translation product of said N. gonorrhoeae sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a corresponding nucleotide sequence identified from strain A of N. meningitidis 

• the putative translation product of said N. meningitidis strain A sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the K gonorrhoeae sequence. 

• a description of the characteristics of the protein which indicates that it might be 
suitably antigenic or immunogenic. 

Sequence comparisons were performed at NCBI (http://www.ncbi.nlm.nih.gov) using 
the algorithms BLAST, BLAST2, BLASTn, BLASTp, tBLASTn, BLASTx, & tBLASTx [eg. 
see also Altschul et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Research 25:2289-3402]. Searches were performed 
against the following databases: non-redundant GenBank+EMBL+DDBJ+PDB sequences 
and non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR sequences. 

Dots within nucleotide sequences represent nucleotides which have been arbitrarily 
introduced in order to maintain a reading frame. In the same way, double-underlined 
nucleotides were removed . Lower case letters represent ambiguities which arose during 



WO 99/57280 



PCT/US99/09346 



53 

alignment of independent sequencing reactions (some of the nucleotide sequences in the 
examples are derived from combining the results of two or more experiments). 

Nucleotide sequences were scanned in all six reading frames to predict the presence of 
hydrophobic domains using an algorithm based on the statistical studies of Esposti et al 
[Critical evaluation of the hydropathy of membrane proteins (1990) EurJBiochem 190:207- 
219]. These domains represent potential transmembrane regions or hydrophobic leader 
sequences. 

Open reading frames were predicted from fragmented nucleotide sequences using the 
program ORFFINDER (NCBI). 

Underlined amino acid sequences indicate possible transmembrane domains or leader 
sequences in the ORFs, as predicted by the PSORT algorithm (http://www.psort.nibb.ac.jp). 
Functional domains were also predicted using the MOTIFS program (GCG Wisconsin & 
PROSITE). 

For each of the following examples: based on the presence of a putative leader 
sequence and/or several putative transmembrane domains (single-underlined) in the 
gonococcal protein, it is predicted that the proteins from N. meningitidis and K gonorrhoeae, 
and their respective epitopes, could be useful antigens or immunogenic compositions for 
vaccines or diagnostics. 

The standard techniques and procedures which may be employed in order to perform 
the invention {e.g. to utilize the disclosed sequences for vaccination or diagnostic purposes) 
were summarized above. This summary is not a limitation on the invention but, rather, gives 
examples that may be used, but are not required. 

In particular, the following methods were used to express, purify and biochemically 
characterize the proteins of the invention. 
Chromosomal DNA Preparation 

^meningitidis strain 2996 was grown to exponential phase in 100ml of GC medium, 
harvested by centrifugation, and resuspended in 5ml buffer (20%(w/v) Sucrose, 50mM Tris- 
HC1, 50mM EDTA, pH8). After 10 minutes incubation on ice : the bacteria were lysed by 
adding 1 0ml of lysis solution (50mMNaCl, l%Na-Sarkosyl, SOpg'ml Proteinase K), and the 
suspension incubated at 37°C for 2 hours. Two phenol extractions (equilibrated to pH 8) and 
one CHCl 3 /isoamylalcohol (24:1) extraction were performed. DNA was precipitated by 
addition of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifugation. 
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The pellet was washed once with 70%(v/v) ethanol and redissolved in 4.0ml TE buffer 
(lOmM Tris-HCl, ImM EDTA, pH 8.0). The DNA concentration was measured by reading 
the OD at 260 nm. 

Oligonucleotide design 

Synthetic oligonucleotide primers were designed on the basis of the coding sequence 
of each ORF, using (a) the meningococcus B sequence when available, or (b) the 
gonococcus/meningococcus A sequence, adapted to the codon preference usage of 
meningococcus as necessary. Any predicted signal peptides were omitted, by designing the 5' 
primers to sequence immediately downstream from the predicted leader sequence. 

For most ORFs, the 5' primers included two restriction enzyme recognition sites 
(Bamffi-Ndel, BamHl-Nhel y EcoRl-Ndel or EcoRl-Nhel\ depending on the restriction pattern 
of the gene of interest. The 3' primers included aXhoI or a Hindlll restriction site (table 1). 
This procedure was established in order to direct the cloning of each amplification product 
(corresponding to each ORF) into two different expression systems: pGEX-KG (using 
BamHl-Xhol BamHl-Hindm, EcoRl-Xhol or EcoRl-Hindlll), and pET2 1 b+ (using Ndel- 
Xhol Mel-Xhol, Ndel-HindVll or Nhel-HindUI). 

5'-end primer tail: CGC GGATCCCATATG (BamHl-Ndel) 

CGCGGATCCGCTAGC (BamHl-NheT) 
CCG GAATTC TA CATATG (EcoRI-Ndel) 
CCG GAATTC TA GCTAGC (EcoRI-Nhel) 

3'-end primer tail: CCCG CTCGAG (Xhol) 

CCCGCTCGAG (Hindlll) 

For cloning ORFs into the pGEX-His vector, the 5' and 3 ' primers contained only one 
restriction enzyme site {EcoRI, Kpnl or Sail for the 5' primers and Pstl, Xbal, Sphl or Sail for 
the 3' primers). Again restriction sites were chosen according to the particular restriction 
pattern of the gene (table 1 ). 

5'-end primer tail: (AAA) AAAGAATTC (EcoRI) 

( AAA ) AAA GGTACC (Kpnl) 
3*-end primer tail : (AAA) AAACTGCAG (Pstl) 

( AAA ) AAA TCTAGA (Xbal) 
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AAAGCATGC (Sptil) 
5' or 3'-end primer tail: AAAAAA GTCGAC (Sail) 

As well as containing the restriction enzyme recognition sequences, the primers 
included nucleotides which hybridized to the sequence to be amplified. The melting 
temperature depended on the number and type of hybridising nucleotides in the whole primer, 
and was determined for each primer using the formulae: 

T m = 4 (G+C)+ 2 (A+T) (tail excluded) 

T m = 64.9 + 0.41 (% GC) - 600/N (whole primer) 

The melting temperatures of the selected oligonucleotides were usually 65-70°C for 
the whole oligo and 50-55°C for the hybridising region alone. 

Table 1 shows the forward and reverse primers used for each amplification. In certain 
cases, the sequence of the primer does not exactly match the sequence of the predicted ORF. 
This is because when initial amplifications were performed, the complete 5' and/or 3' 
sequences for some meningococcal B ORFs were not known. However the corresponding 
sequences had been identified in Gonococcus or in Meningoccus A. Hence, when the 
Meningoccus B sequence was incomplete or uncertain, Gonococcal or Meningococcal A 
sequences were used as the basis for primer design. These sequences were altered to take 
account of codon preference. It can be appreciated that, once the complete sequence is 
identified, this approach will no longer be necessary. 

Oligonucleotides were synthesized using a Perkin Elmer 394 DNA/RNA Synthesizer, 
eluted from the columns in 2.0ml NH4OH, and deprotected by 5 hours incubation at 56°C. 
The oligos were precipitated by addition of 0.3M Na- Acetate and 2 volumes ethanol. The 
samples were centrifuged and the pellets resuspended in either 1 OOp.1 or 1 .0ml of water. The 
OD260 was determined using a Perkin Elmer Lambda Bio spectophotometer and the 
concentration adjusted to 2-lOpmol/pl. 

Amplification 

The standard PCR protocol was as follows: 50-200ng of genomic DNA was used as a 
template in the presence of 20-40jiM of each oligonucletide primer, 400-800jaM dNTPs 
solution, lx PCR buffer (including 1 .5mM MgCl 2 ), 2.5 units TaqI DNA polymerase (using 
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Perkin-Elmer AmpliTaQ, GIBCO Platinum, Pwo DNA polymerase, or Tahara Shiizo Taq 
polymerase). In some cases, PCR was optimsed by the addition of lOjil DMSO or 50|il 2M 
Betaine. 

After a hot start (adding the polymerase during a preliminary 3 minute incubation of 
the whole mix at 95°C), each sample underwent a two-step amplification. The first 5 cycles 
were performed using the hybridization temperature that excluded the restriction enzyme tail 
of the primer (see above). This was followed by 30 cycles using the hybridization temperature 
calculated for the whole length oligos. The cycles were completed with a 10 minute extension 
step at 72°C. The standard cycles were as follows: 





Denaturation 


Hybridisation 


Elongation 


First 5 cycles 


30 seconds 
95°C 


30 seconds 
50-55°C 


30-60 seconds 
72°C 


Last 30 cycles 


30 seconds 
95°C 


30 seconds 
65-70°C 


30-60 seconds 
72°C 



Elongation times varied according to the length of the ORF to be amplified. 
Amplifications were performed using either a 9600 or a 2400 Perkin Elmer GeneAmp PCR 
System. To check the results, 1/10 of the amplification volume was loaded onto a 1-1 .5% 
(w/v) agarose gel and the size of each amplified fragment compared with a DNA molecular 
weight marker. 

The amplified DNA was either loaded directly on a 1 % agarose gel or first 
precipitated with ethanol and resuspended in a volume suitable to be loaded on a 1 .0% 
agarose gel. The DNA fragment corresponding to the band of correct size was purified using 
the Qiagen Gel Extraction Kit, following the manufacturer's protocol DNA fragments were 
eluted in a volume of 30\x\ or 50^1 with either H20 or 1 OmM Tris, pH 8.5. 

Digestion of PCR fragments 

The purified DNA corresponding to the amplified fragment was doubly-digested with 
the appropriate restriction enzymes for; cloning into pET-21b+ and expressing the protein as a 
C-terminus His-tagged ftision, for cloning into pGEX-KG and expressing the protein as a N- 
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terminus GST-fusion, and for cloning into pGEX-His and expressing the protein as a 
N-terminus GST-His tagged fusion. 

Each purified DNA fragment was incubated at 37°C for 3 hours to overnight widi 20 
units of appropriate restriction enzyme (New England Biolabs) in a volume of either 30 or 
40ul in the presence of suitable digestion buffer. Digested fragments were purified using the 
QIAquick PCR purification kit (following the manufacturer's instructions) and eluted in a 
volume of 30ui or 50u\l with either H20 or lOmM Tris, pH 8.5. The DNA concentration was 
determined by quantitative agarose gel electrophoresis (1 .0% gel) in the presence of a titrated 
molecular weight marker. 

Digestion of the cloning vectors (pET22B, pGEX-KG, pTRC-His A, pET21b+, pGEX- 
KG, and pGEX-His) 

The vector pGEX-His is a modified pGEX-2T vector carrying a region encoding six 
histidine residues upstream of the thrombin cleavage site and containing the multiple cloning 
site of the vector pTRC99 (Pharmacia). 10 jig plasmid was double-digested with 50 units of 
each restriction enzyme in 200 ul reaction volume in the presence of appropriate buffer by 
overnight incubation at 37°C. After loading the whole digestion on a 1% agarose gel, the 
band corresponding to the digested vector was purified from the gel using the Qiagen 
QIAquick Gel Extraction Kit and the DNA was eluted in 50 \xl of 10 mM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD 26 o of the sample, and adjusted to 50 
|ig/ul. 1 |li1 of plasmid was used for each cloning procedure. 

lOfig of plasmid vector was doubly-digested with 50 units of each restriction enzyme 
in a volume of 200ul with the appropriate buffer overnight at 37°C ; The digest was loaded 
onto a 1.0% agarose gel and the band corresponding to the digested vector purified using the 
Qiagen QIAquick Gel Extraction Kit. DNA was eluted in 50uJ of lOmM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD2 6 onm and the concentration adjusted 
to 50|ag/ul lul of plasmid was used for each cloning procedure. 

Cloning 

For some ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated in both pET22b and pGEX-KG. In a final volume of 20 ul, a molar 
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ratio of 3:1 fragment/vector was ligated using 0.5 ^tl of NEB T4 DNA ligase (400 units/^l), in 
the presence of the buffer supplied by the manufacturer. The reaction was incubated at room 
temperature for 3 hours. In some experiments, ligation was performed using the Boheringer 
"Rapid Ligation Kit", following the manufacturer's instructions. 

In order to introduce the recombinant plasmid in a suitable strain, 100 ^1 2>. coli DH5 
competent cells were incubated with the ligase reaction solution for 40 minutes on ice, then at 
37°C for 3 minutes, then, after adding 800 ^1 LB broth, again at 37°C for 20 minutes. The 
cells were then centrifuged at maximum speed in an Eppendorf microftige and resuspended in 
approximately 200 \i\ of the supernatant. The suspension was then plated on LB ampicillin 
(100 mg/ml ). 

The screening of the recombinant clones was performed by growing 5 
randomly-chosen colonies overnight at 37 °C in either 2 ml (pGEX or pTC clones) or 5ml 
(pET clones) LB broth + 100 ^g/ml ampicillin. The cells were then pelletted and the DNA 
extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions, to a final volume of 30 jil. 5 ^1 of each individual miniprep (approximately lg ) 
were digested with either NdeVXhol or BamHVXhol and the whole digestion loaded onto a 1 - 
1.5% agarose gel (depending on the expected insert size), in parallel with the molecular 
weight marker (1Kb DNA Ladder, GEBCO). The screening of the positive clones was made 
on the base of the correct insert size. 

For other ORFs, the fragments corresponding to each ORE, previously digested and 
purified, were ligated into both pET21b+ and pGEX-KG. A molar ratio of of 3:1 
fragment/vector was used in a final volume of 20^1, that included 0.5^x1 T4 DNA ligase (400 
units/^1, NEB) and ligation buffer supplied by the manufacturer. The reaction was performed 
at room temperature for 3 hours. In some experiments, ligation was performed using the 
Boheringer "Rapid Ligation Kit" and the manufacturer's protocol. 

Recombinant plasmid was transformed into 100^1 of competent E. coli DH5 or 
HB101 by incubating the ligase reaction solution and bacteria for 40 minutes on ice then at 
37°C for 3 minutes. This was followed by the addition of 800^1 LB broth and incubation at 
37°C for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf 
microfuge, resuspended in approximately 200^1 of the supernatant and plated onto LB 
ampicillin (lOOmg/ml ) agar. 
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Screening for recombinant clones was performed by growing 5 randomly selected 
colonies overnight at 37°C in either 2.0ml (pGEX-KG clones) or 5.0ml (pET clones) LB 
broth + lOOjag/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the 
Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions. 
Approximately Ijig of each individual miniprep was digested with the appropriate restriction 
enzymes and the digest loaded onto a 1-1.5% agarose gel (depending on the expected insert 
size), in parallel with the molecular weight marker (lkb DNA Ladder, GIBCO). Positive 
clones were selected on the basis of the size of insert. 

ORFs were cloned into PGEX-His, by doubly-digesting the PCR product and ligating 
into similarly digested vector. After cloning, recombinant plasmids were transformed into the 
E.coli host W31 10. Individual clones were grown overnight at 37°C in LB broth with 50jig/ml 
ampicillin. 

Certain ORFs may be cloned into the pGEX-HIS vector using EcoRl-Psfl cloning 
sites, or EcoRl-SaH, or Sall-PstL After cloning, the recombinant plasmids may be introduced 
inthe£.coli host W3110. 

Expression 

Each ORF cloned into the expression vector may then be transformed into the strain 
suitable for expression of the recombinant protein product. 1 jil of each construct was used to 
transform 30 ^il of E.coli BL21 (pGEX vector), Ecoli TOP 10 (pTRC vector) or E.coli BL21- 
DE3 (pET vector), as described above. In the case of the pGEX-His vector, the same E.coli 
strain (W3 1 10) was used for initial cloning and expression. Single recombinant colonies were 
inoculated into 2ml LB+Amp (100 Hg/ml), incubated at 37°C overnight, then diluted 1:30 in 
20 ml of LB+Amp (100 |ig/ml) in 100 ml flasks, making sure that the OD 600 ranged between 
0.1 and 0.15. The flasks were incubated at 30°C into gyratory water bath shakers until OD 
indicated exponential growth suitable for induction of expression (0.4-0.8 OD for pET and 
pTRC vectors; 0.8-1 OD for pGEX and pGEX-His vectors). For the pET, pTRC and pGEX- 
His vectors, the protein expression was induced by addiction of ImM IPTG, whereas in the 
case of pGEX system the final concentration of IPTG was 0.2 mM. After 3 hours incubation 
at 30°C, the final concentration of the sample was checked by OD. In order to check 
expression, 1ml of each sample was removed, centrifuged in a microfuge, the pellet 
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resuspended in PBS, and analysed by 12% SDS-PAGE with Coomassie Blue staining. The 
whole sample was centrifiiged at 6000£ and the pellet resuspended in PBS for further use. 

GST-fusion proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on LB+Amp agar plate. 
The bacteria were inoculated into 20 ml of LB+Amp liquid colture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600 ml of fresh medium and allowed to 
grow at the optimal temperature (20-37°C) to OD 550 0.8-1 . Protein expression was induced 
with 0.2mM DPTG followed by three hours incubation. The culture was centrifiiged at 8000 
rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 7.5 ml 
cold PBS. The cells were disrupted by sonication on ice for 30 sec at 40W using a Branson 
sonifier B-15, frozen and thawed two times and centrifiiged again. The supernatant was 
collected and mixed with 150p.I Glutatione-Sepharose 4B resin (Pharmacia) (previously 
washed with PBS) and incubated at room temperature for 30 minutes. The sample was 
centrifuged at 700g for 5 minutes at 4C. The resin was washed twice with 10 ml cold PBS for 
10 minutes, resuspended in 1ml cold PBS, and loaded on a disposable column. The resin was 
washed twice with 2ml cold PBS until the flow-through reached OD 28 o of 0.02-0.06. The 
GST-fusion protein was eluted by addition of 700^1 cold Glutathione elution buffer lOmM 
reduced glutathione, 50mM Tris-HCl) and fractions collected until the OD 2 g 0 was 0. 1 . 21 pi of 
each fraction were loaded on a 12% SDS gel using either Biorad SDS-PAGE Molecular 
weight standard broad range (Ml) (200, 1 16.25, 97.4, 66.2, 45, 31, 21.5, 14.4, 6.5 kDa) or 
Amersham Rainbow Marker (M") (220, 66, 46, 30, 21.5, 14.3 kDa) as standards. As the MW 
of GST is 26kDa, this value must be added to the MW of each GST-fusion protein. 

For other ORFs, for each clone to be purified as a GST-fusion, a single colony was 
streaked out and grown overnight at 37°C on a LB/Amp. (lOO^ig/ml) agar plate. An isolated 
colony from this plate was inoculated into 20ml of LB/Amp (100 ug/ml) liquid medium and 
grown overnight at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml 
LB/Amp (100ng/ml) liquid medium and allowed to grow at the optimal temperature (20- 
37°C) until the OD 55 onm reached 0.6-0.8. Recombinant protein expression was induced by 
addition of IPTG (final concentration 0.2mM) and the culture incubated for a further 3 hours. 
Bacteria were harvested by centrifiigation at 8000xg for 15 mm at 4°C. 
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The bacterial pellet was resuspended in 7.5ml cold PBS. Cells were disrupted by 
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifliged at 
13 OOOxg for 30 min at 4°C. The supernatant was collected and mixed with 1 50(xl Glutatione- 
Sepharose 4B resin (Pharmacia), previously equilibrated with PBS, and incubated at room 
temperature with gentle agitation for 30 min. The batch-wise preparation was centrifliged at 
700xg for 5 min at 4°C and the supernatant discarded. The resin was washed twice 
(batchwise) with 10ml cold PBS for 10 min, resuspended in 1ml cold PBS, and loaded onto a 
disposable column. The resin continued to be washed with cold PBS, until the OD 28 orm of the 
flow-through reached 0.02-0.01. The GST-fusion protein was eluted by addition of 700|xl cold 
glutathione elution buffer (lOmM reduced glutathione, 50mM Tris-HCl pH 8.0) and fractions 
collected, until the OD 2 80nm of the eluate indicated all the recombinant protein was obtained. 
20^1 aliquots of each elution fraction were analyzed by SDS-PAGE using a 12% gel. The 
molecular mass of the purified proteins was determined using either the Bio-Rad broad range 
molecular weight standard (Ml) (200, 1 16, 97.4, 66.2, 45.0, 31.0, 21.5, 14.4, 6.5 kDa) or the 
Amersham Rainbow Marker (M2) (220, 66.2, 46.0, 30.0, 21.5, 14.3 kDa). The molecular 
weights of GST-fusion proteins are a combination of the 26 kDa GST protein and its fusion 
partner. Protein concentrations were estimated using the Bradford assay. 

His-fusion soluble proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on a LB + Amp agar 
plate. The bacteria were inoculated into 20ml of LB+Amp liquid culture and incubated 
overnight in a water bath shaker. Bacteria were diluted 1 :30 into 600ml fresh medium and 
allowed to grow at the optimal temperature (20-37°C) to OD 550 0.6-0.8. Protein expression 
was induced by addition of 1 mM IPTG and the culture further incubated for three hours. The 
culture was centrifuged at 8000 rpm at 4°C, the supernatant was discarded and the bacterial 
pellet was resuspended in 7.5ml cold lOmM imidazole buffer (300 mM NaCl, 50 mM 
phosphate buffer, 10 mM imidazole, pH 8). The cells were disrupted by sonication on ice for 
30 sec at 40W using a Branson sonifier B-15, frozen and thawed two times and centrifuged 
again. The supernatant was collected and mixed with 150}il Ni 2+ -resin (Pharmacia) 
(previously washed with lOmM imidazole buffer) and incubated at room temperature with 
gentle agitation for 30 minutes. The sample was centrifuged at 700g for 5 minutes at 4°C. 
The resin was washed twice with 10 ml cold lOmM imidazole buffer for 10 minutes, 
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resuspended in 1ml cold lOmM imidazole buffer and loaded on a disposable column. The 
resin was washed at 4°C with 2ml cold lOmM imidazole buffer until the flow-through 
reached the O.D 2 go of 0.02-0.06. The resin was washed with 2ml cold 20mM imidazole 
buffer (300 mM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8) until the flow- 
through reached the O.D 28 o of 0.02-0.06. The His-fusion protein was eluted by addition of 
700ul cold 250mM imidazole buffer (300 mM NaCl, 50 mM phosphate buffer, 250 mM 
imidazole, pH 8) and fractions collected until the O.D 2 go was 0. 1. 21uJ of each fraction were 
loaded on a 1 2% SDS gel. 

His-fusion insoluble proteins large-scale purification. 

A single colony was grown overnight at 37 °C on a LB + Amp agar plate. The 
bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600ml fresh medium and let to grow at the 
optimal temperature (37°C) to O.D 550 0.6-0.8. Protein expression was induced by addition 
of 1 mM IPTG and the culture further incubated for three hours. The culture was centrifuged 
at 8000rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 
7.5 ml buffer B (urea 8M, 1 OmM Tris-HCl, 1 OOmM phosphate buffer, pH 8.8). The cells 
were disrupted by sonication on ice for 30 sec at 40W using a Branson sonifler B-l 5, frozen 
and thawed twice and centrifuged again. The supernatant was stored at -20°C, while the 
pellets were resuspended in 2 ml guanidine buffer (6M guanidine hydrochloride, lOOrnM 
phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated in a homogenizer for 10 cycles. The 
product was centrifuged at 13000 rpm for 40 minutes. The supernatant was mixed with 1 50^1 
Ni 2+ -resin (Pharmacia) (previously washed with buffer B) and incubated at room temperature 
with gentle agitation for 30 minutes. The sample was centrifuged at 700 g for 5 minutes at 
4°C. The resin was washed twice with 10 ml buffer B for 10 minutes, resuspended in 1ml 
buffer B, and loaded on a disposable column. The resin was washed at room temperature 
with 2ml bufferB until the flow-through reached the OD 280 of 0.02-0.06. The resin was 
washed with 2ml buffer C (urea 8M, 1 OmM Tris-HCl, lOOrnM phosphate buffer, pH 6.3) until 
the flow-through reached the O.D 28 o of 0.02-0.06. The His-fusion protein was eluted by 
addition of 700^1 elution buffer (urea 8M, lOmM Tris-HCl, lOOrnM phosphate buffer, pH 
4.5) and fractions collected until the OD 280 was 0. 1 . 2 1 ul of each fraction were loaded on a 
12% SDS gel. 
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Purification of His-fusion proteins. 

For each clone to be purified as a His-fiision, a single colony was streaked out and 
grown overnight at 37°C on a LB/Amp (100 ng/ml) agar plate. An isolated colony from this 
plate was inoculated into 20ml of LB/Amp (100 jig/ml) liquid medium and grown overnight 
at 37°C with shaking. The overnight culture was diluted 1:30 into 600ml LB/Amp (100 
tig/ml) liquid medium and allowed to grow at the optimal temperature (20-3 7°C) until the 
OD 55 onni reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG 
(final concentration 1 .OmM) and the culture incubated for a further 3 hours. Bacteria were 
harvested by centrifiigation at 8000xg for 15 min at 4°C. 

The bacterial pellet was resuspended in 7.5ml of either (i) cold buffer A (300mM 
NaCl, 50mM phosphate buffer, lOmM imidazole, pH 8.0) for soluble proteins or (ii) buffer B 
(8M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8) for insoluble proteins. Cells 
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. For insoluble proteins, pellets were 
resuspended in 2.0 ml buffer C (6M guanidine hydrochloride, lOOmM phosphate buffer, 
lOmM Tris-HCl, pH 7.5) and treated with a Dounce homogenizer for 1 0 cycles. The 
homogenate was centrifuged at 13 OOOxg for 40 min and the supernatant retained. 

Supernatants for both soluble and insoluble preparations were mixed with 1 50j.il Ni 2+ - 
resin (previously equilibrated with either buffer A or buffer B, as appropriate) and incubated 
at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast 
Flow (Pharmacia), prepared according to manufacturers protocol. The batch-wise preparation 
was centrifuged at 700xg for 5 min at 4°C and the supernatant discarded. The resin was 
washed twice (batch-wise) with 10ml buffer A or B for 10 min, resuspended in 1.0 ml buffer 
A or B and loaded onto a disposable column. The resin continued to be washed with either (i) 
buffer A at 4°C or (ii) buffer B at room temperature, until the OD 28 omn of the flow-through 
reached 0.02-0.01. The resin was further washed with either (i) cold buffer C (300mM NaCl, 
50mM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) buffer D (8M urea, lOmM Tris- 
HCl, lOOmM phosphate buffer, pH 6.3) until the the OD 28 onm of the flow-through reached 
0.02-0.01. The His-fusion protein was eluted by addition of 700^1 of either (i) cold elution 
buffer A (300mM NaCl, 50mM phosphate buffer, 250mM imidazole, pH 8.0) or (ii) elution 
buffer B (8 M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 4.5) and fractions 



WO 99/57280 



64 



PCT/US99/09346 



collected until the O.D 2 80nm indicated all the recombinant protein was obtained. 20jil aliquots 
of each elution fraction were analyzed by SDS-PAGE using a 12% gel. Protein concentrations 
were estimated using the Bradford assay. 

His-fusion proteins renaturation 

In the cases where denaturation was required to solubilize proteins, a renaturation step . 
was employed prior to immunization. Glycerol was added to the denatured fractions obtained 
above to give a final concentration of 10%(v/v). The proteins were diluted to 200^g/ml using 
dialysis buffer I (10% (v/v) glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced 
glutathione, 0.5mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same 
buffer for 12-14 hours at 4°C. Further dialysis was performed with buffer II (10% (v/v) 
glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced glutathione, 0.5mM 
oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Alternatively, 10% glycerol was added to the denatured proteins. The proteins were 
then diluted to 20|ig/ml using dialysis buffer I (10% glycerol, 0.5M arginine, 50mM 
phosphate buffer, 5mM reduced glutathione, 0.5mM oxidised glutathione, 2M urea, pH 8 8) 
and dialysed against the same buffer at 4°C for 12-14 hours. The protein was further dialysed 
against dialysis buffer II (10% glycerol, 0.5M arginine, 50mM phosphate buffer, 5mM 
reduced glutathione, 0.5mM oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Protein concentration was evaluated using the formula: 

Protein (mg/ml) = (L55 x OD 28 o) - (0.76 x OD 260 ) 

Purification of proteins 

To analyse the solubility, pellets obtained from 3.0ml cultures were resuspended in 
SOOfil buffer Ml (PBS pH 7.2). 25fil of lysozyme (lOmg/ml) was added and the bacteria 
incubated for 15 min at 4°C. Cells were disrupted by sonication on ice four times for 30 sec at 
40W using a Branson sonifier 450 and centrifuged at 13 OOOxg for 30 min at 4°C. The 
supernatant was collected and the pellet resuspended in buffer M2 [8M urea, 0.5M NaCl, 
20mM imidazole and 0. 1 M NaH 2 P0 4 ] and incubated for 3 to 4 hours at 4°C. After 
centrifugation, the supernatant was collected and the pellet resuspended in buffer M3 [6M 
guanidinium-HCl, 0.5M NaCl, 20mM imidazole and 0.1M NaH 2 P0 4 ] overnight at 4°C. The 
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supematants from all steps were analysed by SDS-PAGE. Some proteins were found to be 
soluble in PBS, others need urea or guanidium-HCl for solubilization. 

For preparative scale purifications, 500ml cultures were induced and fusion proteins 
solubilized in either buffer Ml, M2 or M3 using the procedure described above. Crude 
extracts were loaded onto a Ni-NTA superflow column (Quiagen) equilibrated with buffer 
Ml, M2 or M3 depending on the solubilization buffer employed. Unbound material was 
eluted by washing the column with the same buffer. The recombinant fusion protein was 
eluted with the corresponding buffer containing 500mM imidazole then dialysed against the 
same buffer in the absence of imidazole. 

Mice immunisations 

20ng of each purified protein are used to immunise mice intraperitoneal ly. In the case 
of some ORFs, Balb-C mice were immunised with Al(OH) 3 as adjuvant on days 1, 21 and 42, 
and immune response was monitored in samples taken on day 56. For other ORFs, CD1 mice 
could be immunised using the same protocol. For ORFs 25 and 40, CD1 mice were 
immunised using Freund's adjuvant, and the same immunisation protocol was used, except 
that the immune response was measured on day 42, rather than 56. Similarly, for still other 
ORFs, CD1 mice were immunised with Freund's adjuvant, but the immune response was 
measured on day 49. Alternatively, 20^g of each purified protein was mixed with Freund's 
adjuvant and used to immunise CD1 mice intraperitoneal^. For many of the proteins, the 
immunization was performed on days 1,21 and 35, and immune response was monitored in 
samples taken on days 34 and 49. For some proteins, the third immunization was performed 
on day 28, rather than 35, and the immune response was measured on days 20 and 42, rather 
than 34 and 49. 

ELISA assay (sera analysis) 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 7ml of Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD 6 2o- The 
bacteria were let to grow until the OD reached the value of 0.3-0.4. The culture was 
centrifiiged for 10 minutes at 10000 rpm. The supernatant was discarded and bacteria were 
washed once with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated 
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for 2 hours at room temperature and then overnight at 4°C with stirring. 100|il bacterial cells 
were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells 
were then washed three times with PBT washing buffer (0. 1 % Tween-20 in PBS). 200 jil of 
saturation buffer (2.7% Polyvinylpyrrolidone 10 in water) was added to each well and the 
plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 200 ^1 of 
diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20, 0.1% NaN 3 in PBS) were added to 
each well and the plates incubated for 90 minutes at 37°C. Wells were washed three times 
with PBT. 100 \xl of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1 :2000 in 
dilution buffer were added to each well and the plates were incubated for 90 minutes at 37 C C. 
Wells were washed three times with PBT buffer. 1 00 fil of substrate buffer for HRP (25 ml of 
citrate buffer pH5, 10 mg of O-phenildiamine and 10 jil of H 2 0) were added to each well and 
the plates were left at room temperature for 20 minutes. 100 |il H 2 S0 4 was added to each 
well and OD490 was followed. The ELISA was considered positive when OD490 was 2.5 
times the respective pre-immune sera. 

Alternatively, The acapsulated MenB M7 strain was plated on chocolate agar plates 
and incubated overnight at 37°C. Bacterial colonies were collected from the agar plates using 
a sterile dracon swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD 620 . The bacteria 
were let to grow until the OD reached the value of 0.3-0.4. The culture was centrifuged for 10 
minutes at 10 OOOrpm. The supernatant was discarded and bacteria were washed once with 
PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37°C 
and then overnight at 4°C with stirring. 100^1 bacterial cells were added to each well of a 96 
well Greiner plate and incubated overnight at 4°C. The wells were then washed three times 
with PBT washing buffer (0.1% Tween-20 in PBS). 200pJ of saturation buffer (2.7% 
Polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours 
at 37°C. Wells were washed three times with PBT. 200^1 of diluted sera (Dilution buffer: 1% 
BSA, 0.1% Tween-20, 0.1 % NaN 3 in PBS) were added to each well and the plates incubated 
for 2 hours at 37°C. Wells were washed three times with PBT. 1 00^.1 of HRP-conjugated 
rabbit anti-mouse (Dako) serum diluted 1:2000 in dilution buffer were added to each well and 
the plates were incubated for 90 minutes at 37°C. Wells were washed three times with PBT 
buffer. 100^1 of substrate buffer for HRP (25ml of citrate buffer pH5, lOmg of O- 
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phenildiamine and 10p.l of H 2 0 2 ) were added to each well and the plates were left at room 
temperature for 20 minutes. lOOu] of 12.5% H 2 S0 4 was added to each well and OD 490 was 
followed. The ELISA titers were calculated abitrarely as the dilution of sera which gave an 
OD 49 o value of 0.4 above the level of preimmune sera. The ELISA was considered positive 
when the dilution of sera with OD490 of 0.4 was higher than 1 :400. 

FACScan bacteria Binding Assay procedure. 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth (Difco) 
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following 
OD620. The bacteria were let to grow until the OD reached the value of 0.35-0.5. The culture 
was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was 
resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN 3 ) and centrifuged for 5 minutes 
at 4000rpm. Cells were resuspended in blocking buffer to reach OD 620 of 0.07. lOOul bacterial 
cells were added to each well of a Costar 96 well plate. 100u.l of diluted (1 : 100,' 1 :200, 1 :400) 
sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4°C. 
Cells were centrifuged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed 
by addition of 200u]/well of blocking buffer in each well. lOOul of R-Phicoerytrin conjugated 
F(ab) 2 goat anti-mouse, diluted 1 :100, was added to each well and plates incubated for 1 hour 
at 4°C. Cells were spun down by centrifugation at 4000rpm for 5 minutes and washed by 
addition of 200|al/well of blocking buffer. The supernatant was aspirated and cells 
resuspended in 200u.l/well of PBS, 0.25% formaldehyde. Samples were transferred to 
FACScan tubes and read. The condition for FACScan (Laser Power 15mW) setting were: FL2 
on; FSC-H threshold:92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2 
PMT: 586; compensation values: 0. 

OMV preparations 

Bacteria were grown overnight on 5 GC plates, harvested with a loop and resuspended 
in 10 ml 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30 minutes and the 
bacteria disrupted by sonication for 10' on ice ( 50% duty cycle, 50% output ). Unbroken 
cells were removed by centrifugation at 5000g for 10 minutes and the total cell envelope 
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fraction recovered by centrifugation at 50000^ at 4°C for 75 minutes. To extract cytoplasmic 
membrane proteins from the crude outer membranes, the whole fraction was resuspended in 
2% sarkosyl (Sigma) and incubated at room temperature for 20 minutes. The suspension was 
centrifuged at 10000^ for 10 minutes to remove aggregates, and the supernatant further 
ultracentrifuged at SOOOOg for 75 minutes to pellet the outer membranes. The outer 
membranes were resuspended in lOmM Tris-HCl, pH8 and the protein concentration 
measured by the Bio-Rad Protein assay, using BSA as a standard. 

Whole Extracts preparation 

Bacteria were grown overnight on a GC plate, harvested with a loop and resuspended 
in 1ml of 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30' minutes. 

Western blotting 

Purified proteins (500ng/lane), outer membrane vesicles (5jig) and total cell extracts 
(25fig) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and 
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150mA at 
4°C, using transfer buffer (03 % Tris base, 1 .44 % glycine, 20% (v/v) methanol). The 
membrane was saturated by overnight incubation at 4°C in saturation buffer (10% skimmed 
milk, 0.1% Triton XI 00 in PBS). The membrane was washed twice with washing buffer (3% 
skimmed milk, 0.1% Triton X100 in PBS) and incubated for 2 hours at 37°C with mice sera 
diluted 1 :200 in washing buffer. The membrane was washed twice and incubated for 90 
minutes with a 1 :2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The 
membrane was washed twice with 0.1% Triton XI 00 in PBS and developed with the Opti- 
4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water. 

Bactericidal assay 

MC58 and 2996 strains were grown overnight at 37°C on chocolate agar plates. 5-7 
colonies were collected and used to inoculate 7ml Mueller-Hinton broth. The suspension was 
incubated at 37°C on a nutator and let to grow until OD 620 was in between 0.5-0.8. The 
culture was aliquoted into sterile 1.5ml Eppendorf tubes and centrifuged for 20 minutes at 
maximum speed in a microfuge. The pellet was washed once in Gey's buffer (Gibco) and 
resuspended in the same buffer to an OD 620 of 0.5, diluted 1 :20000 in Gey 's buffer and stored 
at 25°C. 
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50\i\ of Gey's buffer/ 1% BSA was added to each well of a 96-well tissue culture plate. 
25^1 of diluted (1:100) mice sera (dilution buffer: Gey's buffer/0.2% BSA) were added to 
each well and the plate incubated at 4°C. 25p.l of the previously described bacterial 
suspension were added to each well. 25pl of either heat-inactivated (56°C waterbath for 30 
minutes) or normal baby rabbit complement were added to each well. Immediately after the 
addition of the baby rabbit complement, 22jil of each sample/well were plated on Mueller- 
Hinton agar plates (time 0). The 96-well plate was incubated for 1 hour at 37°C with rotation 
and then 22^1 of each sample/well were plated on Mueller-Hinton agar plates (time 1). After 
overnight incubation the colonies corresponding to time 0 and time lh were counted. 

Gene Variability 

The ORF4 and 979 genes were amplified by PCR on chromosomal DNA extracted 
from various Neisseria strains (see list of strains). The following oligonucleotides used as 
PCR primers were designed in the upstream and downstream regions of the genes: 

orf 4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 

919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 
919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

The PCR cycling conditions were: 

1 cycle 2 min. at 94° 

30 cycles 30 sec. at 94° 

30 sec. at ~ 54° or ~ 60° ( in according to Tm of the primers) 
40 sec. at 72° . 
1 cycle 7 min. at 72° 

The PCR products were purified from 1 % agarose gel and sequenced using the 
following primers: 

orf 4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.2 (forward) CGACCGCGCCTTTGGGACTG 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 
orf 4.4 (reverse) TCTTTGAGTTTGATCCAACC 
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919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 

919.2 (forward) ATCCTTCCGCCTCGGCTGCG 

919.3 (forward) AAAACAGCGGCACAATCGAC 

919.4 (forward) ATAAGGGCTACCTCAAACTC 

919.5 (forward) GCGCGTGGATTATTTTTGGG 

919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

919.7 (reverse) CCCAAGGTAATGTAGTGCCG 

919.8 (reverse) TAAAAAAAAGTTCGACAGGG 

919.9 (reverse) CCGTCCGCCTGTCGTCGCCC 

919.10 (reverse) TCGTTCCGGCGGGGTCGGGG 

All documents cited herein are incorporated by reference in their entireties. 
-The following Examples are presented to illustrate, not limit, the invention 

EXAMPLE 1 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 1: Oligonucleotides used for PCR for Examples 2-10 



ORF 


Primer 


Sequence 


Restriction sites 


279 


Forward 
Reverse 


CGCGGATCCCATATG-TTGCCTGCAATCACGATT 
<SEQID3021> 

CCCGCTCGAG-TTTAGAAGCGGGCGGCAA <SEO 
ID 3022> 


BamHI-Ndel 
Xhol 


519 


Forward 
Reverse 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 
<SEQ ID 3023> 

CCCGCTCGAG-TTTGGCGGTTTTGCTGC <SEQ ID 
3024> 


BamHI-Ndel 
Xhol 


576 


Forward 
Reverse 


CGCGGATCCCATATG-GCCGCCCCCGCATCT 
<SEQ ID 3025> 

CCCGCTCGAG-ATTTACTTTTTTGATGTCGAC 
<SEQ ID 3026> 


BamHI-Ndel 
Xhol 


919 


Forward 
Reverse 


CGCG G ATCCC ATATG-TG CC AAAG C AAG AGC ATC 
<SEQ ID 3027> 

CCCGCTCGAG-CGGGCGGTATTCGGG <SEO ID 
3028> 


BamHI-Ndel 
Xhol 


121 


Forward 


CGCGGATCCCATATG-GAAACACAGCTTTACAT 
<SEQ ID 3029> 


BamHI-Ndel 
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Reverse 


CCCGCTCGAG-ATAATAATATCCCGCGCCC <SEO 
ID 3030> 


Xhol 


128 


Forward 
Reverse 


CGCGGATCCCATATG-ACTGACAACGCACT <SEO 
ID 3031 > 

CCCGCTCGAG-GACCGCGTTGTCGAAA <SEQ ID 
3032> 


BamHI-Ndel 
Xhol 


206 


Forward 
Reverse 


CGCG G ATCCC ATATG -AAACAC CG C CAACC GA 
<SEQ ID 3033> 

CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC 
<SEQ ID 3034> 


BamHI-Ndel 
Xhol 


287 


Forward 
Reverse 


CCGGAATTCTAGCTAGC-CTTTCAGCCTGCGGG 
<SEQ ID 3035> 

CCCGCTCGAG-ATCCTGCTCTTTTTTGCC <SEQ ID 
3036> 


EcoRI-Nhel 
Xhol 


406 


Forward 
Reverse 


CG CGG ATCCCATATG -TG C G GGACACTG AC AG 
<SEQ ID 3037> 

CCCGCTCGAG-AGGTTGTCCTTGTCTATG <SEQ 
ID 3038> 


BamHI-Ndel 
Xhol 



Localization of the ORFs 

The following DNA and amino acid sequences are identified by titles of the following 
form: [g, m, or a] [#].[seq or pep], where "g" means a sequence from N. gonorrhoeae, W 
means a sequence from N. meningitidis B, and "a" means a sequence from N. meningitidis A; 
"#" means the number of the sequence; "seq" means a DNA sequence, and "pep" means an 
amino acid sequence. For example, "gOOl.seq" refers to an N. gonorrhoeae DNA sequence, 
number 1. The presence of the suffix "-1" to these sequences indicates an additional sequence 
found for the same ORF, thus, data for an ORF having both an unsuffixed and a suffixed 
sequence designation applies to both such designated sequences. Further, open reading 
frames are identified as ORF #, where "#" means the number of the ORF, corresponding to 
the number of the sequence which encodes the ORF, and the ORF designations may be 
suffixed with ".ng" or ".a", indicating that the ORF corresponds to a N. gonorrhoeae sequence 
or a N. meningitidis A sequence, respectively. The word "partial" before a sequence indicates 
that the sequence may be a partial or a complete ORF. Computer analysis was performed for 
the comparisons that follow between "g", "m", and "a" peptide sequences; and therein the 
"pep" suffix is implied where not expressly stated. Further, in the event of a conflict between 
the text immediately preceding and describing which sequences are being compared, and the 
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designated sequences being compared, the designated sequence controls and is the actual 
sequence being compared. 



ORF: contig: 

279 gnni4.seq 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3039>: 
ni279.seq 

1 ATAACGCGGA TTTGCGGCTG CTTGATTTCA ACGGTTTTCA GGGCTTCGGC 

51 AAGTTTGTCG GCGGCGGGTT TCATCAGGCT GCAATGGGAA GGTACGGACA 

101 CGGGCAGCGG CAGGGCGCGT TTGGCACCGG CTTCTTTGGC GGCAGCCATG 

151 GCGCGTCCGA CGGCGGCGGC GTTGCCTGCA AT CACGATTT GTCCGGGTGA 

201 GTTGAAGTTG ACGGCTTCGA CCACTTCGCT TTGGGCGGCT TCGGCACAAA 

251 TGGCTTTAAC CTGCTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA GGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT TCAATGCGCC GGCGGCAACG AGTG CGGTGT 

401 ATTCGCCGAG GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

451 TCTAAATAG 

This corresponds to the amino acid sequence <SEQ ID 3040; ORF 279>: 
m279.pep 

1 ITRICGCLIS TVFRASASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 

51 ARPTAAA LPA ITICPGELKL TASTTSLWAA SAQMALTCSS SKPRIAAIAP 

101 TPCGTADCIS SARRRTSLTA SAKFNAPAAT SAVYSPRLCP ATAAGVL PPA 

151 SK* 



The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 3041>: 

g279.seq 

1 atgacgcgga tttgcggctg cttgatttca acggttttga gtgtttcggc 

51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata 

101 ccggcagcgg cagggcgcgt ttggcrccgg cttctttggc ggcagccatg 

151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 

201 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 

251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct 

3 01 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag 

3 51 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt 
401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 

4 51 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 3042; ORF 279.ng>: 

g279.pep 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 

51 VR PTAAA LPA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP 

101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT 

151 SK* 



ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

m2 79 . pep ITRICGCLISTWRASASLSAAGFIRLOWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

= 11111111111= ^llllllllllllllllllllllllllllllllilhiMiiiiii 

g^ 79 MTRICGCLISTVLSVSASLSAAGFI RLQWEGTDTGSGRARLAPASLAAAI4VRPTAAALPA 

10 20 30 40 50 60 
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7 0 80 90 100 HO 120 

m279 . pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 

II lllllllllllll I llh IMIIIIh:||lllli!llllll||||||!|||| 

g279 I TTC PG ELKLTASTTS P CAD SAO I CLTCS S S KP KM AA I A PT PCGT AD CIS S ARRRTSLTA 

70 80 90 100 110 120 



130 140 150 

m2 7 9 . pep SAKFNAPAATS AVYS PRLCPATAAGVLPPAS KX 

III M MlllllillllllMIMIIhlll 

g2 7 9 SAKSNASAATS AVYS PRLC PATAAGVL PPTS KX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3043>: 

a2 7 9. seq 

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTMNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCGGGTT TCATGAGGCT GCAATGGGAA GGTACNGACA 

101 CNGGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

201 GTTGAAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 

251 TTTGTTTTAC CTGTTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

401 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

451 TCCGAATAG 

This corresponds to the amino acid sequence <SEQ ID 3044; ORF 279.a>: 

a279 .pep 

1 MTXICGCLIS TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAASI 

51 ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIAP 

101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 

151 SE* 



m279/a279 ORFs 279 and 279.a showed a 88.2% identity in 1 52 aa overlap 

10 20 30 40 50 60 

m279.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

: I M I I M M I I I I II : II I I I I I III I t I t II ; II I I I I:: | | || | | | | I 

a279 MTXICGCLISTVXRASASL3AAGFMRLQV7EGTDTGSGRARLAPASLAAS1 ARSTAAALPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m279.pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 

II lllllllllllll I III: r | t I J t I I I J i I 1 1 I 1 I | | f | | [ 1 | | t I I 

a279 ITTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADCIS SARXRTSLTA 

70 8C 90 100 110 120 

130 140 150 

m279 .pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX 
IN MINIMUM 1 I I I I I I I t I r I I I : I 
a2 7 9 SAKSNAPAAT S AVYSPXLC FAT AAGVLP PAS EX 

130 140 150 



519 and 519-1 gnm7.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3045>: 

m519.aeq (partial) 

1 . - TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA 

51 AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC GGGgCTTgGG 

101 GTGTGAAGGT TTTGCGTTAT GAGATTAAAG ACTTGQTTCC GCCGCAAGAA 

151 ATCCTTCGCT CAATGCAGGC GCAAATTACT GCCGAACGCG AAAAACGCGC 

201 CCGTATCGCC GAATCCGAAG GTCGTAAAAT CGAACAAATC AACCTTGCCA 
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251 GTGGTCAGCG CGAAGCCGAA ATCCAACAAT CCGAAGGCGA GGCTCAGGCT 

3 01 GCGGTCAATG CGTCAAATGC CGAGAAAATC GCCC3CATCA ACCGCGCCAA 
351 AGGTGAAGCG GAATCCTTGC GCCTTGTTGC CGAAGCCAAT GCCGAAGCCA 

4 01 TCCGTCAAAT TGCCGCCGCC CTTCAAACCC AAGGCGGTGC GGATGCGGTC 
4 51 AATCTGAAGA TTGCGGAACA ATACGTCGCT GCGTTCAACA ATCTTGCCAA 
501 AGAAAGCAAT ACGCTGATTA TGCCCGCCAA TGTTGCCGAC ATCGGCAGCC 
551 TGATTTCTGC CGGTATGAAA ATTATCGACA GCAGCAAAAC CGCCAAaTAA 



This corresponds to the amino acid sequence <SEQ ID 3046; ORF 519>: 

m519.pep (partial) 



1 . . SVIGRMELDK TFEERDEINS TWAALDEAA GAWGVKVLRY EIKDLVPPQE 

51 ILRSMQAQIT AEREKRARIA ESEGRKIEQI NLASGQREAE IQQSEGEAQA 

101 AVNASNAEKI ARINRAKGEA ESLRLVAEAN AEAIRQIAAA LQTQGGADAV 

151 NLKIAEQYVA AFNNLAKESN TLIMPANVAD IGSLISAGMK IIDSSKTAK* 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3047>: 

g519.seq 



1 atggaatttt tcattatctt gttggcagcc gtcgccgttt tcggcttcaa 

51 atcctttgtc gtcatccccc agcaggaagt ccacgttgtc gaaacgctcg 

101 ggcgtttcca tcgcgccctg acggccggtt tgaatatttt gattcccttt 

151 atcgaccgcg tcgcctaccg ccattcgctg aaagaaatcc ctttagacgt 

201 acccagccag gtctgcatca cgcgcgataa tacgcaattg actgttgacg 

251 gcatcatcta tttccaagta accgatccca aactcgcctc atacggttcg 

301 agcaactaca ttatggcaat tacccagctt gcccaaacga cgctgcgttc 

351 cgttatcggg cgtatggagt tggacaaaac gtttgaagaa cgcgacgaaa 

401 tcaacagtac cgtcgtctcc gccctcgatg aagccgccgg ggcttggggt 

451 gtgaaagtcc tccgttacga aatcaaggat ttggttccgc cgcaagaaat 

501 ccttcgcgca atgcaggcac aaattaccgc cgaacgcgaa aaacgcgccc 

551 gtattgccga atccgaaggc cgtaaaatcg aacaaatcaa cctfcgccagt 

601 ggtcagcgtg aagccgaaat ccaacaatcc gaaggcgagg ctcaggctgc 

651 ggtcaatgcg tccaatgccg agaaaatcgc ccgcatcaac cgcgccaaag 

701 gcgaagcgga atccctgcgc cttgttgccg aagccaatgc cgaagccaac 

751 cgtcaaattg ccgccgccct tcaaacccaa agcggggcgg atgcggtcaa 

801 tctgaagatt gcgggacaat acgttaccgc gttcaaaaat cttgccaaag 

851 aagacaatac gcggattaag cccgccaagg ttgccgaaat cgggaaccct 

901 aattttcggc ggcatgaaaa attttcgcca gaagcaaaaa cggccaaata 

951 a 



This corresponds to the amino acid sequence <SEQ ID 3048; ORF 5l9.ng>: 

g 5 19. pep 



1 MEFFIILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR uVAEANAEAN 

251 RQIAAALQTQ SGADAVNLKI AGQYVTAFKN LAKEDNTRIK PAKVAEIGNP 

301 NFRRHEKFSP EAKTAK* 



ORF 519 shows 87.5% identity over a 200 aa overlap with a predicted ORF (ORF 519.ng) 
from N. gonorrhoeae: 

m5l9/g519 



m519 .pep 



10 20 30 

SVIGRMELDKTFEERDE INS TWAALDEAA 



g519 



YFQVTDPKIASYGSSNYIMAITQLAOTTLRSVIGRMBLDKTFEERDEINSTVVSALDEAA 
90 100 110 120 130 140 



illinium iiimiiiihiiini 



m519 -pep 



40 50 60 70 80 90 

GAWGVKVLRY E I KDLVPPQE I LRSMQAQ ITAEREKRARI AESEGRKI EQ I NLASGQREAE 



WO 99/57280 



75 



PCT/US99/09346 



NMIIIIillllMlilllllhlll Mill IIMII III III IIIIIIIMMIIIM 

951? GAWGVKVLRYEIKDLVPPOEILRAMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
150 160 170 180 190 200 

100 110 120 130 140 150 

m5 1 9 . pep I QQS EGEAQ AAVNASNAE KI AR I NRAKGE AES LRLVAEANAEAIRQ I AAALQTQGGADA V 

I I I I I I I I I I I I ! I I I I I I I I I M I I I I I I I I I I I | | | | | M I I I I I I I I I I I : I I I I I 
g 5 1 9 I QQSEGEAQAAVNASNAE KI ARI NRAKGEAES LRLVAEANAEANRQI AAALQTOSGADAV 

210 220 230 240 250 260 

160 170 180 190 200 

m519 .pep NLKI AEQYV AAFNNLAKE SNTL I MP ANVAD I G S L - 1 SAGMKI I DS S KTAK 

Mill I I I : I I : I I I I I : I I I I I = I I = t | = h :|||| 

9 5 1 9 NLKI AGQYVTAFKNLAKEDNTRI KPAXVAEI GN PN FRRI IE KFS PE AKTAK 

270 280 290 300 310 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3049>: 

£519. seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

401 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3050; ORJF 519.a>: 
a519.pep 

1 MEFFIILLAA VWFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLA3 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

zn519/a519 ORFs 519 and 519. a showed a 99.5% identity in 199 aa overlap 

10 20 30 

m519. pep SVIGRMELDKTFEERDEINSTVVAALDEAA 

I I i I ! I I I I I M I I | | | | | | | | |: | | j | | | 
a519 YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRM2LDKTFEERDEINSTVVSALDEAA 
90 100 110 120 130 140 

40 50 60 70 80 90 

m5 1 9 . pep GAWG VKVLRYE I KDLVP PQE I LR SMQAQI T AEREKRARI AES E GRKIEQ1 NLASGQREAE 

M I ! I I I I t I I I I I I I I I I (II | || | || in | M || nun in i || i I'n i i 

a519 GAWGVKVLR YE I KDLVP PQE I LRSMQAQI TAEREKRARI AESEGRKIEQINLASGQREAE 

150 160 170 180 190 200 



100 113 120 130 140 150 
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m519.pep 



I QQS E G E AQAA VN A S N AE K I ARI N RAKG E AE S LR L VAE AN AE A I RQ I AAALQTQG G A D A V 
I I I i t I I I I i I I I I I I I I I I ! I i 1 I I I I I I t I I I I I i i I f I f| I I I I I 1 I I I t M I I i i I 
IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
210 220 230 240 250 260 



a519 



m519. pep 



a519 



160 170 180 190 200 

N LK I AEQ Y VAAFNN LAKE SNT L I M P AN V AD I G S L I S AGMK 1 1 DS S KT AKX 
I I ! I 1 M t I I I M I I I I I I I I M | | | | | | | I [ | | | | | | | | | | | | | | | || | 
NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 
270 280 290 300 310 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 305 1>: 

m519-l.seq 



1 ATGGAATTTT TCATTATCTT GTTGGTAGCC GTCGCCGTTT TCGGTTTCAA 

51 ATCCTTTGTT GTCATCCCAC AACAGGAAGT CCACGTTGTC GAAAGGCTGG 

101. GGCGTTTCCA TCGCGCCCTG ACGGcCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTT AT CGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

<01 TCAACAGTAC TGTTGTTGCG GCTTTGGACG AGGCGGCCGG GGCTTGGGGT 

451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC CGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

iOl GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCTGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 3052; ORF 519-1>: 

m519-l. 



1 MEFFIILLVA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWA ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS HQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAE IQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQ Y VAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



The following DNA sequence was identified in AT. gonorrhoeae <SEQ ID 3053>: 

g519-l.seq 



1 ATGGAATTTT TCATTATCTT GTTGGCAGCC GTCGCCGTTT TCGGCTTCAA 

51 ATCCTTTGTC GTCATCCCCC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGATAA TACGCAATTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGATCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCAAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG GGCTTGGGGT 

451 GTGAAAGTCC TCCGTTACGA AATCAAGGAT TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCGCA ATGCAGGCAC AAATTACCGC CGAACGCGAA AAACGCGCCC 

551 GTATTGCCGA ATCCGAAGGC CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGTG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCCAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GCGAAGCGGA ATCCCTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGGGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTAGCCGC GTTCAACAAT CTTGCCAAAG 
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851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GCATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3054; ORF 519-Lng>: 

g519-l.pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

20i GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



m519-l/g519-l ORFs 519-1 and 519-1. ng showed a 99.0% identity in 315 
overlap 

10 23 30 40 50 60 

g519-l.pep ME F FI I LLAA V A V FG FK S F</V I PQQE VH WE R LGR FH RALT AG LN I L I P F I DR VAYRHS L 
Nlltll[:llill!liilltitll||f|MIIIIII!||||||f|||||||||||||t| 
m5 1 9- 1 MEFFII LLVAVAVFGFKS FWI PQQE VH WERLGRFHRALTAGLN I LI P FI DRVAYRHS L 

10 20 30 40 50 60 

70 80 90 100 110 120 

g519-l . pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I | | | | ■ | | | I || | | | 

m519-l KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

70 80 90 100 110 120 

130 140 150 160 170 180 

g519-l . pep RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 

i IM INN IIMIrlllllllllMIIIII: IMIIIHIIII: IIIMIIIIl 

IT1519-1 RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 130 

190 200 210 220 230 240 

g519-l . pep KRARIAESEGRKIEQXNLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

III Ml lllllll I IIIIIIIIIIHIIIIIIIMIMI Ml IllliniiMIIIIII I 

m519-l KRAR I AESEGRKIEQINLASGQREAE I QQSEGEAQAAVNASNAEKIAR IN RAKGEAESLR 

190 200 210 220 230 240 

250 260 270 280 290 300 

g519-l.pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
l>i ID lllll II IIIIIIMMIIIIIIIIM IIIMIIII IIIIMII IIIIMMII 
m51 9- 1 LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 

250 260 270 280 290 300 

310 

g519-l.pep ISAGMKIIDSSKTAKX 
I I 1 I I I I I I I I I I I I I 
ra5 1 9 - 1 1 SAGMKI I DS S KTAKX 

310 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3055>: 

a519-l.seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 
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401 

451 

501- 

551 

601 

651 

701 

751 

801 

851 

901 



TCAACAGCAC 
GTGAAGGTTT 
CCTTCGCTCA 
GTATCGCCGA 
GGTCAGCGCG 
GGTCAATGCG 
GTGAAGCGGA 
CGTCAAATTG 
TCTGAAGATT 
AAAGCAATAC 
ATTTCTGCCG 



CGTCGTCTCC 
TGCGTTATGA 
ATGCAGGCGC 
ATCCGAAGGT 
AAGCCGAAAT 
TCAAATGCCG 
ATCCTTGCGC 
CCGCCGCCCT 
GCGGAACAAT 
GCTGATTATG 
GTATGAAAAT 



GCCCTCGATG 
GATTAAAGAC 
AAATTACTGC 
CGTAAAATCG 
CCAACAATCC 
AGAAAATCGC 
CTTGTTGCCG 
TCAAACCCAA 
ACGTCGCCGC 
CCCGCCAATG 
TATCGACAGC 



AAGCCGCCGG 
TTGGTTCCGC 
TGAACGCGAA 
AACAAATCAA 
GAAGGCGAGG 
CCGCATCAAC 
AAGCCAATGC 
GGCGGTGCGG 
GTTCAACAAT 
TTGCCGACAT 
AGCAAAACCG 



AGCTTGGGGT 
CGCAAGAAAT 
AAACGCGCCC 
CCTTGCCAGT 
CTCAGGCTGC 
CGCGCCAAAG 
CGAAGCCATC 
ATGCGGTCAA 
CTTGCCAAAG 
CGGCAGCCTG 
CCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 3056; ORF 519-l.a>: 

a519-l .pep. 

1 MEFFIILLAA WVFG FKSFV VIPOOEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK+ 



m519-l/a519-l 

overlap 



ORFs 519-1 and 51 9-1. a showed a 99.0% identity in 315 aa 



a519-l .pep 
m519-l 

a519-l .pep 
m519-l 

a519-l.pep 
n\519-l 

a519-l .pep 
m519-l 

a519-l.pep 
m519-l 

a519-l.pep 
m519-l 



10 20 30 40 50 60 

MEFFIILLAAWVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
■ I 1 I I I I I : I I : I I I i I I i I | | | | | i | 1 | | | | | | | | i j f | j | | | | | | | | | | | ,,, | , | , | 

ME FFI I LLVAVAVFG FKS FWI PQQEVH WERLGRFHRALTAGLN ILI P FI DRVAYRHSL 
10 20 30 40 50 60 

70 80 90 100 110 120 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
I I I i I I I I I M ! I M i f I I I I I I I I I I I I I | | I | | M ! | | j ( | | | | | | | | h | ( | | | | { | 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
70 80 90 100 110 120 

130 140 150 160 170 180 

RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
M M i I I 1 II I! I I II I I I : I I I I I I I I I I I I I | | | | | | | | | | | | j 1 | | | | | | | , | , , , | 
RME L DKT FEE R DE I N S T WAALD E AAG AWG VKVLR YE I KD L V P PQE I LRSMQAQ I T AE RE 

130 140 150 160 170 180 

190 200 210 220 230 240 

KRARIAESEGRKIEQINLASGQREAEIOOSEGEAQAAVNASNAEKIARINRAKGEAESLR 
I I I M t I i II I I I I I I I I I I I I | | | M I I I i II I I (i M | | | | | n I I I li I I I | I | | I | 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

190 200 210 220 230 240 

250 260 270 280 290 300 

LV7VEANAEAIRQIAAALQTDGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I I M I I I II I I I I I M I I I I I I I I I I I | | | | | | | | | t | | | | | | | | | | | | | | | | 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
250 260 270 280 290 300 

31C 

ISAGMKIIDSSKTAKX 
I I II I I I I I I I I I I i I 
I SAGMKI I DSSKTAKX 
310 
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576 and 576- 1 gnm22.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3057>: 



m576. seq. . 


(partial) 




1 


. . ATGCAGCAGG 


CAAGCTATGC 


51 


GCAAATGAAG 


GAACAGGGCG 


101 


CCATGCAGGC 


AGTGTATGAC 


151 


GCTCAGGAAG 


TCATGATGAA 


201 


AGAAAAACAC 


AAGGCGGACG 


251 


TTCTGAAAGA 


AAATGCCGCC 


201 


CTGCAATACA 


AAATCACCAA 


351 


CGACATCGTT 


ACCGTGGAAT 


401 


TCGACAGCAG 


CAAAGCCAAC 


451 


GTGATTCCGG 


GTTGGACCGA 


501 


AGCCACGTTC 


TACATCCCGT 


551 


GCGACAAAAT 


CGGTCCGAAC 


601 


AAAATCGGCG 


CACCCGAAAA 


651 


CATCAAAAAA 


GTAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 3058; ORF 576>: 

m576.pep. . {partial) 

1 . .MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 
51 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 
101 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 
151 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 
201 KIGAPENAPA KQPAQVDIKK VN + 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3059>: 

g576. seq. . (partial) 

1 ..atgggcgtgg acatcggacg ctccctgaaa caaatgaagg aacagggcgc 

51 ggaaatcgat ttgaaagtct ttaccgatgc catgcaggca gtgtatgacg 

101 gcaaagaaat caaaatgacc gaagagcagg cccaggaagt gatgatgaaa 

151 ttcctgcagg agcagcaggc taaagccgta gaaaaacaca aggcggatgc 

201 gaaggccaac aaagaaaaag gcgaagcctt cctgaaggaa aatgccgccg 

251 aagacggcgt gaagaccact gcttccggtc tgcagtacaa aatcaccaaa 

301 cagggtgaag gcaaacagcc gacaaaagac gacatcgtta ccgtggaata 

351 cgaaggccgc ctgattgacg gtaccgtatt cgacagcagc aaagccaacg 

401 gcggcccggc caccttccct ttgagccaag tgattccggg ttggaccgaa 

451 ggcgtacggc ttctgaaaga aggcggcgaa gccacgttct acatcccgtc 

501 caaccttgcc taccgcgaac agggtgcggg cgaaaaaatc ggtccgaacg 

551 ccactttggt atttgacgtg aaactggtca aaatcggcgc acccgaaaac 

601 gcgcccgcca agcagccgga tcaagtcgac atcaaaaaag taaattaa 

This corresponds to the amino acid sequence <SEQ ID 3060; ORF 576.ng>: 
g576. pap. . (partial) 

1 . .MGVDIGRSLK QMKEQGAEID LKVFTDAMQA VYDGKEIKMT EEQAQEVMMK 
51 FLQEQQAKAV EKHKADAKAN KEKGEAFLKE NAAEDGVKTT ASGLQYKITK 
101 QGEGKQPTKD DIVTVEYEGR LIDGTVFDSS KANGGPATFP LSQVIPGWTE 
151 GVRLLKEGGE ATFYIPSNLA YREQGAGEKI GPNATLVFDV KLVKIGAPEN 
201 APAKQPDQVD IKKVN* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m576/g576 ORFs 576 and 576. ng showed a 97.2% identity in 215 aa overlap 

10 20 30 40 50 60 

m576.pep MQOASYAMGVDIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMT'EEQAQEVMMKFLQ 

" I I I I I I I I I I I I I I I I | ! | | : | | | | | | | | | | | | , | , , ,| , , | , |, , , , 
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9^7 6 MGVDIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKE1KI-1TEEQAQEVMMKFLQ 

10 20 30 40 50 

70 80 90 100 110 120 

m57 6 . pep EQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIV 
ttlMIIII IIMIIIIMIIIIItMIII:||||||IIMIIII| II) IIHIIMIII 
957 6 EQQAKAVEKHKADAKANKEKGEAFLKENAAEDGVKTTASGLQYKITKQGEGKQPTKDDIV 
60 70 80 90 100 110 

130 140 150 160 170 180 

m57 6 . pep TVE YEGR LI DGTVFDSSKANGGPVT FPLSQV I PGWTEGVQLLKEGGEATFY I PSNLAYRE 

r 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 » i r 1 1 1 : i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 i i j 1 1 1 1 1 1 1 1 1 1 1 1 

g576 TVEYEGRLIDGTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYRE 
120 130 140 150 160 170 

190 200 210 220 

m57 6 . pep QGAGDKIGPNATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
MM:MfMMIIIIIMIIIIIII!|jIM! IliflilM 
g57 6 QGAGEK IG PN ATLVFDVKLVK I GAPENA PAKQPDQVDIKKVNX 

180 190 200 210 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 306 1>: 

a576.seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

401 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAAT CGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3062; ORF 576.a>: 

a576.pep 

1 MNTIPKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTS5IGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FT1SAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YI PSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

ra576/a57 6 ORFs 576 and 57 6. a showed a 99.5% identity in 222 aa overlap 

10 20 30 

m576.pep MQQASYAMGVDIGRSLKQMKEQGAEIDLKV 

I I M I I ! 1 1 I 1 1 I I I I II I M M I I I 1 1 I I 
a576 CGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGVDIGRSLKQMKEQGAEIDLKV 
30 40 50 60 70 80 

4 0 50 60 70 80 90 

m576 . pep FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQOAKAVEKHKADAKANKEKGEAFLKENAA 
• M I I I 1 1 I 1 1 I I I I I 1 1 1 I I M 1 1 1 I I I I I I I i I I | 1 1 1 1 1 1 1 M I M It I I I I I 1 1 ( I 

a576 FTEAMQAVYDGKEII<MTEEQAQEV>1MKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 
90 100 110 120 130 140 
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100 110 120 130 140 150 

m57 6 . pep KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFD3SKANGGPVTFPLSQ 

i mini iiiiiiiiiiuitiiiniii mi iiimii nun inmniii mi 

a576 KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVRYEGRLIDGTVFDSSKANGGPVTFPLSQ 
150 160 170 180 190 200 

160 170 180 190 200 210 

rn57 6 . pep VIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

>i mum I I I I I I f I I I I I I I I I I I f I I I I t I I I I I I I I I I I | 1 I | | | f | | 

a576 V I LGWTEGVQLLKEGGE ATFY I PSNLAYREQGAG OKI G PNATLVFDVKLVK I GAPEN APA 

210 220 230 240 250 260 

220 

m57 6 . pep KQPAQVDIKKVNX 
I I I I II I I I I I II 
a 57 6 KQPAQVDIKKVNX 
270 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3063>: 

m57 6-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 7CGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3064; ORF 576-1 >: 

m576-l .pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEI DLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLI D GTVFDSSKAN GGPVTFPLSQ 

201 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 K I GAPEN A PA KQPAQVDIKK VN* 

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3065>: 

g576-l.seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTGCCGCG CAGGGCGACA CCTCTTCAAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC AATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 ACAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGATG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCCCAGGAAG TGATGATGAA ATTCCTGCAG GAGCAGCAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGATG CGAAGGCCAA CAAAGAAAAA GGCGAAGCCT 

401 TCCTGAAGGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGT 

451 CTGCAGTACA AAATCACCAA ACAGGGTGAA GGCAAACAGC CGACAAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACCGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG CCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACGG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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7 01 CCGAAAAAAT CGGTCCGAAC GCCACTTTGG TATTTGACGT GAAACTGGTC 
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG ATCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3066; ORF 576-l.ng>: 

g576-l.pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASAA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTDAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPATFPLSQ 

201 VIPGWTEGVR LLKEGGEATF YIPSNLAYRE QGAGEKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPDQVDIKK VN+ 



g576-l/m576-l ORFs 576-1 and 57 6-l.ng showed a 97.8% identity in 272 aa 

overlap 

10 2D 30 40 50 60 

g576-l . pep MNT I FK I S ALTL S AALALS ACGKKEAAPASAS E PAAASAAQG DTS S I G S TMQQAS YAMGV 

' 1 1 i ■ 1 1 i 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 f j 1 1 1 1 1 1 1 1 1 

m576-l MNT I FKI S ALT LS AALALS ACGKKEAAP AS AS E PAAASSAQG DTS S IG STMQQAS YAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

g576-l . pep DIGRSLKQMKEQGAE I DLKVFTDAMQAV YDGKE I KMTEEQAQEVMMKFLQEQQAKAVEKH 

I I I I I M I M I I I I I I I I I I I I : I I I I I | M I M I I i I I I | | M I I I I I I I I I I I 

m57 6- 1 DIGRSLKQMKEQGAE I DLKVFTEAMQAVYDGKE I KMTEEQAQEVMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

g576-l . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
NMII llllllll IllllllHllllllllltlll III lllllllllllll I IIIIlM 

IT1576-1 KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
130 140 150 160 170 180 

190 200 210 220 230 240 

g576-l.pep GTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYREQGAGEKTGPN 

I I M I t I I I I I I I : I I I I M I | I | | | j | | : | | | | | | I I I I I t I I I I I I I : I I M | 

m576-l GTVFDSSKANGGPVT FPLSQV I PGWTEGVQLLKEGGEATFY I PSNLAYREQGAGDKIG PN 

190 200 210 220 230 240 

250 260 270 

g57 6-l .pep ATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
I f I I M I M I M f I I I I I I i I I I IIIMIIM 
m57 6-1 ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3067>: 

a57 6-1. seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3068; ORF 576-La>: 

a57 6-l .pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFUVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

a576-l/m576-l ORFs 576-1 and 576-1. a showed a 99.6% identity in 272 

overlap 



10 20 30 40 50 60 

a57 6- 1 . pep MNT I FKI SALTL5AALALSACGKKEAAPASAS E PAAAS SAQGDTSS I GSTMQQAS YAMGV 
I MIIIIIIMMI IIIIIIIIIMKMIt III IIIIIIMtli Ml ||j in ||f | M 
m576-l MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 

10 20 30 40 50 60 

7 0 80 90 100 110 120 

a57 6-1 . pep DIGRSLKQMKEQGAE I DLKVFTEAMQAVYDGKE IKMTEEQAQEVMMKFLQEQQAKAVEKH 

I I I I I H I I M I I I I I I I I I I I M I ! M I I I I I I i I M M I I I I I I II I I M I II 

m576-l DIGRSLKQMKEQGAE I DLKVFTEAMQAVYDGKE IKMTEEQAQEVMMKFLQEQQAKAVEKH 

70 B0 90 100 110 120 

130 140 150 160 170 180 

a576-l . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
IIMIIIIMIMI IMIM lllllllllllllllll IIIIIMIIIIIIIII IIIIMI 
m576-l KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
130 140 150 160 170 180 

190 200 210 220 230 240 

a576-l.pep GTVFDSSKANGGPVTFPLSQVILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

I N I M f M I I I I I I H I I II llllllllli IIIMIIIIIMIMIIIIIII 

m576-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

1^0 200 210 220 230 240 

250 260 270 

a576-l.pep ATLVFDVKLVK I GAFENAPAKQ PAQVD I KKVNX 
I It I I I 1 1 M 1 1 F M I I I I 1 1 1 I 1 1 I I I I I | | | 
m57 6-1 ATLV FDVK LVK IGA PEN APAKQPAQVD I KKVNX 

250 260 270 



919 gnm43.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3069>: 

tn919.seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TACGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTC TATACCGTTG TACCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCTTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAG CCTTG 

4 01 CCGGTACGGT TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGACAGG 
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4 51 CGGACGGCAC AAGCCOGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

501 CATACCGCCG ACCTCTCCcG ATTCCCCATC ACCGCGCGCA CAACAGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCTG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT A CATC CG CAT CGGCTATGCC GACAAAAACG 

851 AACATCCyTA CGTTTCCATC GGACGCTATA TGGCGGATAA GGGCTACCTC 

901 AAACTCGGAC AAACCTCCAT GCAGGGCATT AAGTCTTATA TGCGGCAAAA 

951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCCGCTGA TGGGGGAATA TGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGTGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTC CTACCCAACG 

1301 GTATGAAGCC CGAATAC CGc CCGTAA 



This corresponds to the amino acid sequence <SEQ ID 3070; ORF 919>: 
m919.pep 

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP OPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSRFPI TARTTAIKGR FSGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

3 01 KLGQTSMQGI KSYMRQMPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG 
351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAI KG 

4 01 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 307 1>: 

g919.seq 

1 ATGAAAAAAC ACCTGCTCCG CTCCGCCCTG TACGGcatCG CCGCCgccAT 

51 CctcgCCGCC TGCCAAAgca gGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG CCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TTGCCGGCGG CGGGGCCGTC TATACCGTTG TGCCGCACCT 

201 GTCCATGCCC CACTGGGCGG CGCaggATTT TGCCAAAAGC CTGCAATCCT 

2 51 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTGCAT TCCTTTCAGG CAAAGcGgTT 

351 TTTTGAACGC TATTTCACGC cgtGGCaggt tgcaggcaAC GGAAGcCTTG 

401 Caggtacggt TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACC-GCAGG 

4 51 CGGACGGAAC GGGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCG CTGCCTGCCG GTTTGCGGGG CGGAAAAAAC CTTGTCCGCA 

551 TCAGGCAGac ggGGAAAAAC AGCGGCACGA TCGACAATGC CGGCGGCACG 

601 CATACCGCCG ACCTCTCCCG ATTCCCCATC ACCGCGCGCA CAACGGcaat 

651 caaaGGCAGG TTTGAaggAA GCCGCTTCCT CCCTTACCAC ACGCGCAACC 

701 AAAtcaacGG CGGCgcgcTT GACGGCAAag cccCCATCCT CggttacgcC 

751 GAagaccCcG tcgaacttTT TTTCATGCAC AtccaaggCT CGGGCCGCCT 

801 GAAAACCCcg teeggcaaat acatCCGCAt cggaTacgcc gacAAAAACG 

851 AACAtcegTa tgtttccatc ggACGctaTA TGGCGGACAA AGGCTACCTC 

901 AAGctcgggc agACCTCGAT GCAGGgcatc aaagcCTATA TGCGGCAAAA 

951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC GGCAATGAGG GCCCCGTCGG CGCACTGGGC 

1051 ACGCCACTGA TGGGGGAATA CGCCGGCGCA ATCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CAGGCAGCGC GATCAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGTTAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATACGT CTGGCAGCTC CTGCCCAACG 

1301 GCATGAAGCC CGAATACCGC CCGTGA 



WO 99/57280 



85 



PCTAJS99/09346 



This corresponds to the amino acid sequence <SEQ YD 3072; ORF 919.ng>: 

g919.pep 

1 MKKHLLRSAL YGIAAAILAA CQSRSIQTFP QPDTSVINGP DRPAGIPDPA 

51 GTTVAGGGAV YTWPHLSMP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKRFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDGR 

151 RTERARFPIY GIPDDFISVP LPAGLRGGKN LVRIRQTGKN SGTIDNAGGT 

2 01 HTADLSRFPI TARTTAI KGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGOTSMQGI KAYMRQNPQR LAEVLGQNPS YIFFRELAGS GNEGPVGALG 

351 TPLMGEYAGA IDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK OKTTGYVWQL LPNGMKPEYR P* 



ORF 919 shows 95.9 % identity over a 441 aa overlap with a predicted ORF (ORF 919.ng) 
from N. gonorrhoeae: 

m919/g919 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAItiAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

llhhMIIIIMIIIII MhllMIIMI! || IMIIhl IUI ! III MINI 

99 19 MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
10 20 30 40 50 60 

70 80 90 100 110 120 

m919 . pep YTWPHLSLPHWAAQDFAKSIX^SFRIjGCANLKNRQGWODVCAQAFQTPVHSFQAKQFFSR 

MMIMhllllllllllllllllllllllllMIIIIMIIIIIIIIMIIIhllll 

g 9 1 9 YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGV7QDVCA0AF0TPVHS FQAKRFFER 

70 80 90 100 110 120 

130 140 150 160 170 180 

ra919 . pep Y FTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFP I YGr PDD FI S VPL PAGLRSGKA 

IIMIIIIIIIIIIIIMIillllllll Ml MIMMM1MMIIIMIIIIMI 

9919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPI YGI PDDFI SVPLPAGLRGGKN 

130 140 150 ISO 170 180 

190 200 210 220 230 240 

m9 1 9 . pep LVRIRQTGKN3GT I DNTGGTHTADLSRFPI TARTTAI KGR FEGS R FL P YHTRNQ INGGAL 

IN M MINI Mill HI III III IIIIIM I IIMII III IN II I II II I II MM I 

g919 LVRIRQTGKNSGTIENAGGTHTADLSRFPI TARTTAI KGRFEGSRFLP YHTRNQ INGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919 . pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYIj 

III M III Ml I II II III III Ml Ml II M MM III II Ml Mill I II IMIMI 

9919 DGKAPILGYA3DPVELFFMH I QGSGRLKTPSGKY I RIGYADKNEHPWS I GRYMADKGYL 

250 260 270 280 290 300 



310 320 330 340 350 360 

m919 . pep KLGQTSMQGI XS YMRQN PQRLAEVLGQNPS Y I FFRELAGSSNDGP VGALGTPLMGE YAGA 

Mill IMIM Ml II IMIM Ml III M MIMIMIMI Ml III M! M II Ml M 

g919 KI^QTSMQG I KAYWRQNPORLAEVLGQNPSYIFFRE LAG SG^IEGP VGALGTPLMGE YAGA 



310 



320 



330 



340 



350 



360 



370 380 390 400 410 420 

m9 19 . pep VDRHYITLGAPLFVATAHPVTRKAIJ^RLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

MIMIII III IMIIIMIMIM Ml IIIHIIIIIMIIIIIIIHMIlliMIM 

g9 1 9 IDRHYI TLGAPLFVATAHPVTRKALNRLIMAQDTGSAI KGAVRVD Y FWG YGD EAGELAG K 

370 38C 390 400 410 420 
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430 440 
m9 1 9 . pep Q KTTG YVWQLLPNGMKPE YRPX 

II II III 11! II I II I MINI 

g 9 1 9 QKTTGYVWQLLPNGMKPEYRPX 
430 440 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3073>: 

a919.seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TGCGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTT TATACCGTTG TGCCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCGTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

401 CCGGTACGGT TACCGGCTAT TACGAGCCGG TGCTGAAGGG CGACGACAGG 

451 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGA7TTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTG7CCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCCA ATTCCCCATC ACTGCGCGCA CAACGGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCCG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCCTA CGTTTCCATC GGACGCTATA TGGCGGACAA AGGCTACCTC 

901 AAGCTCGGGC AGACCTCGAT GCAGGGCATC AAAGCCTATA TGCAGCAAAA 

951 CCCGCAACGC CTCGCCGAAG TTTTGGGGCA AAACCCCAGC TATATCTTTT 

1001 TCCGAGAGCT TACCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCGGCTGA TGGGCGAGTA CGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTT CTGCCCAACG 

1301 GTATGAAGCC CGAATACCGC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 3074; ORF 919.a>: 

a919.pep 

1 MKKYLFRAAL CGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQS FRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SVQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSQFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMQQHPQR LAEVLGQNPS Y1FFRELTGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

m919/a9i9 ORFs 919 and 919.a showed a 98.6% identity in 441 aa overlap 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 

tiillMIM f I 1 I I I 1 I I I 1 I t I I I 1 1 I I t I f I 1 I I I I t I I I I I I I I I f i I I I 1 1 I I I 
a919 MKKYLFRAALCG I AAAI LAACQSKS IQT FPQP DTSV INGPDR PVG I PDPAGTTVGGGGAV 

10 • 20 30 40 50 60 

70 80 90 100 110 120 

m9 1 9 . pep YTWPHLSLPHWAAQDFAKSLQSFRLGCANLBCNRQGWQDVCAQAFQTPVHSFQAKQFFER 

'MM II M 1 1 I 1 1 i I 1 1 I I I l l 1 1 l 1 1 i 1 1 t | | | 1 1 1 1 | | i | | | i | i | | i 1 1 i 

a919 YTVVPHLSLPHWAAQDFAKSLQS FRLGCAN LKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m919 . pep YFTPWQVAGNGSLA3TVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
N I II I I I I N I I I I I I I I I I I I I I I I I I I | | | | | | | 1 | | | | M | || | 1 | | | | | | | | | | | 
a919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGTPDDFISVPLPAGLRSGKA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ra919.pep LVRIRQTGKNSGTIDNTGGTHTADLSKFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
IIIIIIIMMI IIIIIIIIIIIIM:!! IIMIIIMII III HI IIIMItlMI III 
a919 LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919.pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

I II IMilllll IIIIIMMIMII MMiMIIIIIIIIIIM III INI IMIill I 
a919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
250 260 270 280 290 300 

310 320 330 340 350 360 

m91 9. pep KLGQTSMQGI KS YMRQN PQRLAE VLGQN PS Y I FFRE LAGS SNDGPVGALGT PLMGE YAGA 

i ' 1 1 « 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 j i r 1 1 1 1 1 1 { 1 1 1 1 1 = 1 1 1 1 1 1 1 1 } 1 1 1 p 1 1 1 1 1 1 1 1 1 

a919 K LGQT SMQGIKAYMQQN PQRLAE VLGQNPSYIFFRELTGSSNDGPVGALG TP LMGE YAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m91 9 . pep VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
M I I ! I I I I I I I I I I I I I I I I I I I I I [ I M I I I I I I I I I I I t M I M M I I I I i [ I II I I 
a919 VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

370 383 390 400 410 420 

430 440 
m9 1 9 . pep QKTTG YVWQLLPNGMKPE YRPX 

I I t I I I I I I I I I I I I I I I I I I I 
a919 QKTTG YVWQLLPNGMKPE YRPX 

430 440 



121 and 121-1 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3075>: 

ml 21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

401 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

451 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA CAAAAACGGT GCAAAGTCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACGCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGACG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGnATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
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1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT ATTATTATTG 
1101 A 

This corresponds to the amino acid sequence <SEQ ID 3076; ORF 12 1>: 

ml 21. pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGO 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGG1RNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPKK 

351 ATGASKPCIL XAGYYY* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3077>: 

gl21.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

401 GCCGCGACCT TGCTGCCGGC GGa caAGGTG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 

501 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 

551 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

701 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc 

751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tgcggacgct 

801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3078: ORF 121. ng>: 

gl21.pop 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRWLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGEKR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 
from N. gonorrhoeae: 

ml21/gl21 

10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGACAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I M I I I I I I I I I I I I I I I : I I I I I I! I II I I I I I I I M I I I i : i I I I I I I I : | | I 
9121 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I • I I I I I M I I I I U I I I I I I I I I I I ! I I I I I I I I I I I I I I : I II I f I I I I I I I I I 
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gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 
7 0 80 90 100 HO 120 

130 140 150 160 170 180 

ml2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
I : : . 

gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLP^GA 
130 140 150 160 170 180 

1^0 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQL P YDKNGAKSAQGN I L PQLLDRLLAH P YFAQRH PKST 

: - I H I I ! I i I I : I I I I I I I I I I l|||||||:| Hill 

gl21 PAFGFDTGPGNMLMDAWTQAHWOLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
NIIII:INIIIIIII|{|!lliMM||i|||f I I | | : | | ; | I I I I I I I IMIIil 

9121 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml21 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLMCWINRIPGSPHKATGASKPCIL 
M I I I I I I I I I I I I I I I I I : I | | I | | .| | | | | I I I I I i { I | | | | ( I I I I I 1 I M I | | | | | 

gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 
310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3079>: 

al21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCC CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC SGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3080; ORF 121.a>: 

a!21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQI FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGG1 ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 
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301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWKAACW VNRIPGSPHK 
351 ATGASKPCIL GAGYYY* 



ml21/al21 ORFs 121 and 121. a showed a 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml 21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
M I I I I I I I II I I I I I I I I I I 1 1 I I I! I I I 1 1 I I I Ml I li I M I M : I I I M I 1 1 I 1 1 I 
al21 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ral21.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I H I : I i I t M 1 I I i I t i I I i II H I I I I I | | | | | | | | | ! f M | | || | : i | : | | | | , | | | 
a 121 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
al21 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGN1LPQLLDRLLAHPYFAQRHPKST 
: I I I M I I I i I : I M I I I I I I I I i : | | | I I | | | ||f|i 

al21 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPOLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21.pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I I l: I I I II I II I II I I 1 I I I III I Mil I I I I I M M I t IE I I I II I Mlilil 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21 . pep LMADLAECFGTRVS^HSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
M H I I I I I I 1 1 I 1 II I M : I I I I I I I M I I i I I : I I I I M I I M I ! II I II I I I i I I I 
al21 LMA DLAE CFGTRVSIjHS T AE LNL D PQW VE AAA FAWMAAC W VNR I PG S PH KATG A SK PC I L 

310 320 330 340 350 360 

ml21.pep XAGYYYX 
I M I I I 

al21 GAGYYYX 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 308 1>: 
ml21-l.seq 

1. ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GT ACT GAT AC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC 3ACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC 3GACAAGGCG CGCCACTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 
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851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3082; ORF 121-1>: 

irl21-l .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

3 51 ATGASKPCIL XAGYYY* 

ml21-l/gl21 ORFs 121-1 and 121. ng showed a 95.6% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
1 I I I M I I M I I H I I 1 M I I I : I I I I I M I M I I I I I I I I I II I I: I I I I III |: || | 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110. 120 

ml21-l.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGOTVRHAPEHGYSIQLADLPLL 
I I I I : I I I I I I I I I I I I M I I i I I I I I I I I I I I I I I I I M I i I I I I I I I I I I M | | | | | 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l.pep AERTRI FT VG D FRSRDLAAGGQG APLV PAFHEALFRDNRETRAVLN I GG I AN I SVLP PDA 
I I 1 I I I I I I I f I I f I I I I I I I 1 I I I I II I I I I 1 I I I : I 1 I I : I I I | | | | | | | | | | | | | 
gl 2 1 AELTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRVVLN IGG I ANI SVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 2 1-1 .pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

mm m miiiimmmiiMiiiiiiii 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l .pep GRE L FALNW LET YL DGGEN R Y D V LRT L S R FT AQT VC DA V S HAAADARQM Y I CG GG I RNP V 
I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I II I III I I I I I I I I | I | | | !|| Ml | 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep I^IADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
M Ml I III II! 1 I I i I I I : I I I I I I I I I I I I I I I I I I I I I I I I I | | I I I I I i I M I I I 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3083>: 

a!21-l.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 
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51 
101 
151 
201 
251 
301 
351* 
401 
451 
501 
551 
601 
651 
701 
751 
601 
£51 
SOI 
951 
1C01 
1C51 
1101 



GGCGGATGCC 
AAGGGCACGC 
GATTTGCAGG 
GCAAGAACTC 
GTCAAAACCT 
ACCGTCAGAC 
GCCGCTGCTG 
GCCGCGACCT 
CACGAAGCCC 
CGGCGGGATT 
GCTTCGACAC 
CACTGGCAGC 
CATATTGCCG 
AACCCCACCC 
GAAACCTACC 
TTCCCGATTC 
CAGATGCCCG 
TTAATGGCGG 
CACCGCCGAA 
CATGGATGGC 
GCAACCGGCG 
A 



GTACTGATAC 
CTTTACCCCC 
ACACAGGCGC 
AGCCGCCTGT 
CGCGCCGTCC 
ACGCGCCGGA 
GCGGAACGGA 
TGCGGCCGGC 
TGTTCCGCGA 
GCCAACATCA 
AGGACCGGGC 
TTCCTTACGA 
CAACTGCTCG 
T AAAAG C ACG 
TTGACGGCGG 
ACCGCGCAAA 
TCAAATGTAC 
ATTTGGCAGA 
CTGAACCTCG 
GGCGTGTTGG 
CATCCAAACC 



3GATGGACGG 
TACCCCGGCA 
GGACGAACTG 
ACGCGCAAAC 
GACATTACCG 
ACACAGTTAC 
CTCAGATTTT 
GGACAAGGCG 
CGACAGGGAA 
GCGTACTCCC 
AATATGCTGA 
CAAAAACGGT 
ACAGGCTGCT 
GGGCGCGAAC 
CGAAAACCGA 
CCGTTTTCGA 
ATT TGCGGCG 
ATGTTTCGGC 
ATCCGCAATG 
GTCAACCGCA 
GTGTATTCTG 



CGGCAAATGG 
GGTTACGCCG 
CACCGCAGCA 
CGCCGCCGAA 
CCCTCGGCTG 
AGCGTACAGC 
TACCGTCGGC 
CGCCGCTCGT 
ACACGCGCGG 
CCCCGACGCA 
TGGACGCGTG 
GC AAAGGCGG 
CGCCCACCCG 
TGTTTGCCCT 
TACGACGTAT 
CGCCGTCTCA 
GCGGCATCCG 
ACACGCGTTT 
GGTAGAAGCC 
TTCCCGGTAG 
GGCGCGGGAT 



CTGGGCGCGG 
CAAATTGCTG 
GGATGTTGTC 
CTGCTGTGCA 
CCACGGGCAA 
TTGCCGATTT 
GACTTCCGCA 
CCCCGCCTTT 
TACTGAACAT 
CCCGCCTTCG 
GATGCAGGCA 
CACAAGGCAA 
TATTTCGCAC 
AAATTGGCTC 
TGCGGACGCT 
CACGCAGCGG 
CAATCCTGTT 
CCCTGCACAG 
GCCGCGTTCG 
TCCGCACAAA 
ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 3084; ORF 121-1 .a>: 

al21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQT FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/al21-l ORFs 121-1 and 121-1. a showed a 96.4% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 f 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 r 1 1 1 

al21-l METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 
10 20 30 40 50 50 

70 80 90 100 110 120 

ml21-l.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQT^/RHAPEHGYSIQLADLPLL 
Ml 1:1 I I I I I II I II I li || | M | | | | | M It | | | || | | | | | | | | |:| |; || || | , | j 
al21-l HRSRMLSQELSRLYAQTAAELLCSQNLAPSD IT ALGCHGQTVRHAPEHSY SVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l . pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
tIM:! I I I I I I I I I! I II | || IM | | | | | | || || | }:| | | | | | | | | | || |t | | | | | || | 
a!21-l AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGI ANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l.pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGN I LPQLLDRLLAHPYFAQPHPKST 

1 1 1 1 1 1 1 1 1 1 1 1 1 m i Mini i r 1 1 f 1 1 1 1 f 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 r 1 1 1 1 1 

al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGN I LPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ral21-l . pep GRE LFALN WLET YLCGGENRYDVLRTLSRFTAQTVC DAVS HAAADARQMY ICG GG I RN PV 
IMMMMIIIIIIIIIIIMIIIIIMIIMII lllil I i I 1 1 I I M II I I I I I i I 
al21-l GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 
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310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
! I I M I I I I I I I I I I I I I I : | | | ) M I I I I I I I I : I I I I : I t I I I | | M I I I I I I I I I I 
al21 1MADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWN1AACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 35G 360 



ml 2 1 - 1 . pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



128 and 128-1 



ral28 . seq 


(partial) 




1 


ATGACTGACA 


ACGCACTGCT 


51 


AATCAAAACC 


GAAGACATCA 


101 


CGCGCGAACA 


AATCGCCGCC 


151 


AACACTGTCG 


AACCCCTGAC 


201 


GGGCGTGGTG 


TCGCACCTCA 


251 


CCGTCTATAA 


CGAACTGATG 


301 


GGACAAGACA 


TCGAGCTGTA 


351 


CGAATTCGAC 


ACCCTCTCCC 


1 


TACGCCAGCG 


AAAAACTGCG 


51 


wGTCAAAAAA 


TAyTTCCCyG 


101 


AAmTCAAAAA 


ACTmTACGGC 


151 


TGGCACAAAG 


ACGTGCGCTA 


201 


AGGCGGCGTT 


TATATGGATT 


251 


CGTGGATGAA 


CGACTACAAA 


301 


CAAyTGCCCA 


CCGCCTACCT 


351 


CAGGGAAGCC 


CGCyTGAGCC 


401 


CCGGACACGG 


GCTGCACCAC 


451 


TCCGGCATCA 


ACGGCGTAkA 


501 


TATGGAAAAT 


TTCGTTTGGG 


551 


ACGAAGAAAC 


CGGcgTTCCC 


601 


GCCGCCAAAA 


ACTTCCAAsG 


651 


CGCCCTCTTT 


GATATGATGA 


701 


AAAACTGGCA 


ACAGGTTTTA 


751 


CAGCCGCCCG 


AATACAACCG 


801 


AGGCGGCTAT 


TCCGCAGCTn 


851 


GCGCGGACGC 


ATACGCCGCC 


901 


GGCAAACGCT 


TTTGGCAGGA 


951 


nGCAGAATCC 


TTCAAAGCCT 


1001. 


TCTTGCGCCA 


CAGCGGTTTC 



CTGCTTACCC AAGTGGACGA ACTGGGCGTA 



This corresponds to the amino acid sequence <SEQ ID 3086; ORF 128>: 

ml 2 8. pep (partial) 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



// 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRPSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 

301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 
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The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 3087>: 

gl28.seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CG TATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

6 51 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGccaaa cTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 

1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 

1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 

1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 

1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 

1851 cAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 

1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 

1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ED 3088; ORF 128.ng>: 

gl28.pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTI KNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHBTGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
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651 AAESFKAFRG REPSIDALLR QSGFDNAA* 



ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORP (ORF 128.ng) 
from N. gonorrhoeae: 

ral28/gl28 

10 20 30 40 50 60 

gl28 . pep M I DNALLHLGE E PR FNQI QTEDI KPAVQTAI AEARGQ I AAV KAQTHTGWANTVE RLTG I T 

i IIIIMMIIII|:|hlMIII|:||||||[| IMhIIIMMIIMII ||||[ 
ml2 6 MTDNALLHLGEE PRFDQI KTEDIKPALQTAI AEAREQIAAI KAQTHTGWANTVE PLTG IT 



10 



20 



30 



40 



50 



60 



gl28 .pep 
ml28 

g!28 .pep 
ml28 

gl28 .pep 
ml 2 8 



70 80 90 100 110 120 

ERVGR I WG WSH LNS WDT P E LRAV YNE LM P E I TV FFT E I GQD I E L YNRF KT I KNS P E FA 

llllllllllllll hllMIMIIIIIIIIINIIIIlllllMiMiiiMIIIII 

ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 

130 140 150 160 170 180 

TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGACLSAKFSQNVLDATDAFGIY 

nun ii i hi 

TLS PAQKTKLNH 
130 



// 



340 350 360 

YAGEKLREAKYAFSETEVKKYFPVGKVLAG 

Ihlllllllllllll llllllll || I 

Y AS EKLRE AKYAF S E TX VKKY F P VGXVLNG 
10 20 30 



370 380 390 400 410 420 

gl28 . pep LFAQ I KKLYG IGFAEKTVPVWHKDVRYFELQQNGKT I GGVYMDLYAREGKRGGAWMNDYK 

I'll IIIIMIhllllllllllMI lllllh:MIIIIMIMIIIIIIMIII!l 

ml28 LFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK 
40 50 60 70 80 90 

430 440 450 460 470 480 

gl 28 . pep GRRRFADGTLQL PTAYLVCN FA P PVGGKEARL S HDE I LTL FHETGHGLHHLLTQVDEL3V 

i nihil ii mii ii 1 1 ii i ii nihil in i ii 1 1 inn mi iii iii iii ii i 

ml 2 8 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDE I LI LFHETGHGLHHLLTQVDELGV 



100 



110 



120 



130 



140 



150 



gl28.pep 
ml28 



490 500 510 520 530 540 

SGI NGVEWDAVE LPS Q FMEN FVWE YNVLAQMS AHEETGE PL P KEL FDKMLAAKNFQRGM F 

mill nniiiiiiiiniiiiiiin innii nun n mini in 

SGINGVXWDAVELPSQFTIENFVWEYNVLAQXSAHEETGVPLPKELXDKXlxAAKNFQXGMF 
160 170 180 190 200 210 



550 560 570 580 590 600 

gl28.pep LVRQME FALFDMMI YS ESDECRLKNWQQVLDSVRKEVAV I Q P PE YNRFANS FGHI FAGG Y 

in iiniMMiihii i i 1 1 1 r r 1 1 1 1 1 1 1 ^ 1 1 1 1 1 1 1 1 i 1 1 1 1 miiMiii 

ml28 XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIOPPEYNRFALSFGHIFAGGY 
220 230 240 250 260 270 

610 620 630 640 650 660 

g 1 2 8 . pep S AGY YS YAWAEVLSTDAYAAFEESDDVAATGKRFWQE I LAVGGS RSAAES FKAFRGRE *S 

in mmiiihiiii miinnniiiiii iihiiiinimiii 

ml2B SAAXYS YAWAEVLSADAYAAFEESDDVAATGKRFWQE I LAVGXSRSGAES FKAFRGREPS 

280 290 300 310 320 330 
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670 679 
g 1 2 8 . pep I DALLRQSGFDNAAX 

lllllhlllllh 

ml 2 8 I DAL LRH S G FDNAVX 

340 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3089>: 

al28.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACAC3CCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AG ACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

G51 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

SOI GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1C01 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1C5.1 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 C C AAAT C AAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA 3ACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC SCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3090; ORF 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAOSEGKTGY KIGLOIFHYL AVTQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 
501 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
551 AAESFKAFRG REPSIDALLR HSGFDNAA* 

128/al28 ORFs 128 and 128.a showed a 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREOIAAIKAQTHTGWANTVEPLTGIT 
Ml llll Mi II I Ml II II II llll IM III | III III llll III Ml IN I MM III 
al28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 

7 ° 80 90 100 110 120 

ir.128.pep ERVGRIWGV\^SHLNCVADTPELRAVYNELMPEITVFFTEIGODIELYNRFKTIKNSPEFD 
I I I I I I I I I I I I I I t r f | | I | | | - | | f | | | 1 | I E ( ] I I I I I I f | | | | 1 | | | | f j | | f | | 
al28 E RVGR I WG WS HLN S VT DTPE LRAA YNE LM PE I T V FFTE I GQD I EL YNRFKT I KN S PE FD 

70 80 90 100 110 120 

130 

ml 2 8 . pep TLS PAQKTKLNH 

I I I I I I I I I I I 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



ml28.pep 
a!28 



FDDAAP LAG I PE DAL AM FAAAAQSE GKTG YK I G LQ I PHY LAV I QY A DN RKLRE Q I Y RAY V 
190 200 210 220 230 240 



ml 28. pep 
al28 

ml28.pep 
al28 

m!28.pep 
al28 

ml28.pep 
al28 

ml28.pep 
al28 

ml28 .pep 
al28 



TRAS E L S D DGKFDNT AN I DRT LEN ALQT AK L LG FKN Y AE L S LATKMA DT PEQV LN FLH D L 
250 260 270 280 



290 



300 



140 150 
YASEKLREAKYAFSETXVKKYFPVGX 
I I : I I I I I I | | | I | i | I | i | I | { | 
ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLRZAKYAFSETEVKKYFPVGK 
310 3 20 330 340 350 360 

160 170 180 190 200 210 

VLNG L F AQX KK L YG 1 G FTE KT V P VW HKD VR YXELQQNGEX I G G V YMD L Y ARE GKRGG AWM 
I I M I I I I I I I I I II I I I ! II I I I I I | | I | | | | | | | | : | | | | | | || | || | | , | , , | , | 
V LNG L FAQ I KK L YG I G FTF.KT V PVW HK D VR Y FE LQQNG ET I GG V YMD L Y AREGKRGGAWM 
370 380 390 400 410 420 

220 230 240 250 260 270 

NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
'Hill IN M III Mill III ] M :HMI:l HI III Ml I i ! 11 I II II I I I I I i i 
N D YKG RRR FSDGTLQLP T A Y LVCN FT P P VGG KE ARLSH DE I LT L FHE TGHG LHH LLT Q V D 
43 0 440 450 460 470 480 

280 290 300 310 320 330 

ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
I I M I I I I I I I I I I I I | 1 | | | | | | | | | | | | | || || | | | | || | | | , , , || iiiiiii 

ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

340 3 50 360 370 380 390 

XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGH T F 
Ml Ml I I I I I I I I I I I M I I I I I I I | | | | | | | | | | : | | | | | | | j | | | ,{,,:, 

RGMFLVRQME FALFDMMIYS EDDEGRLKNW QQVLDSVRKEVAWRPPEYNRFAKfSFGH IF 
550 560 570 580 590 600 
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400 410 420 430 440 450 

ml28 . pep AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG 
IMIII: I I I I I 1 I I I II I II I If I I I I I I I I I || i | | | j | | | | | IMilltlllMi 
al28 AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
61 ° 620 630 640 65G 660 

460 470 
ml 2 8. pep RE PS I DALLRHSG FDNAVX " 

I I I I I I I I I I I I I I I I I : 
al28 REPSIDALLRHSGFDNAAX 
670 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 309 1>: 

ml28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 

651 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCAG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATA7GGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGAC7ACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

1401 AACCGGACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC 3ACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG 3CAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 

1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 3092; ORF 128-l>: 
ml28-l.pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVADTP ELRAVYNELM PEITVFFTEI 
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101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKKADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKN FQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAES FKAFRG REPSIDALLR HSGFDNAV* 



The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3093>: 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCG AAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCG 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

101 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

£51 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

SOI GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1C01 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG SCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGC GGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 
1301 . TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TGTCCGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ID 3094; ORF 128-Lng>: 
gl28-l.pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI ?E DALAM FAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 



ml28-l/g!28-l ORFs 128-1 and 128-1. ng showed a 94.5% identity in 4 
overlap 

10 20 30 40 50 60 
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gl28»l.pep midnaluilgeeprfnqiktedikpavqtaiaeargqiaavkaqthtgwantverltgit 
I M I I I I I I I I I I I : I I I I I I I I I I : I I I I I I I I I I I I : I M ! I I I ! I I i I i I I I I 
ml28-l MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAOTHTGWANTVEPLTGIT 
10 20 30 40 50 60 

70 60 90 100 110 120 

gl28-l.pep ERVGRIWGWSHLNSVVDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

I IIMIIIIIIII ll|:|||||t||| flMIIIIMII tlilitiliiiiiiiiiini 
ml28-l ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 140 150 160 170 160 

gl28-l .pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
M I I I I II I I : I I I I I I I t I I I I I I I |: ( I I | | | | | | | | | | | | | | | | | | | | | | | || | | | | 
ml28-l TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 

1^0 200 210 220 230 240 

gl28-l.pep FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYAGNRELREQIYRAYV 
N I I I I I I I I I I M M I I I M I I I I : I I I I I f I I I I I | | | | | | | | | | | | ( | | | | | | | | | 
IU128-1 FDDAAPLAGJ.PEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQIYRAYV 
190 200 210 220 230 240 

250 260 270 280 290 300 

gl28-l.pep TRASELSNDGKFDNTAN I DRTLENALKTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
IHHM:MIIIIIIIIIIII I 1 1 r 1 | F I 1 | r t I I I I I I f I J | i | f | i } ! [ | t I 1 I J I 
ir:128-l TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

310 320 330 340 350 360 

gl28-l.pep ARRAKPYAEKDLAEVKAFAREHLGLADPQPWDLSYAGEPCLREAKYAFSETSVKKYFPVGK 
I I I H Ml I I I I I I I I I I I I I CM) I f I I I r f 1 r J | | | | I | I ! r I I t I J I I f I I I r t 
ml28-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

370 380 390 400 410 420 

gl28-l . pep VLAGLFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWM 

N I I M I I I I I : i I I I H I I i I I I t I I I I I I I : I I I j II ! I I I I I I I i I j I | ! | 

ml28-l VLNGLFAQIKKLYGIGtTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

430 440 450 460 470 480 

gl28-l.pep NDYKGRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

Mllllill:IMMIMIilltm :!llllllll| I I I I I I I I I I I I I ! I I I 

ml 2 8- 1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFRETGHGLHHLLTQVD 

430 440 450 460 470 480 

490 

gl 2 8- 1 . pep ELGVSGINGVK 
I I I I I I I I I I: 

ml28-l eix;vsgingvewdavelpsqfmenfvweynvij^ 

490 500 510 520 530 540 

The following DNA sequence was identified in meningitidis <SEQ ID 3095>: 

a!28-l.seq 

1 atgactgaca acgcactgct ccatttgggc gaagaacccc gttttgatca 
51 aatcaaaacc gaagacatca aacccgccct gcaaaccgcc attgccgaag 
101 cgcgcgaaca aatcgccgcc atcaaagccc aaacgcacac cggctgggca 
151 aacactgtcg aacccctgac cggcatcacc gaacgcgtcg gcaggatttg 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

2 51 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
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501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

731 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

831 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTT3AG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA AT7TCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3096; ORF 128-1 .a>: 

al28-l .pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 



ml28-l/al28-l ORFs 128-1 and 128-l.a showed a 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

al28-l . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

MMilMIIIIMIMlilllllllllHIIIIIHIMIIMillllillll 

ml28-l MT DNALLHLGEE PRFDQI KTE DIKPALQTAI AEAREQI AAI KAQTHTGWANTVE PLTGTT 

10 20 30 40 50 60 

70 80 90 100 110 120 

al28-l.pep ERVGRIWGWSHLNS^/TDTPELRAAYNELMPEITVFFTEIGCDIELYNRFKTIKNSPEFD 

ItllCIIMliniillllMiMIIIM IIMIIIHII 
ml28-l ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 



130 140 150 160 170 180 



WO 99/57280 



102 



PCT/US99/09346 



al28-"_ .pep TLSIlAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSUNVLDATDAFGiy 
HI i I N I i t I I I I It I t I I I I | I | M I I I I I I I I I I I | M | | | ! | | 1 | | | M I I I H I 
ml28-l TLSPAQKTBCLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 

190 200 210 220 230 240 

al28-l.pep FDDAAPLAGI PEDALAMFAAAAQSEGKTGYKIGLQI PHYLAVIQYADNRKLREQI YRAYV 
' H I I I I I I I I I I | I | | i | | | | | | | : | | ! | | | | | | | | | | | | | | | | ( , | | : | , j | | | ,, | j 
ml 2 8-1 FDD AAPLAG I PE DALAM FAAAAQ SE SKTGYKIGLQIPHY LAV I Q Y ADNRE LRE Q I YRAYV 

190 200 210 220 230 240 

250 260 270 280 290 300 

a!28-l . pep TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
I I i I I I I t t I I I I I I II I I I I I I I I I I I I | I I | I | | | | | I | | M | | | |j | | | | | | || | , 
mI28-l TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

310 320 330 340 350 360 

al28-l . pep ARRAKPYAEKDLAEVKAFARESLGLADLOPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
I I I I J I I I I I I 1 I I I I I I t I I I I ^ I f I I I I I 1 I I 1 I : 1 f i I I I j I I I 1 I | ! f | 1 | | | | | | 
ml 2 8 - 1 ARRAKP YAEKDLAEVKAFARES LNLADLQPWDLGYASEKLRE AKYAFS ETEVKKY FP VG K 

310 320 330 340 350 360 

370 380 390 400 410 420 

al28- 1 . pep VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
iMIMIMIMIlllltlllllllMIMIIIIII 1 i I 1 1 I I f I I I 1 ! | | | | 1 | | | | | 
ml28-l VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 



al28-l.pep 
ral28-l 

al28-l .pep 
m!28-l 



430 440 450 460 470 430 

NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
I H M M I Hi I M I I i I I M I i I I : I I I I I : I f I | I I M II I I I I I I It [ I ! I I I I I I 
NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

490 500 510 520 530 540 

ELGVSGINGVEWDAVELPSQHWENFVWEYNVLAQMSAHEETGVPLPKELFDI^LAAKNFQ 
I I M I I I I I I I I I I I I I I | I | | | | || | | | | | M | | | | | | | | | | | | | | | | | | | | | | | | | j , 

ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 



550 560 570 580 590 600 

al28-l . pep RGMFLVRQME FALFDMMI YSEDDEGRLKNWQQVLDS VRKEVAWRPPEYNR FAN S FGH I F 
M I I I E I I ! I I I I I I I ! I I I I I I I | I | | | | t I I I I I I I r r I 1 | r : I | f | J | [ I ||||i| 
ml28-l RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 

550 560 570 580 590 600 

610 620 630 640 650 660 

al28-l.pep AGG YS AG Y Y S YAW AEV L S AD AYAA FEE S DD V AAT GKR FWQE I LAVGG S R S AAE S FKA FRG 

> 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 a 

ml28-l AGG YS AG Y Y S Y AW AEVL S ADA YAA FEE S D D VAAT GKR FW QE I LAVGG S R S AAE S FKA FRG 

610 620 630 640 650 660 



aI28-l .pep 
m!28-l 



670 679 
RE P S I DALLRH SG FDNAAX 
I I I I I I I I I I II I I I I I : 
RE P S I DALLRH SG FDN A VX 

670 



206 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3097>: 

m206.seq 
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1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGC AG CGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAAC CGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3098; ORF 206>: 

m206 -pep. . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTI KTEKIiS TPFYAKNYLG AHTFFTE* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3099>: 

g206.seq 

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggastca tcggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggcttcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

3 51 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 
401 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 

4 51 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 
501 ctaccttgga gcgcatacgt tttttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 3100; ORF 206.ng>: 

g206.pep 

1 MFSPDKTLFL CLGALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIGRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 

101 ARDMAAASRK I PDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 



ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from K gonorrhoeae: 

m206/g206 

10 20 30 40 50 60 

MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMljHS 

M llllllllhMIIIIIIIIIMIIIIMIIIIIIIMIIIIII I { r 1 1 1 1 1 1 1 1 1 

MFSPDKTLFLCLGALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIGRTQGSQELMLHS 
10 20 30 40 50 60 

70 80 90 100 110 120 

LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 

IIMI!IIIIIIIIIIIIIIIIMI|:||||||I!IIIIIIIIIIIIIIIMMI Mil 

LGLIGTPYKWGGSSTATGFDCSGMIQLVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 
70 80 90 100 110 120 

130 i40 150 160 170 

LVFFNTGGAHRYSH\'GLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
: I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I | | | | 
IVFFNTGGAHRYSH\'GLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE 



m206 .pep 
g206 

m206 .pep 
g206 

m206 .pep 
g206 
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130 140 150 160 170 

The following partial DNA sequence was identified in JV. meningitidis <SEQ ID 3101>: 

a206. seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TC AAAACCGA . AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3102; ORF 206.a>: 

a206.pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK I PDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 ORFs 206 and 206.a showed a 99.4% identity in 1 77 aa overlap 

10 20 30 40 50. 60 

m206 . pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRTSHIDRTQGSQELMLHS 
MilMI III III IMIIIIMIIIMIIIIII IIIIMI IIIMIIMI KM Ml III 

a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 
10 20 30 40 50 60 

70 80 90 100 110 120 

ra206.pep LGLI GT P YKWGG S ST ATG F DC SGM I QFV YKN ALN VKLPRTARDMAAAS RKI P DS RXKAG D 

MIMMIlitll I I I I I M I I I I I | M | | | | f | | | | | | | | | || | | | | | J mi 

a206 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKI PDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

ra206.pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

1 1 1 1 1 1 1 1 n i ii n i ii 1 1 1 1 1 1 (i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 ii i in 1 1 1 1 n 1 1 1 1 1 

a206 LVFFNTGGAHRYSHVGLY IGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

130 140 150 160 170 



287 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3103>: 

m287.seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGGCAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

201 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGGATG 

401 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 

4 51 GACGGAATGC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA ATGCCGGCAA 

501 TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 

551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGOGAA TGGCGGTAGC 
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601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC 

651 GCAAAATATA ACGTTGACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA 

701 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT 

751 GATGCAGACA AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT 

801 TGTCGGTTTG G7TGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA 

851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT 

901 GCACGGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA 

951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC 

1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC 

1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTCAAGGCGA 

1101 ACCGGCAAAA GGCGAAATGC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG 

1151 TACTGCATTT CCATACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG 

1201 TTTGCCGCAA AAGTCGATTT CGGCAGCAAA TCTGTGGACG GCAT7ATCGA 

1251 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAATTCAAA GCCGCCATCG 

1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT 

1351 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG 

1401 CTATCGCCCG ACAGATGCGG AAAAGGGCGG ATTCGGCGTG TTTGCCGGCA 

14 51 AAAAAGAGCA GGATTGA 

This corresponds to the amino acid sequence <SEQ ID 3104; ORF 287>: 

m287.pep 

1 MFKRSVIAMA CI FALSA CGG GGGGSPDVKS ADTLSKPAAP WSEKETEAX 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

401 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 

4 51 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3105>: 

g287.seq 

1 atgtttaaac gcagtgtgat tgcaatggct tgtatttttc ccctttcagc 

51 ctgtgggggc ggcggtggcg gatcgcccga tgtcaagtcg gcggacacgc 

101 cgtcaaaacc ggccgccccc gttgttgctg aaaatgccgg ggaaggggtg 

151 ctgccgaaag aaaagaaaga tgaggaggca gcgggcggtg cgccgcaagc 

201 cgatacgcag gacgcaaccg ccggagaagg cagccaagat atggcggcag 

251 tttcggcaga aaatacaggc aatggcggtg cggcaacaac ggacaacccc 

301 aaaaatgaag acgcgggggc gcaaaatgat atgccgcaaa atgccgccga 

351 atccgcaaat caaacaggga acaaccaacc cgccggttct tcagattccg 

4 01 cccccgcgtc aaaccctgcc cctgcgaatg gcggtagcga ttttggaagg 

4 51 acgaacgtgg gcaattctgt tgtgattgac ggaccg-cgc aaaatataac 

501 gttgacccac tgtaaaggcg attcttgtaa tggtga.aat ttattggatg 

551 aagaagcacc gtcaaaatca gaatttgaaa aattaagtga tgaagaaaaa 

601 attaagcgat ataaaaaaga cgagcaacgg gagaat~ttg tcggtttggt 

651 tgctgacagg gtaaaaaagg atggaactaa caaata-atc atcttctata 

7 01 cggacaaacc acctactcgt tctgcacggt cgaggaggtc gcttccggcc 

751 gagattccgc tgattcccgt caatcaggcc gatacgctga ttgtggatgg 

801 ggaagcggtc agcctgacgg ggcattccgg caatatcttc gcgcccgaag 

851 ggaattaccg gtatctgact cacggggcgg aaaaattgcc cggcggatcg 

901 tatgccctcc gtgtgcaagg cgaaccggca aaaggcgaaa tgcttgttgg 

S51 cacggccgtg tacaacggcg aagtgctgca tttccatatg gaaaacggcc 

1001 gtccgtaccc gtccggaggc aggtttgccg caaaagtcga tttcggcagc 

1051 aaatctgtgg acggcattat cgacagcggc gatgatttgc atatgggtac 

1101 gcaaaaattc aaagccgcca tcgatggaaa cggctttaag gggacttgga 

1151 cggaaaatgg cggcggggat gtttccggaa ggttttacgg cccggccggc 

1201 gaggaagtgg cgggaaaata cagctatcgc ccgacagatg ctgaaaaggg 

1251 cggattcggc gtgtttgccg gcaaaaaaga tcgggattga 

This corresponds to the amino acid sequence <SEQ ID 3106; ORF 287.ng>: 
g287.pep 

1 MFKRSVIAMA CIFPLSA CGG GGGGSPDVKS ADTPSKPAAP VVAENAGEGV 
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51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP 

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR 

151 TNVGNSWID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLSDEEK 

201 IKRYKKDEQR ENFVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA 

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS 

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS 

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG 

4 01 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD* 



n287/g287 ORFs 287 and 287.ng showed a 70.1% identity in 499 aa overlap 



10 20 30 40 49 

m287 . pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSE KETEA 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I | I | | | : | | : || 

g287 MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPWAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 

Nil :l I :::lllltllll i I I I I I I I : I : I I I I I I I [HIM Mil 
g287 AGGAPQADTQD — ATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAA — 

70 80 90 100 110 

110 120 130 140 150 160 169 

ra2 87 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 

g287 



170 180 130 200 210 220 229 

m287 . pep AQG ANQAGNNQAAG S S D P I PAS N PA PAN G G S N FGR V DLANG V L I DG PS QN I T LTHC KG D S 

= =111 = 1111 I M I I M 1 1 I I II M I I : I I I :::: I : | : | | | | H | | 1 1 | | | | | | | 
g287 -ESANQTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDS 
120 130 140 150 160 170 

230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

MINIM MM I ! I I : : : I I I I I I | | : I MMMI 
g287 CNGDNLLDEE1APSKSE FEKLSDEEKIKRYKKDEQRENFVGLVADR VKKDGTNKYI I FYTD 

180 190 200 210 220 230 

290 300 310 320 330 340 349 

m287.pep KPTSFARFRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

II : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

g287 KPPT RSARSRRSLPAEIPLI PVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

240 250 260 270 280 290 

350 360 370 380 390 400 409 

m287 . pep YGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFI1TENGRPYPTRGRFAAKVDFGS 

I II I I I I M I II M II II II : I: II I II II I I II M I I II I : I I 1 I I I II I I I 

g287 YGAEKLPGGSYALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS 
300 310 320 330 340 350 

410 420 430 440 450 460 469 

m287.pep KSVDGIIDSGDDLHNGTQKFKAAIDGNG FKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 

I IN I I II II I II I I I I I I I I I I I I I I I I I I | | | II Ml | | N: I I I I I I I I I I I II I I I 
g287 KSVDGriDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 
360 370 380 390 400 410 



m287 .pep 



470 480 489 

PTDAEKGGFGVFAGKKEQDX 
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g287 



I I I I I I i I I I I I I I I I :: I I 
PTDAEKGGFGVFAGXKDRDX 
420 430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3107>: 



a287. seq 




1 


ATGTTTAAAC 


51 


CTGTGGGGGC 


101 


TGTCAAAACC 


151 


CTGCCGAAAG 


201 


CGATACGCAG 


251 


TTTCGGCAGA 


301 


GAAAATAAAG 


351 


TACAGATAGT 


401 


GAGATATGGG 


<51 


AACCAACCGG 


501 


GTCGGCAGGG 


551 


CTGAAAACAA 


601 


CCTAACGCCA 


651 


TGG CATCAAG 


701 


AAGACAAAGT 


751 


TCAGAATTTG 


801 


AGACGAGCAA 


651 


AGAATGGAAC 


901 


TCTTCATCTG 


S51 


GGCCGAGATG 


1G01 


ATGGGGAAGC 


1C51 


GAAGGGAATT 


1101 


ATCGTATGCC 


1151 


CGGGCACGGC 


1201 


GGCCGTCCGT 


1251 


CAGCAAATCT 


1301 


GTACGCAAAA 


1351 


TGGACGGAAA 


1401 


CGGCGAAGAA 


1451 


AGGGCGGATT 



' TGCAATGGCT 
; GATCGCCCGA 
' GTTGTTACTG 
i TGAGGAGGCG 
i CCGGAAAAGG 
: AATGGCGGTG 
: GCAAAATGAT 
i ATCACACCCC 
> .CCGGATGCCG 
i TGCGGCGGAC 
I GCAATACGGC 
: GGCTCTCAAA 
; CAGCGATTTT 
; GTTCGGAAAA 
. GATTTCTTAG 
1 TGATGAAGAA 
TTGTCGGTTT 
GTCATCATTT 
GCGTTCTGCA 
CCGTCAATCA 
ACGGGGCATT 
GACTTACGGG 
AAGGCGAACC 
GGCGAAGTGC 
AGGCAGGTTT 
TTATCGACAG 
GTTATCGATG 
GGATGTTTCC 
AATACAGCTA 
GCCGGCAAAA 



TGTATTGTTG 
TGTTAAGTCG 
AAGATGTCGG 
GTGAGTGGTG 
CGGTCAAGAT 
CGGCAACAAC 
ATGCCGCAAA 
TGCACCGAAT 
GGGAATCGGC 
GGAATGCAGG 
AGATCAAGCT 
ATCCTGCCTC 
GGAAGGATAA 
TGTAACGTTG 
ATGAAGAAGC 
AAAATTAATA 
GGTTGCTGAC 
ATAAAGACAA 
CGGTCGAGGC 
GGCGGATACG 
CCGGCAATAT 
GCGGAAAAAT 
GGCAAAAGGC 
TGCATTTCCA 
GCCGCAAAAG 
CGGCGATGAT 
GAAACGGCTT 
GGAAGGTTTT 
TCGCCCGACA 
AAGAGCAGGA 



CCCTTTCAGC 
GCGGACACGC 
GGAAGAGGTG 
CGCCGCAAGC 
ATGGCGGCAG 
GGATAATCCC 
ATGCCGCCGA 
ATGCCAACCA 
ACAACCGGCA 
GGGACGATCC 
GCAAATCAAG 
TTCAACCAAT 
ATGTAGCTAA 
ACACATTGTA 
ACCACCAAAA 
AATATAAAAA 
AGGGTAGAAA 
GTCCGCTTCA 
GGTCGCTTCC 
CTGATTGTCG 
CTTCGCGCCG 
TGTCCGGCGG 
GAAATGCTTG 
TATGGAAAAC 
TCGATTTCGG 
TTGCATATGG 
TAAGGGGACT 
ACGGCCCGGC 
GATGCGGAAA 
TTGA 



This corresponds to the amino acid sequence <SEQ ED 3108; ORF 287.a>: 

a287 .pep 

1 MFKRSVIAMA CIVALSA CGG GGGGSPDVKS ADTLSKPAAP WTEDVGEEV 

51 LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 

101 ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 

151 NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNPASSTN 

201 PNATNGGSDF GRINVAKGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 

251 SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKUKSAS 

301 SS3ARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 

351 EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN 

401 GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VTDGNGFKGT 

451 WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 

m287/a287 ORFs 287 and 287. a showed a 77.2% identity in 501 aa overlap 



m287 .pep 
a287 

m287.pep 
a287 



10 20 30 40 49 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

IIMIUMHI I I f 1 I t I I M I M I J t I E I I M I I M I I 1 r | |- || 

MFKRS VI AMAC I VALSACGGGGGGSPDVKS ADTLSKPAAP WTEDVGEEVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 

11,1 : 1 I = : : I : I 1 M I I I I I : I : I It : I : I I I I I I I I i i I | 

VSGAPQADTQ— DATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADT 
70 80 90 100 110 
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HO 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSOPANQPDMANAADGMQGDDPSAGGQNAGNTA 
MINIMI III : : I III N I I I : I It I II I I ! II I N I I i I I 1 I I I : II I I I I 
a287 DSSTPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAG-ENAGNTA 
120 130 140 150 160 170 

170 180 190 200 210 220 229 

IT287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 

NINI M I = : I I : : i ::|| :| I II : I || : : : | | | : :|: |:|:||}||| 
a287 DQAANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKV 
180 190 200 210 220 230 

230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

N MUM: II I I f I I i f MMMNI : ::||||||l I: M I : I : I : i I 
a287 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKD 
240 250 260 270 280 290 

290 300 310 320 330 340 

m287 . pep KP--TSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
i M M I I I I I I II I I I I II I I I I I M I I I I II I I I M I I It I I I I I I I I | | | | | | | 
a287 KSASSSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
300 310 320 330 340 350 

350 360 370 380 390 400 

m2 8 7 . pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

MINIM MINI M I I I M I I J I M r I I | I I | j | | | | ||||| 1: MINIMI 
a287 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
360 370 380 390 400 410 



410 420 430 440 450 460 

m287.pep GSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS 

MM I I Milt! II I 111111111:11 M II III I I! ||:|| !ll:| I II I I II I li II I 
a287 GSKSVDGIIDSG D DLHMGT QK FKAV I DGNG FKGTW T ENG G GDV SGRFYGPAGEE V AG KY S 

420 430 440 450 460 470 

470 480 489 

m287 .pep YRPTDAEKGGFGVFAGKKEQDX 

Ml MM I I III II III I II II 
a287 YRPTDAEKGGFGVFAGKKEQDX 
480 490 



406 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3109>: 

m4 06.seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TTGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 CACTATGGCC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCAOGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CTCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 

601 ATCGACGTAT TCG GAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
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701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACGGAAG GATTAATGGT CGATTTCTCC GATATCCGAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATC CGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GTAGTGCGAC AACATAGACA 

951 AGGACAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 31 10; ORF 406>: 

m406 .pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TBMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIRPYGNHTG MSAPSVEADN 

301 SHEGYGYSDE WRQHRQGOP * 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3 1 1 1>: 

g406.seq 

1 ATG CGGGCAC GG CTG CTGAT ACCTATTCTT TTTT CAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGCAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGG CATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGCGGC GAATACATAA ACAGCCCTGC CGTCCGCACC 

301 GATTACACCT ATCCGCGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACGGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA GGAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CCAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTGCA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCCA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATCAAA CCGACGGAAG GATTGATGGT CGATTTCTCC GATATCCAAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC AACATAGACA 

951 AGGGCAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 31 12; ORF 406>: 

g406.pep 

1 MRARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSRSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHTG NSAPSVEADN 

301 SHEGYGYSDE AVRQHRQGQP * 



ORF 406 shows 98.8% identity over a 320 aa overlap with a predicted ORF (ORF406.a) from 
N. gonorrhoeae: 

g406/n406 

10 20 30 40 50 60 

9406 .pep MRARLLI PI LFSVFILSACGTLTGI PSHGGGKRFAVEQELVAAS ARAAVKDMDLQALHGR 

I^MIIIIIIIIIIIIIIIIIIIIIIIIIlllllllliliiiiiiiiiiiiiiiiiiiH 

^4 0 6 MQARLLI P I LFSVFI LS ACGTLTG I PSHGGGKRFAVEQELVAAS ARAAVKDMDLQALHGR 
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10 20 30 40 50 60 

70 80 90 100 110 120 

g406 .pep KVALYIATMGDQGSGSLTGGRYSrDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

I MM IIMM III II IIIMIM IN | MM! MM MIMII! IIMMIIIIMMI 

m4 06 KVALYI ATMGDQGSGS LTGGR YS I DAL I RGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 



130 140 150 160 170 180 

g406 . pep LTTSLSTLNAPALSRTQSDGSGSRSSLGLNIGGMGDYl^ETLTTNPROTAFLSHLVQTVF 
I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I II | | | | | | | | | | 
m406 LTTSLSTLNAPALSRTQSIX^GSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 



190 200 210 220 230 240 

g406 . pep FLRGI DWSPANADTDVFINIDVFGTI RNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 I M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

m406 FLRGI DWS PANADTDVF INI DVFGT I RNRTEMHLYNAETLKAQTKLE YFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

g406 .pep I KPKTNAFEAAYKENYALWMG P YKVS KG I KPTEGLMVDFS D I QP YGNHTGNSAPS VEADN 

Ml Mil Ml IMIM IIMMIIM II Mill IIIIIIMMI II IMIIIIIIIMII 

m406 I KPKTNAFEAAYKENYALWMG P YKVS KG I KPTEGLMVD FSD IRP YGNHTGNSAPS VEADN 

250 260 270 280 290 300 



310 320 
g4 0 6 . pep SHEGYGYSDEAVRQHRQGQPX 
III I IIIIIMII III Hill 
m4 0 6 SHEGYGYSDE WRQHRQGQPX 

310 320 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 13>: 

a406.seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT 7TATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACGGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

7 51 GCCTATAAAG AAAATTACGC ATTGTGGATG GGACCG1ATA AAGTAAGCAA 

801 AGGAATTAAA CCGACAGAAG GATTAATGGT CGATTTCTCC GATATCCAAC 

851 CATACGGCAA TCATATGGGT AACTCTGCCC CATCCGTAGA GGCTGATAAC 

931 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC GACATAGACA 

951 AGGGCAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 31 14; ORF 406.a>: 

a406.pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGI DWS P ANADTDVFIN 

201 IDVPGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 
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251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHMG NSAPSVEACN 
301 SHEGYGYSDE AVRRHRQGQP * 

m406/a406 ORFs 406 and 406. a showed a 98.8% identity in 320 aa overlap 

10 20 30 40 50 60 

ra4 06 . pep MQARLLI PILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAAS71RAAVKDMDLOALHGR 

I I I I > I I I I t t I I 1 1 Tl I 1 1 I I! I I I Ml I 1 1 I I I I I M I 1 1 I 1 1 1 I I M I i 1 1 I I I I I I 
a 4 0 6 MQARLLI PI LFS VFI LSACGTLTGI PSHGGGKRFAVEQELVAAS ARAAVKDMDLQALHGR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m4 0 6 . pep KVALY IATMGDQGSGSLTGGRYS I DALI RGE Y IN S PAVRTDYT Y PRYETTAETTSGGLTG 

lllillilMMMIt MIIIMIIIIII Itlilllllll lit MMIi II I Mill III 
a 4 0 6 KVALY IATMGDQGSGSLTGGRYS I DALIRGE Y IN S PAVRTDYT Y PRYETTAETT SGGLTG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m406.pep LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

Hltlllilllllllllll I II I I I I I I 1 I I I I I I I I | | | | t f 1 I I 1 

a406 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 
130 140 150 160 170 180 

190 200 210 220 230 240 

m4 0 6 . pep FLR G I D V V S P AN ADT D V F IN I DV FGT I RNRT EMHL YN AET LKAQTKLE Y FAV DRTN KKLL 

I t I i 1 I i I 1 1 1 I I 1 1 I 1 1 I I f I 1 I 1 1 f | | 1 1 | | | | ] | 1 1 t i I I I ! I 1 1 I I I ] t f | 1 1 | | ! 
a406 FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m4 0 6 . pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
MIIIIIIIIIIIMillllllllMIIIMIIIIIIIIIMrlNII I I I I t I I I I I I 
a4 0 6 I KP KTN A FE AAYTCEN YALWMG P YK V S KG I K PTEG LM V D FS D I Q P YGNHMGN SAP S VE ADN 

250 260 270 280 290 300 

310 320 
m4 0 6 . pep S HEG YG Y S DE WRQHRQGQPX 

II II I I 11 I I: I I: I I I I I I I 
a4 0 6 SHEGYG YSDEAVRRHRQGQPX 

310 320 



EXAMPLE 2 
Expression of ORF 919 
The primer described in Table 1 for ORF 919 was used to locate and clone ORF 919. 
The predicted gene 919 was cloned in pET vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 919-His fusion protein purification. Mice were immunized with the purified 919- 
His and sera were used for Western blot (panel B), FACS analysis (panel C) 5 bactericidal 
assay (panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker, PP, 
purified protein, TP, K meningitidis total protein extract; OMV, N. meningitidis outer 
membrane vesicle preparation. Arrows indicate the position of the main recombinant protein 
product (A) and the AT. meningitidis immunoreactive band (B). These experiments confirm 
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that 919 is a surface-exposed protein and that it is a useful immunogen. The hydrophilicity 
plots, antigenic index, and amphipatic regions of ORF 919 are provided in Figure 10. The 
AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, 1 Immunol 
143:3007; Roberts etal. 1996, AIDS Res Human Retroviruses 12:593; Quakyi etal. 1992, 
Scand J Immunol Suppl 1 1:9). The nucleic acid sequence of ORF 919 and the amino acid 
sequence encoded thereby is provided in Example 1 . 

EXAMPLE 3 
Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
The predicted gene 279 was cloned in pGex vector and expressed in E. colL The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 279-GST purification. Mice were immunized with the purified 279-GST and sera 
were used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay 
(panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. 
meningitidis total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. 
Arrows indicate the position of the main recombinant protein product (A) and the K 
meningitidis immunoreactive band (B). These experiments confirm that 279 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 279 are provided in Figure 1 1 . The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1 989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi etal. 1992, Scand J Immunol Suppl 1 1 :9). 
The nucleic acid sequence of ORF 279 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 4 
Expression of ORF 576 and 576-1 
The primer described in Table 1 for ORF 576 was used to locate and clone ORF 576. 
The predicted gene 576 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 576- 
GST fusion protein purification. Mice were immunized with the purified 576-GST and sera 
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were used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), 
and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis 
total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. Arrows 
indicate the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B).. These experiments confirm that ORF 576 is a surface-exposed 
protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and 
amphipatic regions of ORF 576 are provided in Figure 12. The AMPHI program is used to 
predict putative T-ccll epitopes (Gao et al 1989, 7. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 576 and the amino acid sequence encoded thereby is 
provided in Example 1. 

EXAMPLE 5 
Expression of ORF 519 and 519-1 
The primer described in Table 1 for ORF 519 was used to locate and clone ORF 519. 
The predicted gene 519 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 519- 
His fusion protein purification. Mice were immunized with the purified 519-His and sera were 
used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), and 
ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis total 
protein extract; OMV, K meningitidis outer membrane vesicle preparation. Arrows indicate 
the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B). These experiments confirm that 5 19 is a surface-exposed protein 
and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and amphipatic 
regions of ORF 519 are provided in Figure 13. The AMPHI program is used to predict 
putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, AIDS Res 
Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 11:9). The nucleic 
acid sequence of ORF 5 19 and the amino acid sequence encoded thereby is provided in 
Example 1. 
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EXAMPLE 6 
Expression of ORF 121 and 121-1 
The primer described in Table 1 for ORF 121 was used to locate and clone ORF 121 . 
The predicted gene 121 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 121- 
His fusion protein purification. Mice were immunized with the purified 121-His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 121 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 121 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 121 are provided in 
Figure 14. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1 992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 1 2 1 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 7 
Expression of ORF 128 and 128-1 
The primer described in Table 1 for ORF 1 28 was used to locate and clone ORF 1 28. 
The predicted gene 128 was cloned in pET vector and expressed in E. colL The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 128- 
His purification. Mice were immunized with the purified 128-His and sera were used for 
Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel D) and 
ELISA assay (panel E). Results show that 128 is a surface-exposed protein. Symbols: Ml, 
molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. meningitidis 
outer membrane vesicle preparation. Arrows indicate the position of the main recombinant 
protein product (A) and the N. meningitidis immunoreactive band (B). These experiments 
confirm that 128 is a surface-exposed protein and that it is a useful immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 128 are provided in 
Figure 15. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
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Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 128 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 8 
Expression of ORF 206 
The primer described in Table 1 for ORF 206 was used to locate and clone ORF 206. 
The predicted gene 206 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 206- 
His purification. Mice were immunized with the purified 206-His and sera were used for 
Western blot analysis (panel B). It is worthnoting that the immunoreactive band in protein 
extracts from meningococcus is 38 kDa instead of 17 kDa (panel A). To gain information on 
the nature of this antibody staining we expressed ORF 206 in E. coli without the His-tag and 
including the predicted leader peptide. Western blot analysis on total protein extracts from E. 
coli expressing this native form of the 206 protein showed a recative band at a position of 38 
kDa, as observed in meningococcus. We conclude that the 38 kDa band in panel B) is specific 
and that anti-206 antibodies, likely recognize a multimeric protein complex. In panel C is 
shown the FACS analysis, in panel D the bactericidal assay, and in panel E) the ELISA assay. 
Results show that 206 is a surface-exposed protein. Symbols: Ml, molecular weight marker; 
TP, N. meningitidis total protein extract; OMV, N. meningitidis outer membrane vesicle 
preparation. Arrows indicate the position of the main recombinant protein product (A) and the 
N. meningitidis immunoreactive band (B). These experiments confirm that 206 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 519 are provided in Figure 16. The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1 :9). 
The nucleic acid sequence of ORF 206 and the amino acid sequence encoded thereby is 
provided in Example 1. 
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EXAMPLE 9 
Expression ofORF287 
The primer described in Table 1 for ORF 287 was used to locate and clone ORF 287. 
The predicted gene 287 was cloned in pGex vector and expressed in E. coll The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 287- 
GST fusion protein purification. Mice were immunized wilh the purified 287-GST and sera 
were used for FACS analysis (panel B), bactericidal assay (panel C), and ELISA assay (panel 
D). Results show that 287 is a surface-exposed protein. Symbols: Ml, molecular weight 
marker. Arrow indicates the position of the main recombinant protein product (A). These 
experiments confirm that 287 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 287 are provided in 
Figure 17. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1 992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 287 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 30 
Expression of ORF 406 
The primer described in Table 1 for ORF 406 was used to locate and clone ORF 406. 
The predicted gene 406 was cloned in pET vector and expressed in E. colL The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 406- 
His fusion protein purification. Mice were immunized with the purified 406-His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 406 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 406 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 406 are provided in 
Figure 18. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, 7. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 406 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 1 1 



Table 2 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 225 among different strains. 
Table 2 



225 gene variability: List of used Neisseria strains 


Identification Strains 


Source / reference 


number 




Group B 




zoOl 225 NG6/88 


in.. ivjajalui / OvllCI ol (XI, y 1 yyo 


zo02_225 BZ198 


R. Moxon / Seiler et al, 1 996 


zo03 225 NG3/88 


R. Moxon / Seiler et al, 1996 


zo04 225 297-0 


R. Moxon / Seiler et al, 1996 


zo05_225 1000 


R. Moxon / Seiler et al, 1996 


zo06 225 BZ147 


1? \zfnvnn / vVili*r &t nl 1 QQ£ 

IV. 1V1\JAU1i / OvPiiCI t>l Lit.) 1 ZrZrlj 


zo07~225 BZ169 


R. Moxon / Seiler et al, 1 996 


zo08 225 528 


R. Moxon / Seiler et al, 1996 


zo09_225 NGP165 


R. Moxon / Seiler et al, 1 996 


70IO 225 R7m 


T? Mnvnn / Q jail at* nt nl 1 OO^ 

iv. lviuAun / ociier ei a/., lyyo 


zoll 225 NGE31 


R. Moxon / Seiler et al, 1996 


zol2 225 NGF26 


R. Moxon / Seiler et al, 1996 


zol3 225 NGE28 


R. Moxon / Seiler et al, 1 996 


zol4 225 NGH38 


R. Moxon / Seiler et al, 1996 


zol5 225 SWZ107 


R. Moxon / Seiler et al, 1996 


zol6 225 NGH15 


R. Moxon / Seiler et al, 1 996 


zol7 225 NGH36 


R. Moxon / Seiler et al, 1996 


zol8 225 BZ232 


R. Moxon / Seiler et al, 1 996 


zol9 225 BZ83 


R. Moxon / Seiler et al, 1 996 


zo20_225 44/76 


R. Moxon / Seiler et al, 1 996 


zo21 225 MC58 


R. Moxon 


zo96_225 2996 


Our collection 


Group A 




zo22 225 205900 


R. Moxon 


zo23 225 F6124 


R. Moxon 


z2491 Z2491 


R. Moxon / Maiden et al, 1998 


Group C 




zo24 225 90/18311 


R. Moxon 


zo25_225 93/4286 


R. Moxon 
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Others 

zo26_225 A22 (group W) R. Moxon / Maiden et a/., 1 998 
zo27_225 E26 (group X) R. Moxon / Maiden <?/ a/., 1 998 
zo28_225 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zo29_225 E32 (group Z) R. Moxon / Maiden et aL, 1998 

Gonococcus 

zo32_225 Ng F62 R. Moxon / Maiden et aL, 1 998 

zo33_225 Ng SN4 R. Moxon 

fa!090 FA 1090 R. Moxon 



References: 

Seiler A. et aL, Mol. Microbiol, 1996, 19(4): 84 1-856. 
Maiden et aL, Proc. NatL Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 

>FA1090 <SEQ ID 3115> 

MDSFFKPAWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDC3 
GFMQHIFKRAMGINLPRTSAEQARMGAFVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z2491 <SEQ ID 3116> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLFINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 
MQH I FKRAMG INL PRT S AE OARMGTP V AR SE LQ PG DMVFFRT LGGSRISHVGLYI GNN R F 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN+ 

ZO01_225 <SEQ ID 3117> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRA3NADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO02_225 <SEQ ID 3118> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
N A DE L I GS AMGLN EQ P VL P VNRV PARRAGN ADELI GN AMGLN EQ P V L PVNRA PARRAGN A 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO03_225 <SEQ ID 3119> 

MDSFFKPAVWAVLWLMFAVRLALADELTNLLSSREQILRQFAEDEQPVLPINRAFARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQE^SDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 
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ZO04_225 <SEQ ID 3120> 

KDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARNGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO05_225 <SEQ ID 3121> 

MDS FFKPAVWAVLWLM FAVRPALADELTNLLS SREQI LRQFAEDEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO06225 <SEQ ID 3122> 

MDS FFKPAVWAVLWLMFAVRPALADEL7NLLS SREQI LRQFAEDEQPVLP INRAPARRAG 
N A DE L I G S AMGLNEQP VLP VNRVPARRAGN ADE LI GNAMGLN EQ PVL PVNRAPARRAGNA 
DE L IGN AMGL LGI AYR YGGT S V £ TG FDC S G FMQHI FKRAMG INLPRT S A3QARMGT P VAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

ZO07_225 <SEQ ID 3123> 

MDS FFKPAVW AVLWLMFAVRPALADELTN LLS 3 REQ I LRQFAEDEQ PVL? IN RAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO08_225 <SEQ ID 3124> 

MDSFFKPAVWAVLWLM FAVRPALADELTNLLS SREQILRQFAEDEQPVLPINRAPARRAG 
N A DE L I G S AMG LN EQ P VL P VN R V PARRAGNADE L I G S AMGLN E QP V L P VNRA PARRAG N A 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO09_225 <SEQ ID 3125> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLS SREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPWJRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

ZO10_225 <SEQ ID 3126> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGN AMG LNEQPVL PVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

Z011_225 <SEQ ID 3127> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAEDEQPVLP INRAPARRAG 
N A DE L I G S AMG LNE Q PVL P VN RV PARRAGNADE L I GN AMG LNEQ P VL P VN RA P ARRAGN A 
DELI GNAMGLN EQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIE ITS LSHKYWSGKYAFARR VKKNDPSRFLN * 

Z012_225 <SEQ ID 3128> 

MDS FFKPAVWAVLWLM FAVRPALADELTNLLS SREQI LRQFAEDEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELI GN AMGL LG I AYR YGG TSISTGFDCSG FMQH I FKRAMG I N L PRT S AE Q ARMGT P VAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGK V AFARR 
VKKNDPSRFLN* 

Z013_225 <SEQ ID 3129> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFIQHIFKRAMGINLPRTSAEQARMGTPVAR 
SE LQPGDMVFFRTLGGSRI SHVGLY IGNNRFI HAPRTGKN I EI T SLS HKYWSGKYAFARR 
VKKNDPSRFLN * 

Z014_225 <SEQ ID 3130> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLY IGNNRFIHAPRTGKN IEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z015_225 <SEQ ID 3131> 

MDSFFKPAWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCS 
G FMQHI FKRAMGINLPRTSAEQARMGT PVARSELQPG DMVFFRTLGG SRI SHVG L, YI GNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z016_225 <SEQ ID 3132> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELI GN AMGL LG I A YR YGGT S VSTGFDCSG FMQH I FKRAMG I N L PRT S AE Q ARMGT PV AR 
SELQPGDMVFFRTLGGSRI SHVG LYIGNNRFI HAPRTGKN IEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z017J225 <SEQ ID 3133> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGI AYR YGGT SVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKN IEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z018_225 <SEQ ID 3134> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGI AYR YGGT SVSTGFDCSG FMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKN IEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z019_225 <SEQ ID 3135> 

M D S FFK P AVW A VLWLM FAVR PAL A DE LTN LLS S RE Q I LRQ FAE DE Q PVL P IN RA PARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO20_225 <SEQ ID 3136> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPINRAPARRAGNADELIGSAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSG? 
MQH I FKRAMG INLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRI SHVG LYIGNNRF 
IHAPRTGKN IEITSLSHKYWSGKYAFARR VKKNDPSRFLN* 

Z021_225 <SEQ ID 3137> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADEL IG S AMGLNEQPVLPVNRV PARRAGNADEL I GNAMGLN EQPV LP VNRAP ARRAGN A 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFI HAPRTGKN IEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z022_225 <SEQ ID 3138> 

M D S F FK?A VW AVLWLM FAVR P ALADE LTN LLS S REQ I LRQ FAE DEQ PV L P I N RAP ARRAG 
N A DEL I G S AMGLN EQ P V L P VN R V P ARRAGN AD ELI GN AMG LNEQ P VLP VNRA P ARRAGN A 
DELIGNAMGLLG I AYRYGGTSISTGFDCSGFMC'H I FKRAMGINLPRTSAEQARMGTPVAR 
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SELQPGDMVFFRTLCGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z023 225 <SEQ ID 3139> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLG I AYRYGGTS I STG FDC SG FMQH I FKRAMG IN LPRTSAEQARMGT PVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

Z024_225 <SEQ ID 3140> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELI GN AMGLLG I AY R YG GT SISTGFDCSGFMQHI FKRAMG I N L PRT S AEQARMGT PVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z025_225 <SF.Q TD 31.4 1> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIG S AMG LNEQPVLPVNR V PARRAGNADE L I GNAMG LNE Q PVL PVNRAPARRAGN A 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z026_225 <SEQ ID 3142> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

Z027_225 <SEQ ID 314 3> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLFINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLG I AYRYGGTSVSTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z028_225 <SEQ ID 3144> 

MDS FFKPAVWAVLWLM FAVRPALADE LTN LLS SREQILRQFAEDEQPVL PI NRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLG I AYR YGGTSVSTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z029_225 <SEQ ID 3145> 

MD S FFKPAVWAVLWLM FAVRPALADE LTN LLS SREQI LRQFAE DEQP VL PI NRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGI AYRYGGTSVSTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z032_225 <SEQ ID 3146> 

MDS FFKPAVWAVLWLM FAVRPALADE LTN LLS SREQI LRQFAE DEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHI FKRAMG INLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARR VKKNDPSRFLN* 

Z033_225 <SEQ ID 3147> 

MDS FFKPAVWAVLWLMFAVRSALADE LTN LLS SREQI LRQFAE DEQPVLPVNRAPARRAG 
N A DE L I G S AMG LN EQ PVL P VN RA P ARRAGN ADEL I GS AMGL LG I A YR Y GG T S VS T G FDC S 

GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRIKKNDPSRFLN* 

Z096_225 <SEQ ID 3148> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLFINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARE^AGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 



Figure 19 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 225, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 12 



Table 3 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 235 among different strains. 
Table 3 



235 gene variability: List of used Neisseria strains 


Identification Strains 


Reference 


number 






Group B 




gnmzqOl 


NG6/88 


Seiler e/ <//., 1996 


gnmzq02 


BZ198 


Seiler et al, 1996 


gnmzq03 


NG3/88 


Seiler et al, 1996 


gnmzq04 


1000 


Seiler et al, 1996 


gnmzqOS 


1000 


Seiler etal, 1996 


gnmzq07 


BZ169 


Seiler et al, 1996 


gnmzq08 


528 


Seiler etal, 1996 


gnmzq09 


NGP165 


Seiler et al, 1996 


gnmzqlO 


BZ133 


Seiler etal, 1996 


gnmzqll 


NGE31 


Seiler et al, 1996 


gnmzql3 


NGE28 


Seiler etal, 1996 


gnmzql4 


NGH38 


Seiler etal, 1996 


gnmzqlS 


SWZ107 


Seiler etal, 1996 


gnmzql6 


NGH15 


Seiler etal, 1996 


gnmzql7 


NGH36 


Seiler <?* al, 1996 


gnmzql8 


BZ232 


Seiler al, 1996 


gnmzql9 


BZ83 


Seiler etal, 1996 


gnmzq2l 


MC58 


Virjiefa/., 1992 


Group A 




gnmzq22 


205900 


Our collection 
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gnmzq23 F6124 
Z2491 Z2491 



Our collection 
Maiden et aL, 1998 



Group C 

gnmzq24 90/18311 
gnmzq25 93/4286 



Our collection 
Our collection 



Others 

gnmzq26 A22 (group W) Maiden et aL, 1998 
gnmzq27 E26 (group X) Maiden et aL, 1998 
granzq28 860800 (group Y) Maiden et aL, 1998 
gnmzq29 E32 (group Z) Maiden et aL, 1998 
gnmzq31 N. lactamica Our collection 

Gonococcus 

gnmzq32 NgF62 Maiden et aL, 1998 

gnmzq33 NgSN4 Our collection 

fa!090 FA1090 Dempsey et aL 1991 



References: 



Seiler A. et aL, Mol. Microbiol., 1996, 19(4):841-856. 

Maiden R. etaL, Proc. Natl Acad. Sci. USA, 1998, 95:3140-3145. 

Viiji M. et al 9 Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. et aL, J. Bacterid., 1991 , 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 314 9> 

MKPLIl^IJ^VLALSACQVRKAPDLDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GHMZQ01 <SEQ ID 3150> 

MKPLILGIJUIVIjALSACQVQKAPDFDYTSFKESKPA5ILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSA^/VNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK+ 

GNMZQ02 <SEQ ID 3151> 

l^PLIl^LAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEET FKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSA'/VNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 



GNMZQ03 <SEQ ID 3152> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVWGTWGVLAST 
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AAPLSEAGYYVFPAAVVEETFKQNGLTN'AADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ04 <SEQ ID 3153> 

MK PL I L 3 LAAV LAL S ACQVQKA P D FD YT S FKE S KPAS I LW P P LNE S P D VN GT WG VL A ST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK* 

GNMZQ05 <SEQ ID 3154> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEET FKENGLTNAADIHAVRPEKLHQI FGN DAVLY I TV7E YGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQV5KTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ07 <SEQ ID 3155> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQV SKTAAYNLLS PYSHNG I LKG PRFVEEQPK* 

GNMZQ08 <SEQ ID 3156> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK* 

GNMZQO 9 <SEQ ID 3157> 

MK PL I LGLAAAL VLS ACQVQKA P D FD YT S FKE SKPAS I LW P PLN ESP D VNG TWGM LAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVQPEKLHQIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK * 

GNMZQ10 <SEQ ID 3158> 

MKPLILGLAAVLALSACQVQKAPDFDYT3FKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITV1EYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSG3ASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ11 <SEQ ID 3159> 

MK PL I LG L AA VLAI • S ACQ VQKAP D FD YT S FKE SK PAS I LW PPLNESPDVM GT W G VLA S T 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI L DS VTTVS AKARLV D SRN GKE LW S G S AS I RE G SNN SN S GL LG ALV S AWN Q IAN S LT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK* 

GNMZQ13 <SEQ ID 3160> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVKGTWGVLAST 
AAP LS EAG YY V FP AAWE ETFKQNG LT N AAD I HAVRPE KLI 1 QI FGN D AVL Y I T VTE YGT S 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ14 <SEQ ID 3161> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAAD I HAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGI LKG PR FVEEQPK * 

GNMZQ15 <SEQ ID 3162> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQKGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK* 

GNMZQ16 <SEQ ID 3163> 
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MK PL I LG LAA V L ALS ACQ VQKA PD FD YT S FKE S K PAS I LW P P LN E S P D VNG T WG V LAS T 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ17 <SEQ ID 3164> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPL S EAG Y YV FPAA WE ET FKQN G LT N AAD I H AVR PE KLHQ I FGN D A V L Y I TV T E YG T S 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ18 <SEQ ID 3165> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLS P YSHNG I LKGPRFVEEQPK * 

GNMZQ19 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYT S FKESKPAS I LW PPLNES PD\^GTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNG I LKGPRFVEEQPK* 

GNMZQ21 <SEQ ID 3166> 

MKP L I LG LAAVLAL S ACQVQKAP D FD YTS FKE S K P A S I L W P P LNE 5 P D VN GT WGV L A S T 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKSLWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNG I LKGPRFVEEQPK* 

GNMZQ22 <SEQ ID 3167> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIPiAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNG I LKGPRFVEEQPK* 

GNMZQ23 <SEQ ID 3168> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 

GNMZQ24 <SEQ ID 3169> 

MK PL I LG L AA VL A LS ACQVQKA P D FD YTS FKE S K P AS I L W P P LN E S PD VNGT WG V LAS T 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS P YS HNG I LKG PRFVEEQ PK * 

GNMZQ25 <SEQ ID 3170> 

MKPLILGIAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ26 <SEQ ID 3171> 

MKPLILGLAAVLALS ACQVQKAP DFD YTS FKE SKPAS I L WPP LNE SPDVNGTWGMLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 

GNMZQ27 <SEQ ID 3172> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADI HAVRPEKLHQI FGNDAVLYITVTE YGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 
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GNMZQ28 <SEQ ID 3173> 

MKPLILSLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDWGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ29 <SEQ ID 3174> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEET FKQNGLTNAADI HAVRPEKLHQI FGNDAVLY I TVTEYGT S 
YQILDSVTTVS7^KARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQ:ANSLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 

GNMZQ31 <SEQ ID 3175> 

MK P L I LGLAAVLALS ACQ VQKA P D FD YT A FKE S K P AS I L W P P LN E S PD VNG TWGMLAS T 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITITEYGT5 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRG YQV SKAAA YDLLS P Y S HN G I LKG PRFVE E Q PK * 

GNMZQ32 <SEQ ID 317 6> 

MKPLI LGLAAVLALS AC QVRKAP DLDYTS FKESKPAS I LWP PLME S PD VNGTWGMLAS T 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSRNGI LKG PRFVEEQPK* 

GNMZQ33 <SEQ ID 3177> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAP I SEAGYYVFPAAWEET FKENGLTNAAD I HAVRPEKLHQI FGN DAVLY I TVTEYGT S 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWKQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEECPK* 

Z2491 <SEQ ID 3178> 

MK PL I LGLAAVLALS ACQ VQKAP D FDYT S FKE SKP AS ILWPPLNESP D VN G T W G V LA S T 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADI HAVRPEKLHQI FGNDAVLY I TVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK+ 



Figure 20 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 235, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 13 



Table 4 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 287 among different strains. 
Table 4 

287 gene variability: List of used Neisseria strains 

Identification Strains Reference 
number 
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Group B 



287_2 BZ198 

287_9 NGP165 

287_14 NGH38 

287 21 MC58 



Seiler a/., 1996 
Seiler etal, 1996 
Seilere/ al, 1996 
Virjie/a/., 1992 



z2491 



Group A 

Z2491 



Maidens al, 1998 



Gonococcus 

fal 090 FA1 090 Dempsey et al 1991 



References: 

Seiler A. etal, Mol. Microbiol, 1996, 19(4): 84 1-856. 

Maiden R. et al, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. et al, Mol. Microbiol., 1992 : 6:1271-1279 

Dempsey J.F. et al, J. Bacteriol., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



287^14 <SEQ ID 3179> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPS AQGGQDMAAVS EENTGNGGAAAT DKPKN2 DEGAQN DMPQNAADT D S LT PNHT PAS 
NMPAGNMENQAPDAGESEQPANQFDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSVVIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSE FEKLSDADKI SNYKKDGKNDGKNDKFVGLVADS VQMKG INQY 1 1 FYK PKPr S FAR 
FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 
GG S Y ALRVQGE P SKGEM LAGT AV YNGE VLH FHTENG R PS P SRGR FAAKV DFGSKSVDGII 

DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_2 <SEQ ID 3180> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGA PSAQGG QDMAAV S EEN T GNGGAAATDKPKNE DEGAQN DM PQNAADTDSLT PNHT PAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 
FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 
DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD * 

287_21. <SEQ ID 3181> 

MFKRS V IAMACI FALSACGGGGGGS PDVKSADTLSKPAAPV VSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKKEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSE FEKLSDADKI SNYKKDGKNDKFVGLVADS VQMKG INQYI I FYK PKPTS FAR FRRS 
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ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRB-AAKVDFGSKSVDGIIDSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEXGGFGV 
FAGKKEQD* 

287_9 <SEQ ID 3182> 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPWTEDVGEEVLPKEKKDEEA 
VSGAPQADTQDATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADTDS 
STPNHTPAPNKPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAGENAGNTADQA 
ANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKVCDR 
DFLDEEAPPKSEFEKLSDEEKINKYKKDEQREN FVGLVADRVEKNGTNKYVI I YKDKS7VS 
SSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYG 
AEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAfCVDFGSKS 
VDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPT 
DAEKGG FG VFAGKKEQD* 

FA1090 <SEQ ID 3183> 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPWAENAGEGVLPKEKKDEEA 
AGGAPQADTQDATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAAESAN 
QTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDSCNGDN 
LLDEEAPSKSEFEKLSDEEKI KR YKK DEQREN FVG LVADR VKK DGTNK Y 1 1 FYTD KP PT R 
SARSRRSLPAEI PLI PVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRYLTYGAEKLPGGS 
YALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGSKSVDG 1 1 DSG 
DDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPTDAEKGGFG 
VFAGKKDRD* 

Z2491 <SEQ ID 3184> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSOG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
N M LAGNMEN QAT DAG E S SQ PANQ P DMAN AADGMQG DDP S AGGQNAGNT AAQGANQ AGNN Q 
AAGSSDPIPASNPAPANGGSNFGRVDLAMGVLIDGPSQNITLTHCKGDSCSGKNFLDEEV 
QLKSE FEKLS DADKI SNYKKDGKNDK FVGLVADS VQMKGINQYI I FYKPKPTSFARFRRS 
AR.SRRSLPAEMPL I PVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRYLTYGAEKLPGGS Y 
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGD 
DLHMGTQKFKAAI DGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS YR PT DAEKGGFGV 
FAGKKEQD* 



Figure 21 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 287, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 14 



Table 5 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 519 among different strains. 
Table 5 

519 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 



WO 99/57280 



129 



PCT7US99/09346 



number 




Group B 


zvOl 519 


NG6/88 


zv02_519 


BZ198 


zv03 519ass 


NG3/88 


zv04 519 


297-0 


zv05_519 


1000 


zv06 519ass 


BZ147 


zv07 519 


BZ169 


zvll 519 


NGE31 


zvl2 519 


NGF26 


zvl8 519 


BZ232 


zvl9_519 


BZ83 


zv20_519ass 


44/76 


zv21 519ass 


MC58 


zv96 519 


2996 



R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al, 1 996 
R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al 9 1996 
R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al, 1996 
R. Moxon / Seiler et al, 1 996 
R. Moxon / Seiler et al, 1996 
R. Moxon 
Our collection 



Group A 

zv22_519ass 205900 
z2491 519 Z2491 



R. Moxon 
R. Moxon / Maiden et al, 1998 



Others 

zv26_519 A22 (group W) R. Moxon / Maiden et al, 1998 
zv27_51 9 E26 (group X) R. Moxon / Maiden et al, 1998 
zv28_519 860800 (group Y) R. Moxon / Maiden et al, 1998 
zv29_519ass E32 (group Z) R. Moxon / Maiden et al, 1998 



Gonococcus 

zv32_519 Ng F62 

fa!090 519 FA1090 



R. Moxon / Maiden et al, 1998 
R. Moxon 



References: 

Seiler A. etal, Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090_519 <SEQ ID 3185> 

ME F F 1 1 LLAAV AV FG FKS FV V I PQQE VH VVE RLG R FH RALTAGLN I L I P FI DR VA YRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAI^QAQirAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANPXAIRQIAAALQTOGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 
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Z2491_519 <SEQ ID 3186> 

MEFFIILIAAVWFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMATTQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV01_519 <SEQ ID 3187> 

ME FFI I LLVAVAVFGFKS FWI PQQF.VH WERLGRFHRALTAGLNI LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INS T WAALDEAAGAWG VKV LRYE IKDLVP PQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQ1NLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV02_519 <SEQ ID 3188> 

ME FFI I LLVAVAV FGFKS FWI PQQE VH VVERLGR FHRALT AG LNILIPFI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RME L DKT FE ERD E I N S TW S ALDE AAG AW G VKVLR YE I KDL V P PQE I LR SMQAQI T AE RE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESN1TLIMPANVADIGSL 
ISAGMKI I DS SKTAK* 

ZV03_519 <SEQ ID 3189> 

ME FFI I LLVAVAV FG FKS FWI PQQEVHWERLGRFHRALTAGLNILI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMEL DKTFEERDE I N ST V V S AL DE AAG AWG VKVLR YE IKDLVPPQEILR SMQAQI T AE RE 
KRAR I AE S EGRK I E Q I N L A SGQRE AE I QQ S E GEAQ AA VN AS N AE K I AR I NRAKGE AE SLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV04_519 <S2Q ID 3190> 

ME FFI I LLVAVAVFGFKS FWI PQQEVHWERLGRFHRALTAGLNILI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWSALDE71AGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKI IDS SKTAK* 

ZV05_519 <SEQ ID 3191> 

MEFFI I LLVAVAVFGFKS FWI PQQEVHWERLGRFHRALTAGLNILI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DS S KTAK * 

ZV06_S19ASS <SEQ ID 3192> 

MEFFI I LLVAVAV FG FK S FVV I PQQE VH WE RLGR FHRALT AG LN I L I P F I DRV AY RHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEIN ST VFS ALDE AAGAWGVKVLRYE IKDLVP PQE I LRSMQAQITAERK 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKI IDSSKTAK* 

ZV07_519 <SEQ ID 3193> 

MEFFI I LLVAVAV FGFKS FVV I PQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPXLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKI IDSSKTAK* 
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ZV11_519 <SEQ ID 3194> 

ME FFI ILLAAVAV FG FKS FW I PQQEVH WERLGR FHRALTAGLN ILIPFI DRVAYRH SL 
K2IPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEER DE I N ST WAALDE AAGAWG VKVLRYE I K DLV PPQE T LR SMQAQ I TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
L V AE ANAEAI RQ IAAALQ TQGG AD AVN LK I AE Q YVAAFNN LAKE SN T L IM P AN VAD I G S L 
I SAGMKI I DSSKTAK* 

ZV12_519 <SEQ ID 3195> 

MEFFI IL LVAVAVFG FKS FW I PQQEVH WERLGR FH RALTAG LN I L I PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DS S KT AK * 

ZV18_519 <SEQ ID 3196> 

MEFFI ILLVAVAVFGFKS FWI PQQE VHWERLGRFH RALTAG LN I LIPFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RME L DKT FEE R DE I N S T WAALDE AAGAWG VKVLRY E IK DL VP PQE I LR SMQAQ I TAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DS SKTAK * 

ZV19_519 <SEQ ID 3197> 

ME FFI I LLVAVAV FG FKS FWI PQQE VH WER LGRFHRALT AG LN I L I P FI DRVAYRH S L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMEL DKT FEERDE INST WAAL DEAAGAWG VKVLRYE I KDLVP PQE I LRSMQAQI TAERE 
KRAR I AE S EGRK I EQI N LAS GQREAE I QQ S EG E AQ AAVN A S N AEKI AR I NRAKGE AE SLR 
LVAEAN AE A I RQI AAALQT QGG A DAVN LK I AE Q YVAA FNN LAKE S N T LI M P AN VA D I G S L 
I SAGMKI IDS SKTAK* 

ZV20_519ASS <SEQ ID 3198> 

ME FFI I LLVAVAVFGFKS FWI PQQEVH WERLGR FHRALTAGLN I LI P FI DRVAYRH S L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFOVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINST WAALDEAAGAWGVKVLR YE I KDLVP PQE ILRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSM 
ISAGMKIIDSSKTAK* 

ZV21_519ASS <SEQ ID 3199> 

ME FFI I LLVAVAVFGFKS FWI PQQEVH WERLGR FHRALTAGLN I LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRAR I AESEGRK I EQ IN LASGQREAE I QQSE GEAQAAVN AS N AE K I AR INRAKGEAE S LR 
LVAEAN AEAIRQI AAALQTQGGADAVNLK I AEQYVAAFNNLAKES NTL IMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV22_519ASS <SEQ ID 3200> 

MEFFIILLAAWVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERF. 
KRARIAESEGRKIEQINLASGQREAKIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV26_519 <SEQ ID 3201> 

ME FFI I LLAAWVFG FKS FWI PQQEVH WERLGR FHRALTAGLN ILIPFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAE ANAEA I RQI AAALQTQGGADA VN LK I AEQY V AA FNN LAKE SNT L I M PAN VAD I GS L 
ISAGMKIIDSSKTAK* 

ZV27 519 <SEQ ID 3202> 
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MEFFIILLVAVAVFGFKSFWIFQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RME L D KT FEERDE IN S TWAAL DEAAG AWG VK V LRYEIKDLVPPQEI LR SMQ AQI T AE RE 
KRAR I AE S EGRK I EQ I N LASGQRE AE I QQSEGE AQAAVN AS N AEK I AR I N RAKGE AES LR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV28_519 <SEQ ID 3203> 

ME FFI I LLAAVAV FG FKS FWI PQQEVH WERLGRFHRALTAGLN I LI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYI^4AITQLAQTTLRSVIG 
RMELDKT FEER DF.IN ST WAALDE AAGAWGVKVLRYEI KDLVPPQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
1 3AGMKI I DSSKTAK * 

ZV29_519ASS <SEQ ID 3204> 

ME FFI ILLAAVAVFGFKS FW I PQQEVH WERLGRFHRALTAGLNILI PFI DRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSIWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREPEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAAXQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSNKTAK * 

ZV32_519 <SEQ ID 3205> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRAIjTAGLNILIPFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
L VAE AN AE AI RQ I AAALQTQGG A DAVN LKI AE Q YVAA FNN LAKE SNT L I M P AN VAD I G S L 
ISAGMKIIDSSKTAK* 

ZV96_519 <SEQ ID 3206> 

ME FF 1 1 L LAAVAV FG FK S FW I PQQE V HWERLGRFHRALTAG LN I L I P FI DRVAYRH S L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INS T WAALDE AAGAWGVKVLRYE IKDLVP PQE I LRSMQAQI TAERS 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

Figure 22 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 519, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 15 



Table 6 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 919 among different strains. 
Table 6 



919 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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Group B 






zmOl 


NG6/88 


R. Moxon / Seiler et aL, 


1996 


zm02 


BZ198 


R. Moxon / Seiler et aL, 


1996 


zm03 


NG3/88 


R. Moxon / Seiler et aL, 


1996 


zm04 


297-0 


R. Moxon / Seiler et aL, 


1996 


zm05 


1000 


R. Moxon / Seiler et aL, 


1996 


zm06 


BZ147 


R. Moxon / Seiler et aL, 


1996 


zm07 


BZ169 


R. Moxon / Seiler et aL, 


1996 


zm08n 528 


R. Moxon / Seiler et aL, 


1996 


zm09 


NGP165 


R. Moxon / Seiler et aL, 


1996 


zmlO 


BZ133 


R. Moxon / Seiler et aL, 


1996 


zmllasbc NGE31 


R. Moxon / Seiler et aL, 


1996 


zml2 


NGF26 


R. Moxon / Seiler et aL, 


1996 


zml3 


NGE28 


R. Moxon / Seiler et aL, 


1996 


zml4 


NGH38 


R. Moxon / Seiler et aL, 


1996 


sml5 


SWZ107 


R. Moxon / Seiler et aL, 


1996 


zml6 


NGH15 


R. Moxon / Seiler et aL, 


1996 


zml7 


NGH36 


R. Moxon / Seiler et aL, 


1996 


zml8 


BZ232 


R. Moxon / Seiler et aL, 


1996 


zml9 


BZ83 


R. Moxon / Seiler et aL, 


1996 


zm20 


44/76 


R. Moxon / Seiler et aL, 


1996 


zm21 


MC58 


R. Moxon 




zm96 


2996 


Our collection 





Group A 

zm22 205900 
zm23asbc F6124 
z2491 Z2491 



R. Moxon 
R. Moxon 

R. Moxon / Maiden et aL, 1998 



Group C 

zm24 90/18311 
zm25 93/4286 



R. Moxon 
R. Moxon 



Others 

zm26 A22 (group W) R. Moxon / Maiden et aL, 1998 
zm27bc E26 (group X) R. Moxon / Maiden et aL, 1 998 
zm28 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zm29asbc E32 (group Z) R. Moxon / Maiden et aL, 1998 
zm31asbc N. lactamica R. Moxon 

Gonococcus 

zm32asbc Ng F62 R. Moxon / Maiden et aL, 1998 

zm33asbc Ng SN4 R. Moxon 



fal090 FA1090 R. Moxon 
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References: 

Seiler A. et al 9 Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090 <5EQ ID 3207> 

MKKH LLR S A LYG I AAA I LAAC QSR S I QT FPQPDTSVINGPDR PAG I P D P AGT T V AGGG A V 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFTPWQVAGNGSIAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
I DRH Y I T LG AP L FVAT AH PVTRKALNR L I MAQDTG S A I KGA VR V D Y FW G YG DE AGE LAGK 
QKTTGY VWQLLPNGMKPEYRP * 

Z2491 <SEQ ID 3208> 

MKKYLFRAALCGIAAAILAACQSKSIQT?PQPDTSVINGPDRPVGIPDPA3TTVGGGGAV 
YTW PH L S L PH W AAQ D FAKS LQS FRLG C AN LKNRQGW QDVC AQA FQT P VH S VQAKQ FFER 

Y FT P WQ V AGN G S LAGTVT G YYE PVLKG D DRRT AQ ARF P IYGIPDDFISVPL P AGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

Y DRH Y I T LGAP L FVAT AH PVT R KALNRL IMAQ DT G S A I KG AVR VD Y FWG YG DE AGE LAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM01 <SEQ ID 3209> 

MKK YL FRAAL YG I AAA I LAACQ S KSIQTFPQPDTSVINGP DRP VG I P D PAG T T VGG GG AV 
YTWPHLS LPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVH S FQAKQFFER 

Y FT PWQVAGNG SL AGTVT GY YE PVLKG D DRRTAQAR FPIYGIPDDFISVPL PAG LR S GKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGT^L 
DGKAPILGYAEDPVELFFMHIQGSGRXKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLG QT SMQG I K S YM RQN PQRLAE V LGQN PS Y I FFRE LAG S SN DG P VG ALGT PLMGE Y AG A 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGY VWQLLPNGMKPEYRP * 

ZM02 <SEQ ID 3210> 

MKKY L FRAAL YG I AAAI LAACQS K S I QT FPQPDT S V I NG P DR P VG I P D P AGTT V GG GGA V 
YTWPHLS LP HWAAQDFAKSLQSFRLGC AN LKNRQGWQDVCAQAFQT PVH S FQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM03 <SEQ ID 3211> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKS LQS FRLGCAN LKNRQGWQDVCAQAFQT PVH S FQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQN PS YI FFRE LAGS SNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKAI^RLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM04 <SEQ ID 3212> 
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MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
Kl^QTSMOGIKAYMQQNPQRUyEvTLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

Y DRH Y I T LG A P L FVAT AH P VT RKALN RL I MAQ DT GS A I KG A VR V D YFWG YG DE AGE LAGK 
QKTTGYVWQL L PNGMKPEYRP * 

ZM05 <SEQ ID 3213> 

MKKYLFRAALYGTAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YT W PH L S L PHWAAQ D FAKS LQS FRL S CAN LKN RQGW Q D VC AQA FQT P VH S FQAKQ F FER 

Y FT PWQ VAGNG S LAGT VTG Y YE P V LKG DDRRT AQAR FPIYGIPDDFISVPL PAG LR S GKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM06 <SEQ ID 3214> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQD FAKS LQS FRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKT TG YVWQLLPN GMK PE YRP * 

ZM07 <SEQ ID 3215> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

Y FT P W Q VAGNG S LAGT VT G Y YE PV LKGD DRRT AQARFP IYGIPDDFISVPL PAG LR SGKA 
LVR I RQTGKN SGT I DNTGGTHTADLSRFP ITARTT AI KGRFEGSR FLP YHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLrMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTT G Y VWQLL PNGMK PE YR P * 

ZM08N <SEQ ID 321 6> 

MKKYLFRAALYGIAAAI LAACQSKS IQTJPQPDTS VINGPDRPVG I PDPAGTTVGGGGAV 
YTW PHLS L PH WAAQDFAKS LQ S FRLG C AN LKNRQGW QD VC AQAFQT P VHS FQAKQ F FER 
YFTPWQVAGNGS LAG TVTGYYEPVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAI KGRFEGSR FLP YHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMRQN PQRLAEVLGQN PS Y I FFRELAGS SN DG P VGALGT PLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM09 <SEQ ID 3217> 

MKKYLFRAALCGIAAAI LAACQSKS IQT FPQPDT S VINGPDRPVG I PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

Y FT PWQVAGNGSLAGTVTGY YE PVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 
LVRI RQTGKN SGTIDNTGGTHTADLSQFPITARTTAI KGRFEGSR FLP YHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQG1KSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

Y DRH Y I T LGA PLFV AT AH P VTRKALN RL I MAQDTG S A I KG A VR VD Y FWG YG DE AGE LAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM10 <SEQ ID 3218> 

MKKYLFRAALCGIAAAI LAACQSKSIQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQD FAKS LQS FRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGS LAGT VTG Y YE PVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LGYAEDPVELFFMH IQGSGRLKT PSGKYIRIGYADKNEHPYVS I GKYMADKGYL 
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KLGQTSMQGIKSYMRQNPQRLAEVLGQNP3YIFFRELTGSGNDGPVGALGTPLMGEYAGA 
V DRH Y ITLGAP L FV ATAHPVTRKALNRLIMAQDTGSA I KG AVRVDYFWGYG DEAGELAGK 
QKTTG YVWQLL PNGMK PEYRP* 

ZM11ASBC <SEQ ID 3219> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVGGGGAV 
YTWPHLSLPKWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIFDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGOT S MQG I K S YMRQN PORL AE V LGQN PS Y I F FRF. LTG S SN DG PVG ALG T PLMGE YAG A 

V DRH Y I T LG A P LFVAT AH P VT RKALN R L I MAQDTG S A I KG AVR V D Y FWG YG DEAGELAGK 
QKTTGYVWQLL PNGMKPE YRP* 

ZM12 <SEQ ID 3220> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPbTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

V FT PWQ V AGN G S LAGT VTG Y YE P VLKG DD RRT AQARF P IYGIPDDFISVPL P AGLRS GKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
K LGQT S M QG I K S YMRQN PQR LAE V LGQN P S YI FFREL AG S SN DG P VGALG T P LMGE YAG A 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLLPNGMK PEYRP* 

ZM13 <SEQ ID 322 1> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAEQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKN SGT IDNTGGTHTADLSRFP ITARTTAIKGRFEG SRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLLPNGMKPE YRP * 

ZM14 <SEQ ID 3222> 

MKK YL FRAA LCG I AAA I LAAC Q S K S I QT F PQP DT S VI NG P DR PVG I PA PAGT T VAGGG AV 
YTWPH1SLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSRNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAH P VT RKALN RL I MAQDTGSAIKGAVRVDYFWGYG DEAGELAGK 
QKT TG YVWQLLPNGMK PE YR P * 

ZM15 <SEQ ID 3223> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDLAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNHQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
L VR I RQTGKN S GT I DNTGGTHT AD LSRFP I TART T AI KG RFEG SR FLP YHTRN Q INGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSI3KYMADKGYL 
KLGQTSMQG IKS YMRQN PQRLAEV LGQN PS Y I FFRELTGSGNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDY FWG YG DEAGELAGK 
QKTTG YVWQLL PNGMKPE YRP * 

ZM16 <SEQ ID 3224> 

MKKYLFRAALCG IAAAI LAACQSKS IQT FPQPDT S VINGPGRPVG I PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQ INGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQG I K S YMRQN PQRLAEVLGQN PS Y I FFRELTGS SNDGPVGALGT PLMGE YAGA 

V DRH Y I T LGA P L FVATAH P VT RKA LNR L I MAQ DT G S A I KGA VR V D Y FWG YG DEAGE LAGK 
QKTTG YVWQLL PNGMKPE YRP * 

ZM17 <SEQ ID 3225> 
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MKKYLFRAALYGIAAAI LAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

Y T V V PHL S L PH W AAQ DFAKS LQ S FR LG C AN LKN RQG W Q DV CAQAFQT P VH S FQAKQF FE R 

Y FT PWQVAGNG SLAGTVTGYYEPVLKGDDRRTAQARFP I YGIPDDFI SVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLK7PSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLL PNGMK PEYR P * 

ZM18 <SEQ ID 3226> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGIPDDFI SVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQT SMQG IKS YMRQNPQRLAEVLGQN PS YIFFRE LAGS SNDGPVGALGT PLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGSLAGK 
QKTTG YVWQLLPNGMKPEYRP* 

ZML9 <SEQ ID 3227> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPD?SVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLS L PHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQTP VHS FQAKQ FFER 

Y FT PWQVAGNG S LAGT VTG YYE PVLKGDDRRTAQAR FP I YG I PDD FI S VPL PAG LR SGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQT SMQG IKS YMRQNPQRLAEVLGQN PS Y I FFRELAG S SN DG PVGALGT PLMGE YAGA 

Y DRHY I T LG AP L FV ATAH P VT RKALNRL IMAQDT G SAI KG AVRVD YFWG YG DE AGELAGK 
QKTTGYWQLLFNGMKPEYRP* 

ZM20 <SEQ ID 3228> 

MKKYLFRAALYGIAAAI LAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLFYHTRNQINGGAL 
DGKAPILGYAEDPVELFET4HIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSMDGPVGALGTPLMGEYAGA 
VDRH Y I T LGAPLFVATAHPVTRKALNR L IMAQDT G SAI KGAVRV DY FWG YG DEAGE LAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM21 <SEQ ID 3229> 

MKKYLFRAALYGIAAAI LAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAP I LG YAE D P VEL FFMH I QG S GRLKT P S GKY IR IG Y ADKNEH P YV S I GR YMADKG YL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM22 <SEQ ID 3230> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLS LPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGY YE PVLKGDDRRTAQARFPIYG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMACKGYL 
KLGQT SMQG IKAYMQQNPQRLAE VLGQN P S YI FFRE LTGS SN DG PVGALGT PLMGE YAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM23ASBC <SEQ ID 3231> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YT W P H L S L PH WAAQ D FAKS LQS FR LGC ANLKN RQGWQ DVCAQA FQT P VHS FQAKQF FER 

Y FT PWQVAGNG S LAGT VT G YY E P VLKG D DRRT AQAR FPIYGIPDDFISVPL P AG LRS GKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DG KA P I LG Y AE D PVE L FFMH I QG S G R LKT PS GKY I R I G YADKNEH P Y VS I GK YMADKG Y L 
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KLGQT SMQGI KS YMRQN PQRLAEVLGQN PS Y I FFRELAGS SNDG PVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTSKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPE YRP * 

ZM24 <SEQ ID 3232> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAG^RSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQG I KSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM25 <SEQ ID 3233> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQTPVKSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLI^4AQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM26 <SEQ ID 3234> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPCTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRH Y I TLG AP L EVAT AH P VT RKALNRL I MAQ DTG S AI KG AVRV D Y FWG YG DEAG E LAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM27BC <SEQ ID 3235> 

MKKYL FRAALYG I S AAI LAACQSKS IQT FPQ PDT S VI NG PDR PAG I PD PAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LGYAE D PVE L FFMH I QG S GRLKT PSGKY I RI G YADKNEH PYV S IGRYMADKGYL 
KLGQT SMQG I K S YMRQN PQRLAEVLGQN ?SY I FFRELTGS SN DG P VGALGT PLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPE YRP * 

ZM28 <SEQ ID 3236> 

MKKYLFRAALCGIAAAI LAACQSKS IQTFPQPDU'SVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI 1X3 YAE DPVELFFMHIQGSGRLKTPSGKY IRI GYADKNEHPYVS IGRYMADKGYL 
KLGQT SMQG I KA YMRQN PQRLAEVLGQN PS Y I FFRELAGS SNDG PVGALGT PLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM29ASBC <SEQ ID 3237 > 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQTPVH5FQAKQFFEP. 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LGYAE DPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 
KLGQTSMQG IKS YMRQN PQRLAEVLGQN PS Y I FFRELTGSGNDG PVGALGTPLMGEYAGA 
V DRH Y I T LGA P LFVATT H P I TRKALN RL I MAQDTG S A I KG AVRV D Y FWG YG DE AGE LAG K 
QKTTGYVWQLLPNGMKPEYRP* 

ZM31ASBC <SEQ ID 3238> 
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MKKH L FRAAL YG I AAAI LAACQ S K S I QT FPQ P DT S 1 1 KG P DR PAG I P D PAG TT VGGGGA V 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
Y FT P WQ V AGNG S LAGTVTGY Y E P V LKG D DRRT AQ AR F P I YG IPDDFISVPL PAG LR S GKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKA PI LG YAE DPVE LF FMH I QG S GRLK? PS GK Y I R I G Y A DKNEH P YV S I GR YMADKGYL 
klXSQTSMQGIKAYMRC^PQRU^VLGQNPSYVFETlEUVGSGNDGPVGAI^TPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLZMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM32ASBC <SEQ ID 3239> 

MKKH LLRSAL YG I AAAI LAACQS RS I QT FPQ PDT S V ING P DRPAGI P D P AGTTVAGGGAV 
YTVVPHLSMPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LG YAEDPVELFFMH IQGSGRLKT PSGKY I RIG YADKNEH P YVS I GRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGGDGPVGALGTPLMGGYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM33ASBC <SEQ ID 324 0> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTVVPHLSMPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT P I HSFQAKRFFER 
YFTPWQVAGNGS LAGTVTGY YE PVLKGDGRRTERARFPIYG I PDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIG YADKNEH P YVS IGR YMADKGYL 
KLGQTSMQGIKSYMRQNPHKLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
I DRH Y I TLG A PL FV AT AH P V TRKALN RL I MAQ DT GS AI KG AV R V D Y FWG YG DEAGE LAG K 
QKTTGYVWQLLPNGMKPEYRP* 

ZM96 <SEQ ID 324 1> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLS LPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGS LAGTVTGY YE PVLKGDDRRTAQARFP I YG IPDDFISVPL PAG LRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHY ITLGAPL FVATAH PVTRKALNRL IMAQDTGS AI KG AVRVD YFWG YG DEAGE LAGK 
QKTTGYVWQLLPNGMKPEYRP* 



Figure 23 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 919, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 16 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 7: Oligonucleotides used for PCR to amplify complete or partial ORFs 
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ORF primer Sequence 


r> '• — : 1 

Restriction 
sites 


AA1 

UU1 


r orwara CGGGGA I CGGA 1 A 1 G- 1 GGATGGTGCTGGTCAT 


Ram HI 






in a ei 




P r*wr<jf* PPPOPTPH A P-TPPPPTPTTPTrT^r 1 A P 
ivo verse Vv\^v^vjr\^ i v>vj/avj- i vjv^v^Vj lul LLLAL 


Anol 


AAO 


b orwara GGGGGA I GLGA 1 A 1 G-GTCGT ATTCGTGGC 


JJdllLLll- 






rsGci 




ive verse ttv^kjt i luau-aaaa i v^A I OAAGAGGGGC 


AilOl 




r orwara GGGGGA 1 GGGA 1 ATG-G AC AATATTGACATGT 


oajTLrll- 






JNdel 




jvcverse tt toe i cljau-l a i lala 1 CGGGGGG 


Xhol 


006 


Forward GGGGGATCCCATATG-CTGCTGGTGCTGG 


Bamrll- 






Ndel 




o QTrnrro C y C*f > f~±c %r TC s cz&.c y A p'TT , pppp/'" , T , T , T , /"'' AT^T* 
Keverse i^la^vji^ l uvjALj-alj j 1 CHjGC 111 (jA 1 G 1 


Xhol 


U07 


r orwara CGGGGATGGG ATATG-GCCGAC AAC AGCA FC AT 


tsamrii- 






JNdel 




O PVPrCf3 PPPnPTPr^ AH A A PP^PPTTP A TP A T A T A A 

Jveverse t v^ut l luau-aavjuLvj 1 1 GA 1 GA 1 A 1 A AG 


Anol 


AAQ 


Forward LGGGGA 1 GGG A I ATG-AACAACAGACATTTTG 


DaTTlxlJ- 






VfJ„T 

XNaei 




T> W PKf PPPPrPTPPw A Pr P P TPTP PPPVT A A A A P A P 


AnOl 


AAA 


Forward LGGGGA 1 GGG A 1 A 1 G-GGCCGCGCTGCT 








iNaei 




PA WrC A PPPP»PTPPi A P* TPPPTTTTPPP A PrTTTT 

ive verse lv^l-uv/ j i out l 1 1 1 vjLL ALu 1111 


AH01 


A1 1 

Ul 1 


Forward LGGGGATCCCATATG-AAGACACACCGCAAG 


RamT4T 
OaJrilll- 






iNaei 




Reverse CCCCtCTCCt A P-PPPPPTP A PT A PPPT 


Xnol 


012 


Forward PGPPGATPPP AT A TP-PTPPPPPPTTP^P 


u ami 11- 






KMpT 

INU.C1 




Reverse CCCGCTCGAG-AGCGGGGAAGAGGCAC 


Xhol 


013 


Forward CGCGGATCCCATATG-CCTTTGACCATGCT 


BamHI- 

i- » <A111_L J.J. 






Ndel 




Reverse CCCGCTCGAG-CTGATTCGGCAAAAAAATCT 


Xhol 


018 


Forward CGCGGATCCCATATG-CAGCAGAGGCAGTT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GACGAGGCGAACGCC 


Xhol 


019 


Forward AAAGAATTC-CTGCCAGCCGGCAAGACCCCGGC 


EcoRI 




Reverse AAACTGCAG-TCAGCGGGCGGGGACAATGCCCAT 


PstI 


023 


Forward AAAGAATTC-AAAGAATATTCGGCATGGCAGGC 


EcoRI 




Reverse AAACTGCAG-TTACCCCCAAATCACTTTAACTGA 


PstI 


025 


Forward AAAGAATTC-TGCGCCACCCAACAGCCTGCTCC 


EcoRI 




Reverse AAACTGC AG-TCAG AACGCGATATAG CTGTTCGG 


PstI 


031 


Forward CGCGGATCCCATATG-GTCTCCCTTCGCTT 


BamHI- 






Ndel ! 




Reverse CCCGCTCGAG-ATGTAAGACGGGGACAAC 


Xhol 


032 


Forward CGCGGATCCCATATG-CGGCGAAACGTGC 


BamHI- 
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Ndel 

Reverse CCCGCTCGAG-CTGGTTTTTTGATATTTGTG Xhol 

033 Forward CGC GGATCCCATATG -GCGGCGGCAGACA BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATTTGCCGCATCCCGAT Xhol 

034 Forward CGCGGATCCCAT^TG-GCCGAAAACAGCTACGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGACGATTTGGTTCAATT xhol 
036 Forward CGCGGATCCCATATG-CTGAAGCCGTGCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -CCGGACTGCGTATCGG Xhol 

038 Forward CG CGGATCCCATATG -ACCGATTTCCGCCA BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTCTACGCCGTACTGCC Xhol 

039 Forward CGCGGATCCCATATG-CCGTCCGAACCGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TAGGATGACGAGGTAGG Xhol 

041 Forward CGCGGATCCCATATG-TTCGTGCGCGAACCGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCCCAAAAACTCTTTCAAA Xhol 

042 Forward CGCGGATCCCATATG-ACGATGATTTGCTTGC BamHl- 

Ndel 

Reverse CCCGCTCGAG-TTTGCAGCCTGCATTTGAC Xhol 

043 Forward AAAAAAG^TACC-ATGGTTGTTTCAAATCAAAATATC Kpnl 
Reverse AAACTGCAG-TTATTGCGCTTCACCTTCCGCCGC PstI 

043a Forward AAAAAAGGTACC^GCAAAAGTGC ATG GCGGCTTGGACGGTGC Kpn I 
Reverse AAAAA ACTGCAG - p st i 
TTAATCCTGCAACACGAATTCGCCCGTCCG 

044 Forward CG CGGATCCCATATG -CCGTCCGACTAGAG BaraHI- 

Ndel 

Reverse CCCGCTCGAG-ATGCGCTACGGTAGCCA Xhol 

046 Forward AAAGAATTC-ATGTCGGCAATGCTCCCGACAAG Eco RI 
Reverse AAACTGCAG-TCACTCGGCGACCCACACCGTGAA Pst 1 

047 Forward CGCGGATCCCATATG-GTCATCATACAGGCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TCCGAAAAAGCCCATTTTG Xhol 

048 Forward AAAGAATTC-ATGCTCAACAAAGGCGAAGAATTGCC EcoRI 
Reverse AAACTGCAG-TCAAGATTCGACGGGGATGATGCC PstI 

049 Forward AAAGAATTC-ATGCGGGCGCAGGCGTTTGATCAGCC Eco Rl 
Reverse AAACTGCAG-AAGGCGTATCTGAAAAAATGGCAG PstI 

050 Forward CGC GGATCCCATATG -GGCGCGGGCTGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AATCGGGCCATCTTCGA Xhol 
052 Forward AAAAAAGAATTC-ATGGCTTTGGTGGCGGAGGAAAC Eco RI 

Reverse AAAAAAGTQ^-TCAGGCGGCGTTTTTCACCTTCCT Sal I 

052a Forward AAAAAAGAATTC-GTGGGGGAGGAAACGG A A AT AT rear Eco RI 
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Reverse AAAAAACTGCAG-TTAGCTGTTTTTGGAAACGCCGTCCAACCC Pst I 



073 Forward CGC GGATCCCATATG -TGTATGCCATATAAGAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CACCGGATTGTCCGAC Xhol 

075 Forward CGC GGATCCCATATG -CCGTCTTACTTCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCACCAATGCCGATTATTT xhol 

077a Forward AAAAAAGAATTC-GGCGGCATTTTCATCGACACCTTCCT Eco RI 

Reverse AAAAAA CTGCAG -TC AGACGAAC ATCTGCAC AA A C.GC. A A T p s t] 

080 Forward AA AGAATTC -GCGTCCGGGCTGGTTTGGTTTTAC A ATTC. Eco RI 
Reverse AAACTCCAG-CTATTCTTCGGATTCTTTTTCGGG Pst I 

081 Forward AAAGAATTC-ATGAAACCACTGGACCTAAATTTCATCTG EcoRI 
Reverse AAACTGCAG-TCACTTATCCTCCAATGCCTC Pst I 

082 Forward AAAGAATTC-ATGTGGTTGTTGAAGTTGCCTGC Eco RI 
Reverse AAACTGCAG-TTACGCGGATTCGGCAGTTGG Pst I 

084 Forward AAAGAATTC-TATCACCCAGAATATGAATACGGCTACCG EcoRI 
Reverse AAACTGCAG-TTATACTTGGGCGCAACATGA Pst I 

085 Forward CGCGGATCCCATATG-GGTAAAGGGCAGGACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CAAAGCCTTAAACGCTTCG Xliol 

086 Forward AAAAA AGGTACC -TATTTGGCATCAAAAGAAGGCGG Kpn I 
Reverse AAA CTGCAG -TTACTCCACCCGATAACCGCG Pstl 

087 Forward AA AGAATTC -ATGGGCGGTAAAACCTTTATGC Eco RI 
Reverse AAACTGCAG-TTACGCCGCACACGCAATCGC Pstl 

087a Forward AAAAAAGAATTC-AAGCTATl'AGGCGTGCCGATTGTGATTCA Eco RT 

Reverse AAAAAACTGCAG-TTACGCCTGCAAGATGCCCAGCTTGCC Pst I 

088 Forward AAAAAAGAATTC-ATGTT1TTATGGCTCGCACATTTCAG Eco RI 
Reverse AAAAAACTCCAG-TCAGCGGATTTTGAGGGTACTCAAACC Pst I 

089 Forward CGC GGATCCCATATG -CCGCCCAAAATCAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TGCGCATACCAAAGCCA Xhol 

090 Forward CGC GGATCCCATATG -CGCATAGTCGAGCA BamHl- 

Ndel 

Reverse CCCG CTCGAG -AGCAAAACGGCGGTACG Xhol 

091 Forward AAAGAATTC-ATGGAAATAGCCGTACCGCCGAGTCC Eco RI 
Reverse AAACTGCAG-TCAGCGCAGGGGGTAGCCCAAGCC Pstl 

092 Forward AAAGAATTC-ATGTTTTTTATTTCAATCCG EcoRI 
Reverse AAA CTGCAG -TCAAATCTGTTTCGACAATGC Pstl 

093 Forward AAAGAATTC-ATGCAGAATTTTGGCAAAGTGGC EcoRI 
Reverse AAACTGCAG-CTATGGCTCGTCATACCGGGC Pstl 

094 Forward AAAGAATTC-ATGCCGTCACGGAAGCGCATCAACTC Eco RI 
Reverse AAACTGCAG-TTATCCCGGCCATACCGCCGAACA Pstl 

095 Forward AAAGAATTC-ATGTCCTTTCATTTGAACATGGACGG Eco RI 
Reverse AAACTGCAG-TCAACGCCGCAGGCACTAACGCCC Pst 1 

096 Forward AAAGAATTC-ATGGCTCGTCATACCGGGCAGGG Eco RI 
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Reverse AA ACTGCAG -TCAAAGGAAAAGGCCGTCTGA A A Anrr. j^j 

097 Forward AAAGAATTC-ATGGACACTTCAAAACAAACACTGTTG Eco RI 
Reverse AAACTGCAG-TCAGCCCAAATACCAGAATTTCAG Pst I 

098 Forward AAAGAATTC-GATGAACGCAGCCCAGCATGGATACG Eco Rl 
Reverse AAACTGCAG-TTACGACATTCTGATTTGGCA p s tl 

102 Forward AAAAAAGAATTC-GGCCTGATGATTTTGGAAGTCAACAC Eco RI 

Reverse AAAAAACTCCAG-TTATCCTTTAAATACGGGGACGAGTTC Pstl 

105 Forward CGCGGATCCCATATG-TCCGCAAACGAATACG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GTGTTCTGCCAGTTTCAG Xhol 

107 Forward AAAAAA GAATTC - Eco m 

CTGATGATTTTGGAAGTCAACACCCATTATCC 

Reverse AAAAAACTGCA^-TTATCCTTTAAATACGGGGACGAGTTC Pstl 

107b Forward AAAAAA GAATTC - EcoPvI 

GATACCCAAGCCCCCGCCGGCACAAACTACTG 
Reverse AAAAAA CTGCAG - p stI 

TTACGCGTCGCCTTTAAAGTATTTGAGCAGGCTGGAGAC 

108 Forward AAAGAATTC-ATGTTGCCGGGCTTCAACCG EcoRI 
Reverse AAACTGCAG-TTAGCGGTACAGGTGTTTGAAGCA Pst I 

108a Forward AAAAAAGAATTC-GGTAACACATTCGGCAGCTTAGACGGTGG EcoRI 
Reverse AAACTGCAG-TTAGCGGTACAGGTGTTTGAAGCA Pstl 

109 Forward AAAGAAJTC-ATGTATTATCGCCGGGTTATGGG EcoRI 
Reverse AAACTGCAG-CTAGCCCAAAGATTTGAAGTGTTC Pstl 

111 Forward CGCGGATCCCATATG-TGTTCGGAACAAACCGC BamHl- 

Ndel 

Reverse CCCG CTCGAG -GCGGAGCAGTTTTTCAAA xhol 
114 Forward CGCGGATCCCATATG-GCTTCCATCACTTCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CATCCGCGAAATCGTC Xhol 

117 Forward AAAAAAG^TACC-ATGGTCGAAGAACTGGAACTGCTG Kpnl 
Reverse AAACTGCAG-TTAAAGCCGGGTAACGCTCAATAC Pst I 

118 Forward AAAGjrcGACATGTGTGAGTTCAAGGATATTATAAG Sail 
Reverse AAAGCAl^C-CTATTTTTTGTTGTAATAATCAAATC Sph I 

121 Forward CGCGGAJCCCATATG-GAAACACAGCTTTACAT BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATAATAATATCCCGCGCCC Xhol 

122 Forward CG CGGATCCCATATG -GTCATGATTA A AA7C.C.GC. A BamHI- 

Ndel 

Reverse CCCGCl^GAG-AATCITGGTAGATTGGATTT Xhol 

125 Forward AAAGAATTC-ATGTCGGGCAATGCCTCCTCTCC Eco RI 
Reverse AAACTGCAG-TCACGCCGTTTCAAGACG Pst I 

125a Forward AAAAAAGAATTC-ACGGCAGGCAGCACCGCCGCACAGGT1TC Eco RI 
Reverse AAAAA ACTGCAG - PstI 

TTATTTTGCCACGTCGGTTTCTCCGGTGAACAACGC 

126 Forward CGCGGATCCCATATG-CCGTCTG A A A C.C.C. BamHI- 
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Ndel 

Reverse CCCGCTCGAG-ATATTCCGCCGAATGCC Xhol 

127 Forward AA AGAATTC - ATGG AAATATGGAAT ATGTTGG A C A C.TTG EcoRl 
Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC PstI 

127a Forward AAAAAAGAATTC-AAGGAACTGATTATGTGTCTGTCGGG Eco RI 

Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC PstI 

128 Forward CGCGGATCCCATATG-ACTGACAACGCACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -GACCGCGTTGTCGAAA Xhol 

130 Forward CGC GGATCCCATATG -AAACAACTCCGCGA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GAATTTTGCACCGGATTG Xhol 

132 Forward AAAGAATTC- ATGGAACCCTTCAAAACCTTAATTTG Eco RI 

Reverse AAAAAA CTGCAG -TCACCATGTCGGCATTTGAAA A AC PstI 

134 Forward CGC GGATCCCATATG -TCCCAAGAAATCCTC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CAGTTTGACCGAATGTTC Xhol 

135 Forward CGC GGATCCCATATG -AAATACAAAAGAATCGTATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAATTCGGTCAGAAGCAGG Xhol 

137 Forward AAAAAAGGTACC-ATGATTACCCATCCCCAATTCGATCC Kpn I 
Reverse AAAAA ACTGCAG -TCAGTGCTGTTTTTTCATGCCG A A Pst I 

137a Forward AAAAAAGAATTC-GGCCGCAAACACGGCATCGGCTTCCT EcoRI 

Reverse AAAAAACTGCAG-TTAAGCGGGATGACGCGGCAGCATACC PstI 

138 Forward AAAAAA GAATTC -AACTCAGGCGAAGGAGTGCTTGTGGC Eco RI 
Reverse AAAAAA TCTAGA- TCAGTTTAGGGATAGCAGGCGTAC Xbal 

141 Forward AAAGAATTC-ATGAGCTTCAAAACCGATGCCGAAATCGC Eco RI 
Reverse AAACTGCAG-TCAGAACAAGCCGTGAATCACGCC Pst I 

142 Forward CGC GGATCCCATATG -CGTGCCGATTTCATG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAACTGCTGCACATGGG Xhol 

143 Forward AAAAA AGAATTC - Eco RI 

ATGCTCAGTTTCGGCTTTCTCGGCGTTCAGAC 

Reverse AAAAAACTGCAG-TCAAACCCCGCCGTGTGTTTCTTTAAT PstI 

144 Forward AAAAAA GAATTC -GGTCTGATCGACGGGCGTGCGGTAAG EcoPJ 
Reverse AAAAA ATCTAGA- TCGGCATCGGCCGGCATATGTCGG Xba I 

146 Forward AAAAA AGAATTC- Eco RI 

CGCCAAGTCGTCATTGACCACGACAAAGTC 

Reverse AAAAAACTGCAG-TTAGGCATCGGCAAATAGGAAACTGGG Pst I 

147 Forward AAAAAA GAATTC -ACTGAGCAATCGGTGGATTTGGA A AC EcoRI 
Reverse AAAAAATCTAGA-TTAGGTAAAGCTGCGGCCCATTTGCGG Xbal 

148 Forward AAAAAA GAATTC - EcoRI 

ATGGCGTTAAAAACATCAAACTTGGAACACGC 

Reverse AAAAAA TCTAGA -TCAGCCCTTCATACAGCCTTCGTTTTG Xbal 

149 Forward CGCGGATCCCATATG-CTGCTTGACAACAAAGT BamHI- 
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Ndel 

Reverse CCC GCTCGAG -AAACTTCACGTTCACGCC Xhol 

150 Forward CGC GGATCCCATATG -CAGAACACAAATCCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATAAACATCACGCTGATAGC Xhol 

151 Forward AAAAAAGAATTC- EcoRI 

ATGAAACAAATCCGCAACATCGCCATCATCGC 
Reverse AAAAAACTGCAG-TCAATCCAGCTTTTTAAAGTGGCGGCG PstI 

152 Forward AAAAA AGAATTC - EcoRI 

ATGAAAAACAAAACCAAAGTCTGGGACCTCCC 
Reverse AAAAAACTGCAG-TCAGGACAGGAGCAGGATGGCGGC PstI 

153 Forward AAAAA AGAATTC -ATGGCGTTTGCTTACGGTATGAC EcoRI 
Reverse AAAAAACTGCAG-TCAGTCATGTTTTTCCGTTTCATT PstI 

153a Forward AAAAAAGAATTC-CGGACTTCGGTATCGGTTCCCCAGCATTG EcoRI 
Reverse AAAAA ACTGCAG - p st j 
TTACGCCGACGAAATACTCAGACTTTTCGG 

154 Forward CGC GGATCCCATATG -ACTGACAACAGCCC BamHl- 

Ndel 

Reverse CCCG CTCGAG -TCGGCTTCCTTTCGGG Xhol 

155 Forward AAAAAAGAATTC- ATGAAAATCGGTATCCCACGCGAGTC Eco RI 
Reverse AAAAA ACTGCAG -TTACCCTTTCTTAAAC ATATTC AGC A T p s t I 

156 Forward AAAAAA GAATTC -GCACAGCAAAACGGTTTTGAAGC Eco RI 
Reverse AAAAAACTGCAG-TCAAGCAGCCGCGACAAACAGCCC Pst I 

157 Forward CGC GGATCCCATATG -AGGAACGAGGAAAAAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -AAAACACAATATCCCCGC Xhol 

158 Forward AAAAAAGAATTC-GCGGAGCAGTTGGCGATGGCAAATTCTGC EcoRI 
Reverse AAAAAA TCTAGA -TTATCCACAGAGATTGTTTCCCAGTTC Xba I 

160 Forward CG CGGATCCCATATG -GACATTCTGGACAAAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TTrTTGCCCGCCTTCTTT Xhol 

163 Forward AAAAAAG^JACC-ACCGTGCCGGATCAGGTGCAGATGTG Kpn I 
Reverse AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG Xba I 

163a Forward AAAAAAGAATTC-CGGCTGGTGCAGATAATGAGCCAGAC EcoRI 
Reverse AAAAAATCIAGA-TTACTCTGCCAATTCCACCTGCTCGTG Xba I 

164 Forward CGC GGATCCCATATG -AACCGGACTTATGCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGTTTCCGTCAAACTGC Xhol 

165 Forward CGCGGATCCGCTAGC-GCTGAAGCGACAGACG BaraHI- 

Nhel 

Reverse CCCG CTCGAG -AATATCCAATACTTTCGCG Xhol 
206 Forward CG CGGATCCCATATG -AAACACCGCCAACCGA BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC Xhol 
209 Forward CGC GGATCCCATATG -CTGCGGCATTTAGGA BamHl- 
Ndel 
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"DpvprQf* pppPrPTPf T a fx-T a cccttci a a pu^:p a at 


Xhol 


211 


Forward AAAAAAGAATTC-ATnTTnPnnnTTPPTOPTOP 


Eco RI 




Keverse aaaaaal i utAU-t I A 1 CC 1 GLGG ATTGGCATTG A A A 


PstI 


212 


Forward CGCGGATCCCATATG-GACAATCTCGTATGG 


BamHI- 






Ndel 




Keverse va^CUC l COAU-ALjuuG 1 1 AGA 1 CLTTCC 


Xhol 


215 


Forward CGCGGATCCCATATG-GCATGGTTGGGTCGT 


BamHI- 






Ndel 




Reverse LLLlrL J LuAu-tA 1 A 1 L u ITGTATC ATAAATC 


Xhol 


216 


Forward CGCGGATCCCATATG-GCAATGGCAGAAAACG 


BamHI- 






Ndel 




Reverse CCCGC I CGAG- 1 ACAATCCGTGCCGCC 


Xhol 


217 


Forward CGCGGATCCCATATG-GCGGATGACGGTGTG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ACCCCGAATATCGAATCC 


Xhol 


218 


Forward CGCGGATCCCATATG-GTCGCGGTCGATC 


BajnHl- 






Ndel 




Reverse CGGGCTCGAG-TAACTCATAGAATCCTGC 


Xhol 


219 


Forward CGCGGATCCGCTAGC-ACGGCAAGGTTAAG 


BamHI- 






Nhel 




d pvprc p r^pr^r^r^Tr^n a p ttt a a a pp a tptpptp a a a a o 
Keverse ^c**A_n^ i 1 1 1 AAAtLA 1 CI GC 1 GAAAAC 


Xhol 


993 


Forward PfrfYrO ATPPPATATP PtAATTPAnnPAPPA apta 


Trj TTT 

BamHI- 






Ndel 




ppvorcp rrrnrTrn a n pppttpppppptptp 
rve verse v^v^v^vjv_, i i j la,L-IjvAj 1 G I G 


Xhol 


99S 


Fnrward PrtPfTfrATPPP ATATH fl A PP A PTTP A PP A A 


T** TTT 

BamHI- 






Ndel 




Rpvptcp PPPPPTPPiAP pttpapa a a nc^r^r^r* A o 


Xhol 


996 


Forward A A A OA ATTP-PTTPPO ATT ATPPTPPPiP a rrznn 


Eco RI 




Reverse AAACIGCAG-ICaAAAATCCCAAAACGGGGAT 


PstI 


228 


Forward CGCGGATCCCATATG-TCGCAAGAAGCCAAACAG 


BamHI- 






Ndel 




Davptcp rPPPPTPP A P TTTPPPPPP A TPTTTP A T 

tveverse v>v^i^ov^ i v^lx/\vj- ill vjUCvjLjCA Iti 1 1 CA 1 


Xhol 


990 


Ffir\x/<Mv1 PPPPP A TPPP ATA TP P A A P A PPTTTTPPPP 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ACACAATATAGCGGATGAAC 


yVllUl 


210 


Forward PPPPP A TPPP ATA TPJ-P A TPPPPPTPPPP A P 


T~i _ TTT 

BamHI- 






Ndel 




Reverse CCCGCTCGAG-AAGTTTGGCGGCTTCGG 


Xhol 


232 


Forward AAAAAAGAATTC-ATGTACGCTAAAAAAGGCGGTTTGGr, 


EcoRI 




Reverse AAAAAACTGCAG-TCAAGGTTTTTTCCTGATTGCCGCCGC 


PstI 


232a 


Forward AAAAAAGAATTC-GCCAAGGCTGCCGATACACAAATTGA 


EcoRI 




Reverse AAAAAACTGCAG-TTAAACATTGTCGTTGCCGCCCAGATG 


PstI 


233 


Forward CGCGGATCCCATATG-GCGGACAAACCCAAG 


BamHJ- 






Ndel 




Reverse CCCGCTCGAG-GACGGCATTGAGCAG 


Xhol 


234 


Forward CGCGGATCCCATATG-GCCGTTTCACTGACCG 


BamHI- 
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Ndel 




Reverse GCCCAAGCTT-ACGGTTGGATTGCCATG 


Hind in 


235 


Forward CGCGGATCCCATATG-GCCTGCCAAGTTCAAA 








I^UCl 




Reverse CCCGCTCGAG-TTTGGGCTGCTCTTC 


Xhol 


236 


Forward CGCGGATCCCATATG-GCGCGTTTCGCCTT 


JD (XI 111 LL 










Reverse CCCGCTCGAG-ATGGGTCGCGCGCCGT 


Xhol 


238 


Forward CGCGGATCCGCTAGC-AACGGTTTGGATGCCCG 


RamHT- 

Uallllll" 






iniigi 




Reverse CCCGCTCGAG-TTTGTCTAAGTTCCTGATATG 


Xhol 


239 


Forward CCGGAATTCTACATATG-CTCCACCATAAAGGTATTG 








i>uei 




Reverse CCCGCTCGAG-TGGTGAAGAGCGGTTTAG 


Xhol 


240 


Forward CGCGGATCCCATATG-GACGTTGGACGATTTC 


Dainrix- 






iNaei 




Reverse CCCGCTCGAG-AAACGCCATTACCCGATG 


Xhol 


241 


Forward CCGGAATTCTACATATG-CCAACACGTCCAACT 


ncorvi- 






iNtiei 




Reverse CCCGCTCGAG-GAATGCGCCTGTAATTAATC 


Xhol 


242 


Forward CGCGGATCCCATATG-ATCGGCAAACTTGTTG 


OdlllOl 










Reverse GCCCAAGCTT-ACCGATACGGTCGCAG 


Hindlll 


243 


Forward CGCGGATCCCATATG-ACGATTTTTTCGATGCTGC 


Od.HUi.l- 






INUCl 




Reverse CCCGCTCGAG-CGACTTGGTTACCGCG 


Xhol 


244 


Forward CGCGGATCCCATATG-CCGTCTGAAGCCC 


OdJILTll- 






1NQC1 




Reverse CCCGCTCGAG-TTTTTTCGGTAGGGGATTT 


Xhol 


246 


Forward CGCGGATCCCATATG-GACATCGGCAGTGC 


OdlllXXl- 






iNuei 




Reverse CCCGCTCGAG-CCCGCGCTGCTGGAG 


Xhol 


247 


Forward CGCGGATCCCATATG-GTCGGATCGAGTTAC 


Ddllini- 






MHpT 




Reverse CCCGCTCGAG-AAGTGTTCTGTTTGCGCA 


Xhol 


248 


Forward CGCGGATCCCATATG-CGCAAACAGAACACT 


oailiril- 






iNaei 




Reverse CCCGCTCGAG-CTCATCATTATTGCTAACA 


Xhol 


249 


Forward CGCGGATCCCATATG-AAGAATAATGATTGCTTC 


Odlllxll- 






iNaei 




Reverse CCCGCTCGAG-TTCCCGACCTCCGAC 


Xhol 


251 


Forward CGCGGATCCCATATG-CGTGCTGCGGTAGT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TACGAAAGCCGGTCGTG 


Xhol 


253 


Forward AAAAAAGAATTC-ATGATTGACAGGAACCGTATnrTnrn 


EcoRI 




Reverse AAAAAACTGCAG-TTATTGGTCTTTCAAArnrrrTTrrrn 


PstI 
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253a Forward AAAAAAGAATTC-A/l\ATCCTTTTGAAAACAAGCGAAAACGG Eco RI 



Reverse AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG PstI 

254 Forward AAAAAAGAATTC-ATGTATACAGGCGAACGCTTCAATAC Eco RI 
Reverse AAAAAATCTAGA-TCAGATTACGTAACCGTACACGCTGAC Xbal 

255 Forward CGC GGATCCCATATG -GCCGCGTTGCGTTAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCGCAATACCGACCAG Xhol 

256 Forward CGC GGATCCGCTAGC -TTTTAACACCGCCGGAC BamHI- 

Nhel 

Reverse CCC GCTCGAG -ACGCCTGTTTGTGCGG Xhol 

257 Forward CGC GGATCCCATATG -GCGGTTTCTTTCCTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCGCGTGAATATCGCG Xhol 

258 Forward AAAAAAGAAjnG-GATTATTTCTGGTGGATTGTTGCGTTCAG EcoRI 
Reverse AAAAA ACTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG PstI 

258a Forward AAAAAAGAATTC-GCGAAGGCGGTGGCGCAAGGCGA Eco RI 

Reverse AAAAA ACTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG PstI 

259 Forward CGC GGATCCCATATG -GAAGAGCTGCCTCCG BamHl- 

Ndel 

Reverse CCCGCTCGAG-GGCTTTTCCGGCGTTT Xhol 

260 Forward CG CGGATCCCATATG -GGTGCGGGTATGGT BamHl- 

Ndel 

Reverse CCC GCTCGAG -AACAGGGCGACACCCT Xhol 

261 Forward AAAAAAGAATTC-CAAGATACAGCTCGGGCATTCGC Eco RI 
Reverse AAAAAACTGCAG-TCAAACCAACAAGCCTTGGTCACT PstI 

263 Forward CGC GGATCCCATATG -GCACGTTTAACCGTA BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCGTAAGCCTGCAATT Xhol 

264 Forward AAAAA AGGTACC -GCCGACGCAGTGGTCAAGGC AG A A Kpn I 
Reverse AA ACTGCAG -TCAGCCGGCGGTCAATACCGCCCG Pst I 

265 Forward AAAAA AGAATTC -GCGGAGGTCAAGAGAAGGTGTTTG Eco RI 
Reverse AAAAAACTGCAG-TTACGAATACGTCGTCAAAATGGG Pst I 

266 Forward AA AGAATTC -CTCATCTTTGCCAACGCCCCCTTC Eco RI 
Reverse AA ACTGCAG -CTATTCCCTGTTGCGCGTGTGCCA Pst I 

267 Forward AAAGAATTC-TTCTTCCGATTCGATGTTAATCG EcoRI 
Reverse AA ACTGCAG- TTAGTAAAAACCTTTCTGCTTGGC PstI 

269 Forward AA AGAATTC -TGCAAACCTTGCGCCACGTGCCC Eco RI 
Reverse AAACTGCAG-TTACGAAGACCGCAACGAAAGGCAGAG PstI 

269a Forward AAAAAAGAAITC-GACTTTATCCAAAACACGGCTTCGCC EcoRI 
Reverse AAACTGCAG-TTACGAAGACCGCAACGAAAGGCAGAG PstI 

270 Forward AAAGAATTC-GCCGTC AA GCTCGTTTTGTTGCAATG Eco RI 
Reverse AAACTGCAG-TTATTCGGCGGTAAATGCCGTCTG PstI 

271 Forward CGC GGATCCCATATG -CCTGTGTGCAGCTCGAC BamHI- 

Ndel - 

Reverse CCCGCTCGAG-TCCCAGCCCCGTGGAG Xhol 
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272 Forward AAAGAATTC-ATGACCGCAAAGGAAGAACTGTTCGC EcoRI 
Reverse AAACTGCAG-TCAGAGCAGTTCCAAATCGGGGCT PstI 

273 Forward AAAGAATTC-ATGAGTCTTCAGGCGGTATTTATATACCC EcoRI 
Reverse AA ACTGCAG -TTACGCGTAAGAAAAAACTGC p s tl 

274 Forward CGCGGATCCCATATG-ACAGATTTGGTTACGGAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TTTGCTTTCAGTATT ATTG A A Xhol 

276 Forward AAAAA AGAATTC - ' EcoRI 

ATGATTTTGCCGTCGTCCATCACGATGATGCG 
Reverse AAAAAACTGCAG-CTACACCACCATCGGCGAATTTATGGC Pstl 

277 Forward AAAAAAGAATTC-ATGCCGCGCTTTGAGGACAAGCTCGTAGG Eco RI 
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA Pstl 

277a Forward AAAAAAGAATTC-GGGGCGGCGGCTGGGTTGGACGTAGG Eco RI 
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA Pst I 

278 Forward AAAAAAfJGTACC-GTCAAAGTTGTATTAATCGGGCCTTTGCC Kpn I 
Reverse AAAAAACTGCAG-TCATTCAACCATATCAAATCTGCC Pstl 

278a Forward AAAAAAGAATTC-AAAACTCTCCTAATTCGTCATAGTCG Eco RI 
Reverse AAAAAACTGCAG-TCATTCAACCATATCAAATCTGCC Pstl 

279 Forward CGC GGATCCCATATG -TTGCCTGCAATCACGATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTAGAAGCGGGCGGCAA Xhol 

280 Forward AAAAA AGGTACC -GCCCCCCTGCCGGTTGTAACCAG Kpn I 
Reverse AAAAA ACTGC AG -TTATTGCTTCATCGCGTTGGTC A A GGC. Pstl 

281 Forward AAAAAAGAATTC-GCACCCGTCGGCGTATTCCTCGTCATGCG EcoRI 
Reverse AAAAAA TCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

281a Forward AAAAAAGAATTC-TCCTACCACATCGAAATTCCTTCCGG Eco RI 
Reverse AAAAAATCTAGA-GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

282 Forward AAAAAAGAATTC-CTTTACCTTGACCTGACCAACGGGCACAG Eco RI 
Reverse AAAAAACTGCAG-TCAACCTGCCAGTTGCGGGAATATCGT Pst I 

283 Forward CGC GGATCCCATATG -GCCGTCTTTACTTGGAAG BamHl- 

Ndel 

Reverse CCCG CTCGAG -ACGGCAGTATTTGTTTACG Xhol 

284 Forward CGC GGATCCCATATG -TTTGCCTGCAAAAGAATCG Bamffl- 

Ndcl 

Reverse CCC GCTCGAG -CCGACTTTGCAAAAACTG Xhol 

286 Forward CGC GGATCCCATATG -GCCGACCTTTCCGAAAA BamHI- ' 

Ndel 

Reverse CCCG CTCGAG -GAAGCGCGTTCCCAAG Xhol 

287 Forward CCG GAATTC T AGCTAGC -CTTTCAGCCTGCGGG EcoRI- 

Nhel 

Reverse CCCG CTCGAG -ATCCTGCTCTTTTTTGCC Xhol 

288 Forward CGCGGATCCCATATG-CACACCGGACAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGTATCAAAGACTTGCGT Xhol 
290 Forward CGCGGATCCCATATG-GCGGTTTGGGGCGGA BamHI- 
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Ndel 

Reverse CCCG CTCGAG -TCGGCGCGGCGGGC Xhol 
292 Forward CGC GGATCCCATATG -TGCGGGCAAACGCCC Bamffl- 

Ndel 

Reverse CCCG CTCGAG -TTGATTTTTGCGGATGATTT xhol 

294 Forward AAAAAAGAATTC-GTCTGGTCGATTCGGGTTGTCAGAAC Eco RI 
Reverse AAAAAACTGCAG-TTACCAGCTGATATAAAACATCGCTTT PstI 

295 Forward CG CGGATCCCATATG- AACCGGCCGGCCTCC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CGATATTTGATTCCGTTGC Xhol 

297 Forward AAAAAAGAATTC-GCATACATTGCTTCGACAGAGAG Eco RI 
Reverse AAAAA ACTGCAG -TCAATCCGATTGCGACACGGT p s tl 

298 Forward AAAAAAGAATTC-CTGATTGCCGTGTGGTTCAGCCAAAACCC Eco RI 
Reverse AAAAAACTCCAG-TCATGGCTGTGTACTTGATGGTTGCGT Pstl 

299 Forward CGCGGATOCGCTAGC-CTACCTGTCGCCTCCG BamHI- 

Nhel 

Reverse CCCG CTCGAG -TTGCCTGATTGCAGCGG Xhol 
302 Forward AAAAAAGAATTC-ATGAGTCAAACCGATACGCAACG Eco RI 

Reverse AAAAAACTGCAG-TTAAGGTGCGGGATAGAATGTGGGCGC Pstl 

305 Forward AAAAAAG^TACC-GAATTTTTACCGATTTCCAGCACCGGA Kpn I 
Reverse AAAAAACTCCAG-TCATTCCCAACTTATCCAGCCTGACAG Pstl 

305a Forward AAAAAAGGTACC-TCCCGTTCGGGCAGTACGATTATGGG Kpn I 
Reverse AAAAAACTGCAG-TTACAAACCGACATCATGCAGGGTGAA Pstl 

306 Forward CGC GGATCCCATATG -TTTATGAACAAATTTTCCC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CCGCATCGGCAGAC Xhol 
308 Forward CGC GGATCCCATATG -TTAAATCGGGTATTTTATr Bamlll- 

Ndel 

Reverse CCC GCTCGAG -ATCCGCCATTCCCTGC Xhol 

311 Forward AAAAAAGCTACC-ATGTTCAGTTTTGGCTGGGTGTTT Kpn I 
Reverse AAACTCCAG-ATGTTCATATTCCCTGCCTTCGGC Pstl 

312 Forward AAAAAAGGTACC-ATGAGTATCCCATCCGGCGAAATT Kpn I 
Reverse AAACTGCAG-TCAGTTTTTCATCGATTGAACCGG Pstl 

313 Forward AAAAAAGAATTC-ATGGACGACCCGCGCACCTACGGATC EcoRI 
Reverse AAAAAACTGCAG-TCAGCGGCTGCCGCCGATTTTGCT Pstl 

401 Forward CG CGGATCCCATATG -AAGGCGGCAACACAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCTTACGTTTTTCAAAGCC Xhol 

402 Forward AAAAAAGAATTC-GTGCCTCAGGCATTTTCATTTACCCTTGC Eco RI 
Reverse AAAAAATCTAGA-TTAAATCCCTCTGCCGTATTTGTATTC Xba l 

402a Forward AAAAAAGAATTC-AGGCTGATTGAAAACAAACACGG Eco RT 

Reverse AAAAAATCTAGA-TTAAATCCCTCTGCCGTATTTGTATTC Xbal 
406 Forward CGCGGATCCCATATC-TGCGGGACACTGACAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGTTGTCCTTGTCTATG Xhol 
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Forward CGCGGATCCCATATG-GCAGGCGGAGATGGC 


BairiHI- 








rvdei 




Reverse 


tttot 1 LUAu-u(j 1 G 1 OA rGTTCACCC 


Xhol 




rorwarti 


LULuuA l \^v^v>/\ I A 1 U-vj 1 AGACGCGC 1 1 AAGCA 


BaiTiHI- 








in del 




Reverse 


CCCGC J CuAU-AGCTGCATGGCGGCG 


Xbol 




Forward 


LutuoA 1 LLLA 1A1G-1G11 CGGGGAAAGGCG 


Bamlil- 








Mdel 




Reverse 


ppppp'T , /''"' , /"" , a p r^r^r^ r^r* /™i a ■ i "I'nnTTm/i a 

CCCGC J CCAG-CCGCGCATTCCTCGCA 


Xhol 




rorwara 


PPPPP, A TPPP ATA TV" A C^i^r^ a t a tt/~* a a /^"-n/ - ^ a *-i/-t 

L-OCULrA I CL/^A 1 A 1 G- AGCGATATTG AAG T G ACG 


BamHI- 








IN del 




Reverse 


GCCCAAGCTT-TGATTCAAGTCCTTGCCG 


Hindlll 


jUj 


Forward 


PPPP^P A TPPP ATA TP 1 * H* 1 1" l '/"^/"^T"*! v r a o a a ttti k 

l,ul,<jua j CCCA 1A1G-I 1 TCG 1 TT AC AATTC AGG 


BamHI- 








IN del 




Reverse 


CCCGCTCGAG-CGGCGTTTTATAGCGG 


Xhol 


CIA 

MU 


Forward 


tuLOuA 1 CCC A 1 A rG-CCTTCGCGGACAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGCACTGGCAGCG 


Xhol 


jIZ 


Forward 


CGCGGA 1 CCCATATG-GGACATGAAGTAACGGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGGAATAGCCTTTGACG 


Xhol 


< 1 ^ 


Forward 


CGCGGA 1 CCCATATG-GAGGAAATAGCCTTCGA 


BamHI- 








XT J T 

Ndel 




Reverse 


CCCGCTCGAG- aa ATGrrnr AAA Gr A TP 


Anol 


516 


rorwara 


PPPHPATPPP ATA TP" 1 TPT A /^ , /^ ,r T r T , 0 A r T , r"'"T ,r n/"^ tv-' 

UUL/VjUA 1 LttA 1 A 1 G- 1 G 1 ACG 1 lGATGTTGTGG 


D« m Ul 

oainiii- 








iNdei 




Reverse 


CCCGCTCGAG-TTTGCGGr. CCrGC A TP 


AilOl 


517 


r orwdi u 


PPPPiP A TPPP AT A TP PpTA A APPTPTPPA a a t» a 


DnmTJT 

DclUlxll- 








iNuei 




Reverse 


CCCGCTCGAG-GTGPGPPPAGPPGT 


Anol 


518 


rorwara 


A A APwA ATTP PtPTTTTTT A PTPPTPPP A r*r*r*r* a a r*r* 

aaauaa 1 1 t-UL llllll AC 1 GC 1 CCGACCGGaAAGG 


13CO ivl 




Reverse 


AAAC 1 GCAG-TC AAATTTCAGACTCTGCCAC 


Tk_A T 

Pst I 


51 0 


r orwara 


PPPPP A TPPP A T A TP TTP A A A TPPTTTnTnoTn a 

tutUUA 1 v>CC A 1 A 1 U- 1 1 C AAATCCTTTGTCG 1 C A 


BamHI- 








iNdel 




Reverse 


CCCGCTCGAG-TTTGGCGGTTTTGCTGC 


Xhol 


jZU 


rorwara 


PPPPP A TPPP ATA T*P /^/^T/^/^/~l/^« r P'T , /^«'T^T' , x>/^ a 

LULuuA l LtLA 1 A 1 u-CC 1 GCGCTTCTTTC A 


BamHI- 








VTJ.I 

iNael 




Reverse 


CCCGCTCGAG-ATATTTACATTTCAGTCGGC 


Xhol 


521 


Forward 


CGCGGATCCCATATG-GCCAAAATCTATACCTGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CATACGCCCCAGTTCC 


Xhol 


522 


Forward 


CGCGGATCCCATATG-ACTGAGCCGAAACAC 


BamHl- 








Ndel 




Reverse 


GCCC AAGCTT-TTCTG ATTTC A AATCGGC A 


Hindlll 


523 


Forward 


CGCGGATCCCATATG-GCTCTGCTTTCCGCG 


BamHl- 








Ndel 
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Reverse CCC GCTCGAG -AGGGTGTGTGATAATAAGAAO Xhol — 

525 Forward CGC GGATCCCATATG- GCCGAAATGGTTC A A ATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCCGTGCATATCATAAA Xhol 
527 Forward AAAGAATTC-TTCCCTCAATGTTGCCGTTTTCG Eco RI 

Reverse AAA CTGCAG -TTATGCTAAACTCGAAAC A A A TTC p st l 

529 Forward CGC GGATCCGCTAGC- TGCTCCGGCAGCA A A AC BamHI- 

Nhel 

Reverse GCCC AAGCTT -ACGCAGTTCGGAATGGAG Hindlll 

530 Forward CGC GGATCCCATATG -AGTGCGAGCGCGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACGACCGACTGATTCCG Xhol 

531 Forward AAAAAAGAATTC-TATGCCGCCGCCTACCAAATCTACGG EcoRI 
Reverse AAAAAACTGCAG-TTAAAACAGCGCCGTGCCGACGACAAG Pstl 

532 Forward AAAAAAGAAJTC-ATGAGCGGTCAGTTGGGCAAAGGTGC Eco RI 
Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA Pstl 

532a Forward AAAAAAGAATrC-TTGGGTGTCGCGTTTGAGCCGGAAGT Eco RI 
Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA Pstl 

535 Forward AAA GAATTC -ATGCCCTTTCCCGTTTTCAGAC EcoRI 
Reverse AAACTGCAG-TCAGACGACCCCGCCTTCCCC Pstl 

537 Forward CGC GGATCCCATATG -CATACCCAAAACCAATm BamHl- 

Ndel 

Reverse CCCGCTCGAG-ATCCTGCAAATAAAGGGTT Xhol 

538 Forward CGC GGATCCCATATG -GTCGAGCTGGTCA A AGP BamHI- 

Ndel 

Reverse CCC GCTCGAG -TGGCATTTCGGTTTCGTC Xhol 

539 Forward CGC GGATCCGCTAGC -GAGGATTTGC AGG A A A BamHI- 

Nhel 

Reverse CCCGCTCGAG-TACCAATGTCGGCAAATC Xhol 

542 Forward AAAGAATTC-ATGCCGTCTGAAACCGTGTC Eco RI 
Reverse AAA CTGCAG -TTACCGCGAACCGGTCAGGAT p st i 

543 Forward AAAAAAGAATTC-GCCTTCGATGGCGACGTTGTAGGTAC EcoRI 
Reverse AAAAA ATCTAGA- Xbal 

TTAATGAAGAAGAACATATTGGAATTTTGG 
543a Forward AAAAAAGAATTC-GGCAAAACTCGTCATGAATTTGC Eco RI 

Reverse AAAAA ATCTAGA - Xbal 
TTAATGAAGAAGAACATATTGGAATTTTGG 

544 Forward AAAGAAJTC-GCGCCCGCCTTCTCCCTGCCCGACCTGCACGG EcoRI 
Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT Pstl 

544a Forward AAAAA AGAATTC - EcoRI 

gcaaatgactataaaaacaaaaacttccaagtacttgc 

Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT Pstl 

547 Forward AAAGAATTCATGTTCGTAGATAACGGATTTAATAAAAC EcoRI 
Reverse AAACTCCAG-TTAACAACAAAAAACAAACCGCTT Pstl 

548 Forward AAAGAATTC-GCCTGCAAACCTCAAGACAACAGTGCGGC EcoRI 
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Reverse AA ACTGCAG -TCAGAGCAGGGTCCTTACATCGGC PsTT 

550 Forward AAAAAA GTCGAC - Sal t 

ATGATAACGGACAGGTTTCATCTCTTTCATTTTCC 

Reverse AAACTGCAG-TTACGCAAACGCTGCAAAATCCCC Pst I 

550a Forward AAAAAAGAATTC-GTAAATCACGCCTTTGGAGTCGCAAACGG Eco Rl 

Reverse AAACTCCAG-TTACGCAAACGCTGCAAAATCCCC Pst I 

552 Forward AAAAAAGAATTC-TTGGCGCGTTGGCTGGATAC Eco RI 

Reverse AA ACTGCAG -TTATTTCTGATGCCTTTTCCCAAC p st I 

554 Forward CG CGGATCCCATATG -TCGCCCGCGCCCAAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CTGCCCTGTCAGACAC Xhol 

556 Forward AAAGAATTC-GCGGGCGGTTTTGTTTGGACATCCCG Eco RI 
Reverse AAACTGCAG-TTAACGGTGCGGACGTTTCTGACC Pst I 

557 Forward CGC GGATCCCATATG -TGCGGTTTCCACCTGA A BamHI- 

Ndel 

Reverse CCC GCTCGAG -TTCCGCCTTCAGAAAGG Xhol 

558 Forward AAAGAATTC-GAGCTTTATATGTTTCAACAGGGGACGGC Eco RI 
Reverse AAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA Pst I 

558a Forward AAAAAAGAATTC-ATTAGATTCTATCGCCATAAACAGACGGG Eco RI 
Reverse AAAAAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA Pst I 

560 Forward AAAAAA GAATTC - Eco RI 

TCGCCTTTCCGGGACGGGGCGCACAAGATGGC 
Reverse AAAAAACTCCAG-TCATGCGGTTTCAGACGGCATTTTGGC Pst I 

561 Forward CC GGAATTC T ACATATG -ATACTGCCAGCCCGT EcoRI- 

Ndel 

Reverse CCCG CTCGAG -TTTCAAGCTTTCTTCAGATG Xhol 

562 Forward CG CGGATCCCATATG -GCAAGCCCGTCGAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGACCAACTCCAACTCGT Xhol 

565 Forward CGCGGATCCCATATG-AAGTCGAGCGCGAAATAC - BamHI- 

Ndel 

Reverse CCCGCTCGAG-GGCATTGATCGGCGGC Xhol 

566 Forward CGC GGATCCCATATG -GTCGGTGGCGAAGAGG BamHT- 

Ndel 

Reverse CCC GCTCGAG -CGCATGGGCGAAGTCA Xhol 

567 Forward CCG GAATTC T ACATATG -AGTGCGAACATCCTTG EcoRI- 

Ndel 

Reverse CCCG CTCGAG -TTTCCCCGACACCCTCG Xhol 

568 Forward CG CGGATCCCATATG -CTCAGGGTCAGACC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGGCGCGGCGTTCAG Xhol 

569 Forward AAAAAAGAATTC-CTGATTGCCTTGTGGGAATATGCCCG EcoRI 
Reverse AAAAAACTGCAG-TTATGCATAGACGCTGATAACGGCAAT Pst I 

570 Forward CGCGGATCCCATATG-GACACCTTCCAAAAAATCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCGGGCGTTCATTTCTTT Xhol 
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571 Forward AAAAAA GAATTC - ' EcoRT 

ATGGGTATTGCCGGCGCCGTAAATGTTTTGAACCC 
Reverse AAAAAACTGCAG-TTATGGCCGACGCGCGGCTACCTGACG Pstl 

572 Forward CGC GGATCCCATATG -GCGCAAAAAGGC A A A ACC BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCGCAGTGTGCCGATA Xhol 

573 Forward CGCGGATCCCATATG-CCCTGTTTGTGCCG BamHl- 

Ndel 

Reverse CCCGCTCGAG-GACGGTGTCATTTCGCC Xhol 

574 Forward CG CGGATCCCATATG -TGGTTTGCCGCCCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACTTCGATTTTATTCGGG Xhol 

575 Forward CGC GGATCCCATATG -GTTTCGGGCGAGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-CATTCCGAATCTGAACAG Xhol 

576 Forward CGCGGATCCCATATG-GCCGCCCCCGCATCT BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATTTACTTTTTTGATGTCGAC Xhol 

577 Forward CGC GGATCCCATATG -GAAAGGAACGGTGTATTT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGCTGTTTGGTAGATTCG Xhol 

578 Forward CGC GGATCCCATATG -AGAAGGTTCGTACAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCAACGCCTCCACG Xhol 

579 Forward CGCGGATCCCATATG-AGATTGGGCGTTTCCAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGAATTGATGATGTGTATGT Xhol 

580 Forward CGCGGATCCCATATG-AGGCAGACTTCGCCGA BamHI- 

Ndel 

Reverse CCCGCTCGAG-CACTTCCCCCGAAGTG Xhol 

581 Forward CGCGGATCCCATATG-CACTTCGCCCAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGCCGTTTGGCTTTGG Xhol 

582 Forward AAAAAAGAATTC-TTTGGAGAGACCGCGCTGCAATGCGC EcoRI 
Reverse AAAAAATCTAGA-TCAGATGCCGTCCCAGTCGTTGAA Xbal 

583 Forward AAAAAAGAATTC-ACTGCCGGCAATCGACTGCATAATCG Eco RI 
Reverse AAAAAACTGCAG-TTAACGGAGGTCAATATGATGAAATTG Pst I 

584 Forward AAAAA AGAATTC - EcoRI 

GCGGCTGAAGCATTGAATTACAATATTGTC 
Reverse AAAAAACTGCAG-TCAGAACTGAACCGTCCCATTGACGCT Pst I 

585 Forward AAAAAACjGlACC-TCTTTCTGGCTGGTGCAGAACACCCTTGC EcoRI 
Reverse AAAAA ACTGCAG -TCAGTTCGCACTTTTTTCTGTTTTGG A p s tl 

586 Forward CGC GGATCCCATATG -GCAGCCCATCTGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTCAGCGAATCAAGTTTC Xhol 

587 Forward CGCGGATCCCATATG-GACCTGCCCTTGACnA BamPn- 
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Ndel 

Reverse CCCG CTCGAG -AAATGTATGCTGTACGCC Xhol 

588 Forward AAAAAAGAATTC-GCCGTCCTGACTTCCTATCAAGAACCAGG Eco RI 
Reverse AAAAAA CTGCAG -TTATTTGTTTTTGGGCAGTTTCACTTr p s tl 

589 Forward AAAAAA GAATTC - Eco ri 

ATGCAACAAAAAATCCGTTTCCAAATCGAAGG 
Reverse AAAAAACTG^AG-CTAATCGATTTTTACCCGTTTCAGGCG Pstl 

590 Forward AAAAAAGAATTC-ATGAAAAAACCTTTGATTTCAGTTGCGGC Eco RI 
Reverse AAAAAACTCCAG-TTACTGCTGCGGCTCTGAAACCAT Pstl 

591 Forward AAAAAAGAATTC-CACTACATCGTTGCCAGATTGTGCGG EcoRI 
Reverse AAAAAACTGCAG-CTAACGGAGCAGCCGGGTAACGTCGTT Pstl 

592a Forward AAAAAAGAATTC-CGCGATTACACCGCCAAGCTGAAAATGGG Eco RI 

Reverse AAAAAACTGCAG-TTACCAAACGTCGGATTTGATACG Pstl 
593 Forward CGC GGATCCGCTAGC -CTTGAACTGAACGGACTC BamHl- 

Nhel 

Reverse CCCG CTCGAG -GCGGAAGCGGACGATT Xhol 
594a Forward AAAAAAGAATTC-GGTAAGTTCGCCGTTCAGGCCTTTCA Eco RI 

Reverse AAAAAACTGCAG-TTACGCCGCCGTTTCCTGACACTCGCG Pstl 

595 Forward AAAAAAGAATTC-TGCCAGCCGCCGGAGGCGGAGAAAGC Eco RI 
Reverse AAAAAACTGCAG-TTATTTCAAGCCGAGTATGCCGCG Pstl 

596 Forward CGC GGATCCCATATG -TCCCAACAATACGTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGCGTTACCGGTTTGT Xhol 

597 Forward CG CGGATCCCATATG -CTGCTTCATGTCAGC BamHI- 

Ndel 

Reverse GCCCAAGCTT-ACGTATCCAGCTCGAAG Hindlll 

601 Forward CGC GGATCCCATATG -ATATGTTCCCAACCGGCAAT BamHI- 

Ndel 

Reverse CCC GCTCGAG -AAAACAATCCTCAGGCAC Xhol 

602 Forward CG CGGATCCGCTAGC -TTGCTCCATCAATGC BamHI- 

Nhel 

Reverse CCC GCTCGAG -ATGCAGCTGCTAAAAGCG Xhol 

603 Forward AAAAAAGAATTC-CTGTCCTCGCGTAGGCGGGGACGGGG Eco RI 
Reverse AAAAAACTGCAG-CTACAAGATGCCGGCAAGTTCGGC Pstl 

604 Forward CGC GGATCCGCTAGC- CCCGAAGCGCACTT BamHI- 

Nhel 

Reverse CCC GCTCGAG -GACGGCATCTGCACGG Xhol 
606a Forward AAAAAAGAATTC-CGCGAATACCGCGCCGATGCGGGCGC Eco RI 
Reverse AAAAAACTGCAG-TTAAAGCGATTTGAGGCGGGCGATACG Pstl 

607 Forward AAAAAAGAATTC-ATGCTGCTCGACCTCAACCGCTTTTC Eco RI 
Reverse AAAAAACTGCAG-TCAGACGGCCTTATGCGATCTGAC Pstl 

608 Forward AAAAAAGAATTC-ATGTCCGCCCTCCTCCCCATCATCAACCG Eco RI 
Reverse AAAAAACTGCAG-TTAGTCTATCCAAATGTCGCGTTC Pstl 

609 Forward CGCGjGATCCCATATG-GTTGTGGATAGACTCG BamFII- 
Ndel 
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Reverse CC CGCTCGAG -CTGGATTATGATGTCTGTC Xhol — 

610 Forward CGCGGATCCCATATG-ATTGGAGGGCTTATGCA BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACGCTTCAACATCTTTGCC Xhol 

611 Forward CGCGGATCCCATATG-CCGTCTCAAAACGGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACGACTTTGAACGCGCAA xhol 

613 Forward CGC GGATCCCATATG -TCGCGTTCGAGCCG3 BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGCCTGTAAAATAAGCGGC Xhol 

614 Forward CGC GGATCCCATATG -TCCGTCGTGAGCGGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCATACTGCGGCGTTC Xhol 
616 Forward AAAAAAGAATTC-ATGTCAAACACAATCAAAATGGTTGTCGG Eco RI 

Reverse AAAAAATCTAGA-TTAGTCCGGGCGGCAGGCAGCTCG Xba I 

619a Forward AAAAAAGAATTC-GGGCTTCTCGCCGCCTCGCTTGC EcoRl 

Reverse AAAAAACTGCAG-TCATTTTTTGTGTTTTAAAACGAGATA PstI 
622 Forward CGC GGATCCCATATG -GCCGCCCTGCCTAAAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTGTCCAAATGATAAATCTG Xhol 

624 Forward CGC GGATCCCATATG -TCCCCGCGCTTTTACCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGATTCGGGCCTGCGC Xhol 

625 Forward CGC GGATCCCATATG -TTTGCAACCAGGAAAATG BamHI- 

Ndel 

Reverse CCC GCTCGAG -CGGCAAAATTACCGCCTT Xhol 
627a Forward AAAAAAGAATTC-AAAGCAGGCGAGGCAGGCGCGCTGGG Eco RI 

Reverse AAAAAACTGCAG- p s ti 
TTACGAATGAAACAGGGTACCCGTCATCAAGGC 
628 Forward AAAAA AGGTACC -GCCTTACAAACATGGATTTTGCGTTC Kpn I 

Reverse AAAAAACTGCAG-CTACGCACCTGAAGCGCTGGCAAA PstI 
629a Forward AAAAAAGAATTC-GCCACCTTTATCGCGTATGAAAACGA EcoRI 

Reverse AAAAAACTGCAG-TTACAACACCGCCGTCCGGTTCAAACC PstI 
630a Forward AAAAAAGAATTC-GCGGCTTTGGGTATTTCTTTCGG Eco RI 

Reverse AAAAAACTGCAG-TTAGGAGACTTCGCCAATGGAGCCGGG PstI 
635 Forward AAAAA AGAATTC - Eco RI 

ATGACCCAGCGACGGGTCGGCAAGCAAAACCG 

Reverse AAAAAACTGCAG-TTAATCCACTATAATCCTGTTGCT PstI 

638 Forward AAAAAAGAATTC- ATGATTGGCGAAAAGTTTATCGTAGTTGG EcoRI 
Reverse AAAAAACTGCAG-TCACGAACCGATTATGCTGATCGG Pst I 

639 Forward CGCGGATCCCATATG-ATGCTTTATTTTGTTCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATCGCGGCTGCCGAC Xhol 
642 Forward CGCGGAJCCCATATG-CGGTATCCGCCGCAAT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGATTGCGGGGCATTA Xhol 
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643 Forward CGC GGATCCCATATG -GCTTCGCCGTCGGCAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACCGAAAAACAGACCGC . Xhol 

644 Forward AAAAA AGAATTC - EcoRI 

ATGCCGTCTGAAAGGTCGGCGGATTGTTGCCC 

Reverse AAAAAATCTAGA-CTACCCGCAATATCGGCAGTCCAATAT PstI 

645 Forward AAAAAAGAATTC-GTGGAACAGAGCAACACGTTAAATCG Eco RI 
Reverse AAAAAACTGCAG-CTACGAGGAAACCGAAGACCAGGCCGC PstI 

647 Forward AAAAAA GAATTC -ATGCAAAGGCTCGCCGCAGACGG Eco RI 
Reverse AAAAA ACTGCAG -TTAGATTATCAGGGATATCCGGTAGAA PstI 

648 Forward AAAAA AGAATTC - Eco RI 

ATGAACAGGCGCGACGCGCGGATCGAACG 

Reverse AAAAAACTGCAG-TCAAGCTGTGTGCTGATTGAATGCGAC PstI 

649 Forward AAAAAAGAATTC-GGTACGTCAGAACCCGCCCACCG Eco RI 
Reverse AAAAAACTGCAG-TTAACGGCGGAAACTGCCGCCGTC Pst I 

650 Forward AAAAA AGAATTC -ATGTCCAAACTCAAAACCATCGC EcoRI 
Reverse AAAAA ACTGCAG -TCAGACGGCATGGCGGTCTGTTTT PstI 

652 Forward AAAAA AGGTACC - Kpn I 

GCTGCCGAAGACTCAGGCCTGCCGCTTTACCG 

Reverse AAAAA ACTGCAG -TTATTTGCCCAGTTGGTAGAATGCGGC PstI 

653 Forward AAAAA AGAATTC -GCGGCTTTGCCGGTAATTTTCATCGG Eco RI 
Reverse AAAAAACTGCAG-CTATGCCGGTCTGGTTGCCGGCGGCGA Pst I 

656a Forward AAAAAA GAATTC -CGGCCGACGTCGTTGCGTCCTAAGTC Eco RI 

Reverse AAAAA ACTGCAG -CTACGATTTCGGCGATTTCCACATCGT Pst I 

657 Forward AAAAAAGAATTC-GCAGAATTTGCCGACCGCCATTTGTGCGC Eco RI 
Reverse AAAAAACTGCAG-rTATAGGGACTGATGCAGTTITTTTGC PstI 

658 Forward CGC GGATCCCATATG -GTGTCCGGAATTGTG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GGCAGAATGTTTACCGTT Xhol 

661 Forward AAAAA AGAATTC - Eco RI 
ATGCACATCGGCGGCTATTTTATCGACAACCC 

Reverse AAAAAACTGCAG-TCACGACGTGTCTGTTCGCCGTCGGGC Pst I 

663 Forward CGC GGATCCCATATG -TGTATCGAGATGAAATT BamHI- 

Ndel 

Reverse CCC GCTCGAG -GTAAAAATCGGGGCTGC Xhol 

664 Forward CG CGGATCCCATATG -GCGGCTGGCGCGGT BamHl- 

Ndel 

Reverse CCCGCTCGAG-AAATCGAGTTTTACACCAC Xhol 

665 Forward AAAAAAGAATTC-ATGAAATGGGACGAAACGCGCTTCGG Eco RI 
Reverse AAAAAACTGCAG-TCAATCCAAAATTTTGCCGACGATTTC Pst I 

666 Forward AAAAAAGAATTC-AACTCAGGCGAAGGAGTGCTTGTGGC Eco RI 
Reverse AAAAAAJCJAGA-TCAGTTTAGGGATAGCAGGCGTAC Xbal 

667 Forward AAAAA AGAATTC - Eco RI 

CCGCATCCGTTTGATTTCCATTTCGTATTCGTCCG 

Reverse AAAAAACTGCAG-TTAATGACACAATAGGCGCAAGTC Pst I 
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669 Forward AAAAAAGAATTC-ATGCGCCGCATCATTAAAAAACACCAGCC Eco RI 
Reverse AAAAAACTGCAG-TTACAGTATCCGTTTGATGTCGGC PstI 

670a Forward AAAAAAGAATTC-AAAAACGCTTCGGGCGTTTCGTCTTC Eco Rl 

Reverse AAAAA ACTGC AG - p st i 
TTAGGAGCTTTTGGAACGCGTCGGACTGGC 

671 Forward CGC GGATCCCATATG -ACCAGCAGGGTAAC BamHI- 



Ndel 

Reverse CCCG CTCGAG -AGCAACTATAAAAACGCAAG Xhol 

672 Forward CGC GGATCCCATATG -AGGAAAATCCGCACC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGGGATAGGCGGTTG Xhol 

673 Forward AAAAAAGAATTC-ATGGATATTGAAACCTTCCTTGCAGG Eco RI 
Reverse AAAAAACTGCAG-CTACAAACCCAGCTCGCGCAGGAA PstI 

674 Forward AAAAAAGAATTC-ATGAAAACAGCCCGCCGCCGTTCCCG Eco RI 
Reverse AAAAAACTCCAG-TCAACGGCGTTTGGGCTCGTCGGG PstI 

675 Forward CGC GGATCCCATATG -AACACCATCGCCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTCTTCGTCTTCAAACTGT Xhol 
677a Forward AAAAAAGAATTC-AGACGGCATTCCCGATCAGTCGATTTTGA Eco RI 
Reverse AAAAAACTGCAG-TTACGTATGCGCGAAATCGACCGCCGC PstI 

680 Forward CG CGGATCCGCTAGC -ACGAAGGGCAGTTCGG BamHI- 

Nhel 

Reverse CCCGCTCGAG-CATCAAAAACCTGCCGC Xhol 

681 Forward AAAAAAGAATTC-ATGACGACGCCGATGGCAATCAGTGC Eco RI 
Reverse AAAAAA CTGCAG -TTACCGTCTTCCGCAAAAAACAGC PstI 

683 Forward CG CGGATCCCATATG -TGCAGCACACCGGACAA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GAGTTTTTTTCCGCATACG Xhol 

684 Forward CGC GGATCCCATATG -TGCGGTACTGTGCAAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CTCGACCATCTGTTGCG Xhol 

685 Forward CG CGGATCCCATATG -TGTTTGCTTAATAATAAACATT BamHI- 

Ndel 

Reverse CCC GCTCGAG -CTTTTTCCCCGCCGCA Xhol 

686 Forward CGC GGATCCCATATG -TGCGGCGGTTCGGAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CATTCCGATTCTGATGAAG Xhol 

687 Forward CGCGGATCCCATATG-TGCGACAGCAAAGTCCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -CTGCGCGGCTTTTTGTT Xhol 

690 Forward CGC GGATCCCATATG -TGTTCTCCG AGC AAAGAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TATTCGCCCCGTGTTTGG Xhol 

691 Forward CGC GGATCCCATATG -GCCACGGCTTATATCCG BaraHI- 

Ndel 

Reverse CCCGCTCGAG-TTTGAGGCAGGAAGAAAG xhol 
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694 Forward CGC GGATCCCATATG -TTGGTTTCCGCATCCGG 

Reverse CCCGCTCGAG-TCTGCGTCGGTGCGGT 

695 Forward CGCGGATC^CATATG-TTGCCTCAAACTCGTCCG 

Reverse CCCG CTCGAG -TCGTTTGCGCACGGCT 

696 Forward CGCGGATCCCATATG-TTGGGTTGCCGGCAGG 

Reverse CCCG CTCGAG -TTGATTGCCGCAATGATG 
700a Forward AAAAAAGAATTC-GCATCGACAGACGGTGTGTCGTGGAC 
Reverse AAAAAACTCCAG-TTACGCTACCGGCACGACTTCCAAACC 

701 Forward CGCGGATOCCATATG-AAGAC1TGTTTGGATACTTC 

Reverse CCCG CTCGAG -TGCCGACAACAGCCTC 

702 Forward AAAAAAGAATTC-ATGCCGTGTTCCAAAGCCAGTTGGATTTC 
Reverse AAAAAACTGCAG-TTAACCCCATTCCACCCGGAGAACCGA 

703 Forward CGC GGATCCGCTAGC -CAAACGCTGGCAACCG 

Reverse CCCG CTCGAG -TTTTGCAGGTTTGATGTTTG 
704a Forward AAAAAAGAATTC-GCTTCTACCGGTACGCTGGCGCG 
Reverse AAAAAACTGCAG- 

TTAGTTTTGCCGGATAATATGGCGGGTGCG 

707 Forward CGC GGATCCGCTAGC -GAAATTATTAACGATGCAGA 

Reverse CCCGCTCGAG-GAAACTGTAATTCAAGTTGA 

708 Forward CGC GGATCCGCTAGC -CCTTTTAAGCCATCC A AAA 

Reverse CCCG CTCGAG -TTGACCGGTGAGGACG 
710 Forward CGC GGATCCCATATG -G AAACCC ACGAAAAA ATC. 

Reverse CCC GCTCGAG -AACGGTTTCGGTCAG 
714 Forward CGC GGATCCCATATG -AGCTATCAAGACATCTT 

Reverse CCC GCTCGAG -GCGGTAGGTAAATCGGAT 
716 Forward CGCGGATCCCATATG-GCCAACAAACCGGCAAG 

Reverse CCCG CTCGAG -TTTAGAACCGCATTTGCC 
718 Forward CGC GGATCCCATATG -GAGCCGATAATGGCAAA 

Reverse CCCGCTCGAG-GGCGCGGGCATGGTCTTGTCC 
720 Forward CGCGGATCCCATATG-AGCGGATGGCATACC 

Reverse CCC GCTCGAG -TTTTGCATAGCTGTTGACCA 
723 Forward CGCGGATCCCATATG-CGACCCAAGCCCC 

Reverse CCCGCTCGAG-AATGCGAATCCGCCGCC 



BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHl- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHl- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHT- 

Nhel 

Xhol 

EcoRI 

PstI 

BamHI- 

Nhel 

Xhol 

BamHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BarnHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 
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725 


Forward CGCGGATGCG AT ATG-GTGPGP A PGGTT A A A 

1 vi "<uu v> vj v> vj vj x v x V-/V>v^i v. i. x\. x vj VJ X VJ V_> VJ v^./v V_j VJ VJ X X /A^r\.r\ 


Bam Hi- 






Ndel 




Reverse CCCGCTCGAG-TTGCTTATCCTTAAGGGTTA 


vVXXUl 


726 


Forward CGCGGATCCCATATG-ACCATrTATTTr A A A A A P 

1 " *** *-* *• — VJ XJ VJ VJ* X -» VJV->V>.f & A J X M. VJ XXV^V^Xl. J. VX X J\. XXX vAiuul/iV/ 


BamHI- 






XT J_T 

Ndel 




Reverse CCCGCTCGAG-GCCGATGTTTAGCGTCC 


YhnT 


728 


Forward CGCGGATCCCATATG-TTTTfiGCTfiGfJA Arnnn 

v/x rr (u. vi. v>vjv\Jviri a v^ v^vw » x ^ i A VJ X X X. X VJ VJ V/ X VJ VJ VJiTjr\v^vJvJvJ 


Bamril- 






XT J 

Ndel 




Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC 


YhnT 


729 


Forward CGCGG ATPPP AT ATG-TGP A PP A TG A TTPPrr a 

i vi ntuu vvivvj vin i v> Wii i rv i \j 1 VJ v/rt v»\_<x \ 1 VJ/i 1 1 vv/LLA 


BamHI- 






XT J T 

Ndel 




Reverse GCCCAAGCTT-TTTGTCGGTTTGGGTATC 


HindTTT 

ITLlXlUJ.il 


731 


Forward CGCGGATCCGCTAGC-GCCGTCTfTrOAfW"! 

* V/A IU *J. VJ VJ J X A WVJ V_-> JL J WVJ \w> VJ V* V* VJ A \J V/V>VJ VJ^\V TV I 


n TTT 

Bamrll- 






XTL. — T 

Nnel 




Reverse CCCGCTCGAG-ACGGGCGCGGCAG 


AJ1U1 


732 


Forward CCGGAATTPTAP ATATG-TPG A A A PPTPiTTTTT aapaa 

X VI VYtUU V/V/\J\jrLTV 1 1 vl X / x X VJ X Vj VJf\f\f\\^> v^ 1 vJ 11111 J\J\\jJ\J\ 


T~7 Tj T 

EcoRI- 






Ndel 




Reverse CCCGCTCGAG-CTTCTTATCTTTTTTATCTTTC 


AJxOl 


733 


Forward PGPGGATPPP ATATn-frPPTPPHnPf^or 1 A A 

i ui w (xi vj VjVjv^vjvj/a i v>v>L/A i A x VJ vJvjV^ X VJ V-/ vJvJv^ vJvJv>/vr\ 


TTT 

BamHI- 






Ndel 




Reverse CCCGCTCGAG-TCGCTTGCCTCCTTTAC 


YhnT 


734 


Forward CGCGG ATPPPATATG-GPPG AT A PTT A CCICITT A t 

1 VIWOIU \^V*J^VJ VJ-tV X V^V^V^Xi. X r\. X VJ~VJV^v^VJ.rY X /\v> 1 I/WjVJvJvj 1 /V J 


BamHI- 






XT J ~T 

Ndel 




Reverse CCCGCTCGAG-TTTGAGATTTTGAATCAAAGAG 


YhnT 


735 


Forward PGPGGATPPPATATG-A AGP AHT AP^PnrTrr' A 

X \Jl W £11 U VjVJvjVj Vjr\.X VjV^Vj/XX A X VJ~/A/\vJ^/\VJvj/\\JVJV^OvJ X V_//V 


BamHI- 






XT J T 

Ndel 




Reverse CCCGCTCGAG-ATTTCCGTAGCCGAGGG 


YhnT 


737 


Forward PGPGGATPPP AT A TG-P A PP APHA pr;r; A P a pr 

x sjxwaxxx V/xj^vj\jrv i v v, un. l a l u v^/\^\^A v^UAL-vJVjAUAL/vJ 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA 


YhnT 


739 


Forward CGPGGATCPCATATG-GP A A A AAA APPPA AP A 


Bamrix- 






Ndel 




Reverse CCCGCTCGAG-GAAGAGTTTGTCGAGAATT 


YhnT 


740 


Forward CGCGGATCCC AT ATG-GCC A ATPPGPPPG A AG 


Bamril- 






Ndel 




Reverse CCCGCTCGAG-AAACGCGCCAAAATAGTG 


YhnT 
vyjuvji 


741 


Forward PGPGGATPPP AT ATG-TGP A GP A PPPPi A Ptf^P 

x yji vvtuu Vjvj VjVJ vjrv x VjV^vj.tx x rv l VJ- 1 VJv^-/^VJ v^/xvJv^vJvj/\vJVJvJ 


"Pl T TT 

Bamrll- 






Ndel 




Reverse CCCGCTCGAG-TTGCTTGGCGGCAAGGC 




743 


Forward PGPGGATPPP ATA TG-G A PPtPiTPTTPTPPPTPTT 

X KJl WOiU VjVJvjVJVJrVX VjVjVj.t\ X r\ X VJ VJAV/VlVJ .! VJ 1 1 vJ 1 VJL-L/ I vJ 1 X 


TJ TTT 

BamHI- 






Ndel 




Reverse CCCGCTCGAG-CTTACGGATCAAATTGACG 


Xhol 


745 


Forward CGCGGATCCCATATG-TTTTGGCAACTGACCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CAAATCAGATGCCTTTAGG 


Xhol 


746 


Forward CGCGGATCCCATATG-TCCGAAAACAAAGAAAAP 


BamHI- 
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Ndel 




Reverse 


CCCuC I CGAG-TTCATTCGTTACCTGACC 


Xhol 


747 


Forward 


CCGGAATTCTAGCTAGC-CTGACCCCTTGGG 


EcoRI- 








Nhel 




Reverse 


GCCCAAGCTT-TTTTGATTTTAATTGACTATAGAAC 


Hindlll 


749 


Forward 


CGCGGATCCCATATG-TGCCAGCCGCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCuAG-TTTCAAGCCGAGTATGC 


Xhol 


750 


Forward 


CGCGGATCCCATATG-TGTTCGCCCGAACCTG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTl 1 ITCCCCGCCGCAA 


Xhol 


758 


Forward 


CGCGGATCCCATATG-AACAATCTGACCGTGTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGGCTCAATCCTTTCTGC 


Xhol 


759 


Forward 


CGCGGATCCGCTAGC-CGCTTCACACACACCAC 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-CCAGTTGTAGCCTATTTTG 


Xhol 


763 


Forward 


CGCGGATCCCATATG-CTGCCTGAAGCATGGCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCGCAAATACCGTTTCC 


Xhol 


764 


Forward 


CGCGGATCCCATATG-TTTTTCTCCGCCCTGA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGCTCCCTAAAGCTTTC 


Xhol 


765 


Forward 


CGCGGATCCCATATG-TTAAGATGCCGTCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ACGCCGACGTTTTTTATTAA 


Xhol 


767 


Forward 


CGCGGATCCCATATG-CTGACGGAAGGGGAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCTGTACAGCAGGGG 


Xhol 


768 


Forward 


CGCGGATCCCATATG-GCCCCGCAAAAACCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCATCCCTTTTTTGAGC 


Xhol 


770 


Forward 


CGCGGATCCCATATG-TGCGGCAGCGGCGAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGTTTGTCGAGATTTTC 


Xhol | 


771 


Forward 


CGCGGATCCCATATG-TCCGTATATCGCACCTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGGTTCTTT AGGTTTG A G 

\/V^\/VJ \s J. VVJi »■ VJ >-> V_J VJ A. A \^ AAA A1.VJ VJ 111 vJzjL vJ 


Ail 01 


772 


Forward 


CGCGGATCCCATATG-TTTGCGGCGTTGGTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CAATGCCGACATCAAACG 


Xhol 


774 


Forward 


CGCGGATCCCATATG-TCCGTTTCACCCGTTCC 


BamHl- 








Ndel 




Reverse 


CCCGCTCGAG-TCGTTTGCGCACGGCT 


Xhol 


790 


Forward 


CGCGGATCCCATATG-GCAAGAAGGTCAAA A AC 


BamHI- 








Ndel 
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Reverse CCCG CTCGAG -GGCGTTGTTCGGATTTCG xhol — 

900 Forward CGCGGATCCCATATG-CCGTCTGAAATGCCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATATGGAAAAGTCTGTTGTC Xhol 

901 Forward CGC GGATCCCATATG -CCCGATTTTTCGATG BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAAATGGAACAATACCAGG Xhol 

902 Forward. CCGGAATTCT ACATATG-TTG CACTTTC AAAGGAT AATC EcoRI- 

2 Ndel 
Reverse CCCGCTCGAG-AAAAATGTACAATGGCGTAC Xhol 

903 Forward CCGGAATTCT AGCTAGC-CAGCGTCAGCAGCACAT EcoRI- 

Nhel 

Reverse CCCGCTCGAG-GAAACTGTAATTCAAGTTGAA Xhol 

904 Forward AAAAAAG^TACC-ATGATGCAGCACAATCGTTTC Kpnl 
Reverse AA ACTGC AG -TTAAT ATCGAT AGGTT A T A TG p st i 

904a Forward AAAAAAGAATTC^CGGCTCGGCATTGTGCAGATGTTGCA Eco RI 
Reverse AA ACTGCAG -TTAAT ATCGAT AGGTT ATATG p st i 

905 Forward CGCGGATCCCATATG-AACAAAATATACCGCATC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CCACTGATAACCGACAGAT Xhol 

907 Forward CGC GGATCCCATATG -GGCGCGC AACGTG A G BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACGCCACTGCCAGCG Xhol 

908 Forward AAAGAATTC-GCAGAGTTAGTAGGCGTTAATAAAAATAC Eco RI 
Reverse AAACTGCAG-TTAATATGGTTTTGTCGTTCG Pst I 

909 Forward CGC GGATCCCATATG -TGCGCGTGGG A A A CTT AT BamHI- 

Ndel 

Reverse CCCGCTCGAG-TCGGTTTTGAAACTTrGGTTTT Xhol 

910 Forward AAAGAAJTjC-GCATTTGCCGGCGACTCTGCCGAGCG EcoRI 
Reverse AAACTGCAG-TCAGCGATCGAGCTGCTCTTT p s tl 

911 Forward AAAGAATTC-GCTTTCCGCGTGGCCGGCGGTGC EcoRI 
Reverse AAAAAACTGCAG-GTCGACTTATTCGGCGGCTTTTTCCGC Pstl 

912 Forward AAAAA AGAATTC - EcoRI 

CAAATCCGTCAAAACGCCACTCAAGTATTGAG 
Reverse AAAAAACTGCAG-TTACAGTCCGTCCACGCCTTTCGC Pst I 

913 Forward CGCG^ATCCCAJATG-GAAACCCGCCCCGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGTTGTGTTCCAGGTTG Xhol 

915 Forward CGC GGATCCCATATG -TGCCGGCAGGCGG A A BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTGAAAATATAGGTATCAGG Xhol 

914 Forward AAAGAATTC-GACAGAATCGGCGATTTGGAAGCACG Eco RI 
Reverse AAACTGCAG-CTATATGCGCGGCAGGACGCTCAACGG Pst I 

916 Forward CGCG^ATCCCAJATG-GCAATGATGGCGGCTG BamHI- 

Ndel 

Reverse CCCGCTCGAG-nTGGCGGCATCTTTCAT Xhol 
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917 Forward AAAAAAGAATTC-CCTGCCGAAAAACCGGCACCGGC EcoRT 

Reverse AAAAAACTCCAG-TTATTTCCCCGCCTTCACATCCTG PstI 

919 Forward CGCGGATCCCATATG-TGCCAAAGCAAGAGCATC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CGGGCGGTATTCGGG xhol 

920 Forward CGC GGATCCCATATG -CACCGCGTCTGGGTC BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATGGTGCGAATGACCGA Xhol 

921 Forward AAAAAAGAATTC-TTGACGGAAATCCCCGTGAATCC Eco RI 
Reverse AAAAAACTGCAG-TCATTTCAAGGGCTGCATCTTCAT PstI 

922 Forward. CGCGGATCCGCTAGC-TGTACGGCGATGGAGGC BamHI- 

2 Nhel 

Reverse CCC GCTCGAG -CAATCCCGGGCCGCC Xhol 

923 Forward CGCGGATCCCAXATG-TGTTACGCAATATTGTCCC BamHJ- 

Nhel 

Reverse CCCGCTCGAG-GGACAAGGCGACGAAG Xhol 

925 Forward C GC GGATCCC ATATG - AAACAAATGCTTTT AGCm BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCCGTTGCATTTGATTTC Xhol 

926 Forward CGC GGATCCCATATG -TGCGCGCAATTACCTr BamHI- 

Ndel 

Reverse CCCGCTCGAG-TCTCGTGCGCGCCG Xhol 

927 Forward CGCGGATCCCATATG-TGCAGCCCCGCAGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -GTTTTTTGCTGACGTAGT Xhol 
929a Forward AAAAAAGAATTC-CGCGGTTTGCTCAAAACAGGGCTGGG Eco RI 

Reverse AAAAAATCTAGA-TTAAGAAAGACGGAAACTACTGCC Xba I 

931 Forward AAAAAAGAATTC-GCAACCCATGTTTTGATGGAAAC ' EcoRI 

Reverse AAAAAACTGCAG-TTACTGCCCGACAACAACGCGACG PstI 

935 Forward AAAAA AGAATTC - Eco m 

GCGGATGCGCCCGCGATTTTGGATGACAAGGC 
Reverse AAAAAACTGCAG-TCAAAACCGCCAATCCGCCGACAC PstI 

936 Forward CGCGGATCCCATATG-GCCGCCGTCGGCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGTTGGACGTAGTTTTG Xhol 

937 Forward AAAAAAGAATTC-CCGGTITACATTCAAACCGGCGCAAC EcoRI 
Reverse AAAAAACTGCAG-TTAAAATGTATGCTGTACGCCAAA PstI 

939a Forward AAAAAAGAATTC-GGTTCGGCAGCTGTGATGAAACC Eco RI 

Reverse AAAAAACTGCAG-TTAACGCAAACCTTGGATAAAGTTGGC PstI 
950 Forward CGCGGATCCCATWTG-GCCAACAAACCGGCAAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC Xhol 
953 Forward CGCGGAJTCC^ATWrG-GCCACCTACAAAGTGGAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TrGTTTGGCTGCCTCGAT Xhol 
957 Forward CGCGGATCCC ATATG-TTTTGGCTGGG A A C.GCiCi B amHI- 
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Ndel 

Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC Xhol 

958 Forward CGCGGATCCCATATG-GCCGATGCCGTTGCG BamHI- 

Ndel 

Reverse GCCCAAGCTT-GGGTCGTTTGTTGCGTC HindHI 

959 Forward CGCGGATCCCATATG-CACCACGACGGACACG BamHI- 

Ndel 

Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA Xhol 
961 Forward CGCGGATCCCATATG-GCCACAAGCGACGACG ' BamHI- 

Ndel 

Reverse CCCGCTCGAG-CCACTCGTAATTGACGC Xhol 

972 Forward AAAAA AGAATTC - Eco M 

TTGACTAACAGGGGGGGAGCGAAATTAAAAAC 
Reverse AAAAAATCTAGA-TTAAAAATAATCATAATCTACATTTTG Xbal 

973 Forward AAAAAAGAATTC-ATGGACGGCGCACAACCGAAAAC Eco RI 
Reverse AAAAAACTGCAG-TTACTTCACGCGGGTCGCCATCAGCGT PstI 

982 Forward CGCGGATCCCATATG-GCAGCAAAAGACGTAC BamHI- 

Ndel 

Reverse CCCGGTCGAG-CATCATGCCGCCCATCC Xhol 

983 Forward CGCGGATCCCATATG-TTAGCTGTTGCAACAACAC BamHl- 

Ndel 

Reverse CCCGCTCGAG-GAACCGGTAGCCTACG Xhol 

987 Forward CGCGGATCCCATATG-CCCCCACTGGAAGAAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TAATAAACCTTCTATGGGC Xhol 

988 Forward CGC GGATCCCATATG -TCTTTAAATTTACGGGA AAAAG BamHI- 

Ndel 

Reverse GCCCAAGCTT-TGATTTGCCTTTCCGTTTT Hindin 

989 Forward CCGGAATTCTACATATG-GTCCACGCATCCGGCTA EcoRJ- 

Ndel 

Reverse CCCGCTCGAG-TTTGAATTTGTAGGTGTATTGC Xhol 

990 Forward. CGCGGATCCGCTAGC-TTCAGAGCTCAGCTT BamHI- 

2 Nhel 
Reverse CCCGCTCGAG-AAACAGCCATTTGAGCGA Xhol 

992 Forward CGCGGATCCCATATG-GACGCGCCCGCCCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -CCAAATGCCCAACCATTC . . . xhol 

993 Forward CGCGGATCCCATATG-GCAATGCTGATTGAAATCA BamHI- 

Ndel 

Reverse CCCGCTCGAG-GAACACATCGCGCCCG Xhol 

996 Forward CGCGGATCCCATATG-TGCGGCAGAAAATCCGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TCTAAACCCCTGTTTTCTC Xhol 

997 Forward CC GGAATTC T AGCTAGC -CGGCACGCCGACGTT EcoRI- 

Nhel 

| Reverse CCCGCTCGAG-GACGGCATCGCTCAGG Xhol 
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Underlined sequences indicate restriction recognition sites. 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 1>: 

gOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG GTGTCGGCGA ACGAGGTGTC 

51 CGGCAGGGCT TGCGCCCGGA TGGTGCTGGT CATCTGCCAG ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGC TCGGGTACGC ATACTTTACC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTGCG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCGGTTT GTAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCTGAAG CGATGTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCGGCTTCAT CGGGCAGGTG GGACAATACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 2; ORF 001. ng>: 

gOOl .pep 

1 MLPQGKAARR VSANEVSGRA CA RMVLVICQ TLPKRDTLNG SGTHTLPVWA 

51 ILPRSLRSKS TIITFSARFF GSVCNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEAMLRKSS GEKHSVHADC PASSGRWDNT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3>; 

mOOl .seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGcAssCTT SS.GCTTGGA yGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GATCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

3C1 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCCTCCGCAT CGGGCAGGTG GGACAAGACG GCATAG 

This corresponds to the amino acid sequence <SEQ TD 4; ORF 001>: 

mOOl .pep 

1 MLPQGKAARR MSANEVCGXL X AWXVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PSASGRWDKT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 5>: 

aOOl. seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGCAAGGCT TGGGCTTGGA TGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCTTGTGCAT CGGGCAGGTG GGACAAAACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 6; ORF 001. a>: 

aOOl.pep 

1 MLPQGKAARR M5ANEVCGKA W AWMVLVICO TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PCASGRWDKT A* 



mOOl/aOOl 96.2% identity over a 131 aa overlap 

10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
I I I I ! I i I I t I I I I I It I II I I I I I I i I I I I I I | | | I | | I i I I M | I I I I I I I I I I 
aOOl.pep MLPQGKAARRMSANEVCGKAWAWMVLVICOTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
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10 



20 



30 



40 



50 



60 



70 80 90 100 HO 120 

mOOl.pep TI ITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKS SGEKHSVHADC 

I M I I I I I I I I 1 1 I I I I I M I 1 1 I I I I 1 I I 1 1 I I U I! I I I 1 1 I 1 1 ; I I I i I I I I I II 1 1 
aOOl.pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKS SGEKHSVHADC 

7 0 00 90 100 110 120 



mOOl .pep 
aOOl.pep 



130 

PSASGRWDKTAX 
I I I I I I I I I II 
PCASGRWDKTAX 
130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 001 shows 89.3% identity over a 131 aa overlap with a predicted ORF (ORF 001 ng) 
from TV. gonorrhoeae: 

mOOl/gOOl 



10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTI.NGSGTHTVPWAILPRSLRSKS 
M I I I I I I I I : I i I M I | I I I I II I I I I I I I I M I I I I I : I I I I II I I I I I I M 
9031 MLPQGKAARRVSANEVSGRACARMVLVICQTLPKRDTLNGSGTHTLPVWAILPRSLRSKS 

!0 20 30 40 50 60 

70 80 90 100 110 120 

mOOl . pep TI ITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

I I I I H M I M I : I I I I I I I J I I T I I I I I I I I I f | f I t I I I I : t I I C I I f I I I t i | | | 
gOOl TIITFSARFFGSVCNSAARRSSCPSPKIGAVPFIGSVLMVPSEAMLRKSSGEKHSVHADC 

70 80 90 100 110 120 

130 

mO 0 1 . pep PSASGRWDKTAX 
1111:111 

gOOl P ASS GRW DN T AX 

130 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 7>: 

g003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCGTATTC GGTCGCGCTG TTTTGGGTCA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGCTTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAGG TAGTCCTGAT GGTATTCCTC GGCGTCGTAG 

251 AAGTTTTTCA GCGGTTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTGC CGGTGTCGTT ACCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAA TAATATTGCA GGATGTCGTC 

451 CAGgCTGagt TTGTCGGCAT CGTaggtcac tTTGACGGTC TCGGCATGAC 

501 CCGTATGGCG GTaggacact tctTCgtanc TcGGGtTTTC CGTGttGCCG 

551 TTGGCgttac cGGATACCGC gtcaACCACG CCGTcgatgc gttggaAATa 

601 ggCTTCCAAg ccccaaaagc agccgccggc gaagtaaatg gtgcccgtgt 

651 tcatgattGC TGa 

This corresponds to the amino acid sequence <SEQ ID 8; ORF 003.ng>: 

g003.pep 

1 MWFVAEGVF GRAVLGHLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGB'G 

51 FARQRFVGFA DVDV AVAVGV FNQWLMVFL GWEVFQRFV FKNEGQLVFL 

101 LLAFEGGGDD GFFGGVG VVH AAAVLRAGW TLFV EAGR1N DAEIILQDW 
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151 QAE FVGIVGH FDGLGMTRMA VGHFFVRVFR VAVGVTGYRV NHAVDALEIG 
201 FQAPKAAAGE VNGARVHDC 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 9>: 

m003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGsTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGGG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CAGCkTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTtCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGk CGGCGATGAC GGCTTTTTCG kCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATT3CA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCg. CCGGC GAGGTAAATG GTGCGCGTGT 

651 TCATGATTTT TGA 



This corresponds to the amino acid sequence <SEQ ID 10; ORF 003>: 

m003.pep Length: 221 

1 MWFVAEGIF GRAVLGNLXL LFGQGAFEFG VTRFFIRCRV EAFALRGGLG 

51 FARQRFVSXA DVDV AVAVGV FNQWLMVFL G1VEVFQRLV FNNEGQLVFL 

1G1 LLAFEGXGDD GFFXGVG WH AAAVLRTGW ALFV EAGRTN DAEEILQDVV 

151 *AE FVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

2C1 GFQAPEAAXG EVNGARVHDF * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1>: 

a003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATATCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTTCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

451 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCCGCCGGC GAGGTAGATG GTGCGCGTGT 

651 TCATGATTTT TGA 



This corresponds to the amino acid sequence <SEQ ID 12; ORF 003.a>: 

a003.pep 

1 MWFVAEGIF GRAVLGNLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGLG 

51 FARORFVGFA DID VAVAVGV FNQWLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG WH AAAVLRTGW ALFV EAGRIN DAEEILQDW 

151 *AE FVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAAG EVDGARVHDF * 



m0O3/a003 95.9% identity over a 220 aa overlap 



10 20 30 40 50 60 

m003 . pep MWFVAEGI FGRAVLGNLXLLFGQGAFE FGVTRFFI RCRVEAFALRGGLG FARQRFVSXA 

MMiMiiiiiiiiiii i Mini i mi i nit i mi i mi i tiitiiini: i 
a003 m wi:tv af.gifgravlgnlvllfgqgafefgvtrffircrveafalrcglgfarqrfvgfa 

10 20 30 40 50 60 
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."70 80 90 100 110 120 

m003.pep DVDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGWH 
I : I i I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I ! t I I I I Hit I I I I I I | | | M I 
a003 DIDVAVAVGVFNOWLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGGGDDGFFGGVGWH 
70 80 90 100 110 120 

130 140 150 160 170 180 

Itl003.pep AAAVLRTGVVALFVEAGRINDAEEILQDVVXAEFVGIVGHFDGFGVARMAVGHVFIARIF 
I M i I I I I I M II I I I I I I I I I i i I I I I I I I || | I | | |j I M ! I i I M I M I i I I I I I I ] 
a003 AAAVLRTGVVALFVEAGRINDAEEILQDVVXAEFVG IVGHFDGFGVARMAVGHVFIARIF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 
tl I I Ml II I Ml III Ml M Hi III I 111:11111111 
a 003 RVAVGVAGYRVNHAVDALEIGFQAPEAAAGEVDGARVHDFX 

190 200 210 220 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 003 shows 88.6% identity over a 219 aa overlap with a predicted ORF (ORF 003. ng) 
from N. gonorrhoeae: 

m003/g003 

10 20 30 40 50 60 

m003 . pep MVVFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA 
11111111:1111111:1 I I I M I I I I | | | | | | 1 | | | [| | M ! | | 1:1111)111: | 
g003 MWFVAEGVFGRAVLGHLVLLFGQGAFE FGVTRFFIRCRVEAFALRCGFGFARQRFVGFA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m003 . pep DVDVAVAVGVFNQWLMVFLG IVEV FQRLVFNNEGQLVFLLLAFEGXG DDG FFXGVGWH 

I I I I I M I I I [ I I ! I I I t I I I : I I I I I 1 : II I I I I II I I || llllll HUM 

g003 DVDVAVAVGVFNQVVLMVFLGVVEVFQRFVFNNEGQLVFLLLAFEGGGDDGFFGGVGVVH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m003 . pep AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 

HNIhlll: || (Mill I I I I I I I I I I I I : I :: I | I | | | I : | : | 

g003 AAAVLRAGWTLFVEAGRINDAEIILQDWQAEFVGIVGHFDGLGMTRMAVGHFFV-RVF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 

II I 1 I I : I M I I I I I I I I I I I I I I I : II I I I M I I I || 
g003 R VAVG VTG YRVNHAV DALE I G FQA PKAAAGE VN G AR VH DC 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 13>: 

g004. seq 

1 ATGgtagAAC GG CAT AT CCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GCGCCCATCC CAACAagtga gccaAAtgtT CGGCGGCAGG GCCTacgatT 

101 TCCGCGCCGA TAAagcggcc gGTGgctTTT tcgGCataca ggcgcaTatg 

151 gCCTTTGTTT ACCAgcatca cgcggctgcg accttgaTTT TTGAACGATA 

201 CTTCGCCgaT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCgga ctggtaaACA CCACGCCAAT 

301 GGTgctgcgg cGCAAACCGC TGCCGATATt cgGgtagcgg ccccgcgtta 

351 ttgcccggca atcttacctt ggtcggcggc ttcatGCAGC AGGGGCagtt 

4 01 ggttggacgc gtcgcccgca ataAAGATAT GCGGAATgct ggtCTGCATg 

4 51 gtCAGCGGAT CGGCAACGGG tacgccgcgc gcgtctttgT CGATATTGAT 

501 GTTTTCCAAA CCGATATtgT CAACGTTCGG ACGGCgACCT ACGGCTGCCA 
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551 ACATATATTC GGCAACAAAT ACGCCTTTTT CGCCATCCTG CTCCCAATGG 

601 ACTtctACAT TGCCGTCTGC GTCGAGTTTG ACCTCGGTTT TAGCATCCAG 

651 ATGCAGTTTC AATtctTCTC CGAACACGGC TTTCGCCTCG TCTGAAACAA 

701 CGGGGTCGGA AATGCCGCCG ATGATTCCGC CCAAACCGAA AATTTCAACT 

751 TTCACACCCA AACGGTGCAA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 14; ORF 004.ng>: 

g004.pop 

1 MVERHIQHLR NGHLHLMRPC CQVSQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVYQHHAAA TLIFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAN 

101 GAAAQTAADI RVAAPRYCPA ILPWSAASCS RGSW LDASPA IKICGMLVCM 

151 VSGSATGTPR ASLSILMFSK PILSTFGRRP TAANIYSATN TPFSPSCSQW 

201 TSTLPSASSL TSVLASRCSF NSSPNTAFAS SETTGSEMPP MIPPKPKIST 

251 FTPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15>: 

m004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCAGG GCCTACGATT 

101 TCCGCGCCGA TAAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTC ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAGAC CGACAAAGCC GATTTGCGGA CTGGTAAACA CCACGCCGAT 

301 GGTGCTGCGC CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCAGCTTC ATGCAGCAGA GGCAGTTGGT 

4 01 TGGACGCATC GCCTGCGATG AAGATATGCG GAATACTGGT CTGCATGGTC 

451 AGCGGGTCGG CAACAGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATATT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCCACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCATC GAGTTTGACC TCGGTTTTAG CATCCAGATG 

651 CAGTTTCAAT TCTTCGCCGA ACACGGCGTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATTCCGCCCA AACCGAAAAT TTCAACTTTC 

7 51 ACGCCCAAAC GGTGCAATGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 16; ORF 004>: 

mO04 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVHQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAD 

101 GAAPQTAADI RVAAALSPAI LPWSAASCSR GSW LDASPAM KICGILVCMV 

151 SGSATGTPRA SFSILIFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLT SVLASRCSFN SSPNTAFASS ETTGSEMPPM IPPKPKISTF 

251 TPKRCNA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 17>: 

a004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCCGG ACCTACGATT 

101 TCTGCGCCGA TGAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTT ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCGGA CTGGTGAACA CTACGCCGAT 

301 GGTGCTGCGG CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCGGCTTC ATGCAGCAGG GGCAGTTGGT 

4 01 TGGACGCGTC GCCCGCAATA AAGATATGCG GAATACTGGT CTGCATAGTC 

4 51 AGCGGATCGG CAACGGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATGTT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCTACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCGTC GAGTTTGGCC TCGGTTTTAG CATCCAAATG 

651 CAGTTTCAAT TCTTCACCGA ACACGGCTTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATGCCACCCA AACCGAAAAT TTCAACTTTC 

751 ACGCCCAAAC GGTGCAATGC CTGA 
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This corresponds to the amino acid sequence <SEQ ID 18; ORF 004.a>: 

a004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR TYDFCADEAA GGFFGIQAHM 

51 AFVYQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGEHYAD 

101 GAAAQTAADI RVAAALSPAI LPWSAASCSR GSWL DASPAI KICGILVCIV 

151 SGSATGT PRA SFSILMFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLA SVLASKCSFN SSPNTAFASS ETTGSEMPPM MPPKPKISTF 

251 TPKRCNA* 

m004/a004 94.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

m004 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 

IMMIINIIIItllinillllHIIIhlll II: MIIIIIIM:j Mill 

a004 MVERHIQHLRNGHLHLMCPSQQVRQMFGGRTYDFCADEAAGGFFGIQAHMAFVYQHHAAA 
10 20 30 40 50 60 

70 80 90 100 110 120 

m004 . pep ALVFERYFADDKFVGLVLRGNLRVFOTDKADLRTGKHHADGAAPQTAADIRVAAALSPAI 
N II I I I I M I I I I I I I I I I I I I I | | | | | | | | | | | : | : | | | | | I | | j | | | I 1 | | | | | ! | 
a004 ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGEHYADGAAAQTAADIRVAAALSPAI 

7 0 80 90 100 110 120 

130 140 150 160 170 180 

m004 .pep LPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRPT 
I I i M I I N I I I I I I I ill : I i I I I I I | : | M I II 1 1 : I | ! | | | | : | | | | | | | |] | | | | | 
a004 LPWSAASCSRGSWLDASPAIKICGILVCIVSGSATGTPRASFSILMFSKPILSTFGRRPT 

130 140 150 160 170 180 

190 200 210 220 230 240 

m004.pep AASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPPM 
' I I I I I II M I I I I M I I I I I [ I I II I I I : I I I I I : M I I I I I I I I I I I I I I I I I I | | | | 
a004 AASIYSATNTPFSPSCSQWTSTLPSASSLASVLASKCSFNSSPNTAFASSETTGSEMPPM 
190 200 210 220 230 240 

250 

rn004.pep IPPKPKISTFTPKRCNAX 
: I II I I I I I I I I I I I I I I 
a004 MP PK PK I ST FT PKRCNAX 

250 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 004 shows 93.4% identity over a 258 aa overlap with a predicted ORF (ORF 004.ng) 
from N. gonorrhoeae: 

m004/g004 

10 20 30 40 50 60 

m004 . pep MVERHIQHLRNGHLHl^CPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I | | | | | || | | | | | | | | : j | | | | i 
g004 MVERHIQHI^NGHLHI^RPCQQVSQMFGGRAYDFRADKAAGGFFGIQAHMAFVYQHHAAA 

10 20 30 40 50 60 . 

70 80 90 100 110 119 

ra004 . pep ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAA-LSPA 

s|:IMIII.IIMIIIII M i I I i I I I I I I I 1 M : | | | 1 | | | t I I I I I II 

g004 TLIFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHANGAAAQTAADIRVAAPRYCPA 

70 80 90 100 110 120 



m004 .pep 



120 130 14C 150 160 170 179 

ILPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRP 
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I I I 1 I I I E I t I I I 1 I M I i I : I I I I = M I 1 I 1 I I I I 1 I El I I : I I I : II I I I I I 

g004 ILPWSAASCSRGSWLDASPAIKICGMLVCMVSGSATGTPRASLSILMFSKPILSTFGRF.P 
130 140 150 160 170 160 

180 190 200 210 220 230 239 

ra004.pep TAASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 
I I > : I N II I I I I I IMI I 1 I I I I I | | | | | | I | I | | | | | | | | l | | ! | | | ,, | | , | | ,, ,| 
g004 TAANIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSSTTGSEMFP 
190 200 210 220 230 240 

240 25C 
m004 .pep MIPPKPKISTFTPKRCNAX 

I I I I I I I I I I I I I I I II 
g004 MIPPKPKISTFTPKRCNA 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 9>: 

g005 . seq 

1 ATGGGGATGG ACAATATTGA TATGTTCATG CCTGAACAAG A3GAAATCCA 

51 ATCAATGTGG AAAGAAATTT TACTGAATTA CGGTATTTTC CrGCTCGAAC 

101 TGCTTACCGT GTTCGGCGCA ATTGCGCTGA TTGTGTTGGC TATCGTACAG 

151 AG T AAG AAAC AGTCGGAAAG CGGCAGTGTC GTACTGACAG ATTTTTCGGA 

201 AAATTATAAA AAACAGCGGC AATCGTTTGA AACATTCTTT TTAAGCGAGG 

251 AAGAGACAAA ACATCAGGAA AAAAAAGAAA AGAAAAAGGA AAAGGCGGAA 

301 GCCAAAGCAG AGAAAAAGCG TTTGAAGGAG GGCGGGGAGA AATCTGCCGA 

351 AACGCAAAAA TCCCGCCTTT TTGTGTTGGA TTTTGACGGC GATTTGTATG 

401 CACACGCCGT AGAATCCTTG CGTCATGAGA TTACGGCGGT GCTTTTGATT 

451 GCCAAGCCTG AAG AT GAG GT TCTGCTCAGA TTGGAAAGTC CGGGCGGCGT 

501 GGTTCACGGT TACGGTTTGG CGGCTTCGCA GCTTAGGCGT TTGCGCGAAC 

551 GCAATATTCC GCTGAccgtc gccgTCGATA AGGTCGCGGC AAGCGgcggc 

601 tatatgatgg cgtgtgtgGC GGATAAAATT GTTTCCGCtc cgtttgcggt 

651 catcggttcg gtgggtgtgg tgGcggaagt gcCGAAT/iTC CAccgCctGT 

701 TGAA7AAAACA T GAT ATT GAT GTGGATGTGA TGACGGCGGG CGAATTTAAG 

751 CGCACGGTTA CTTTTATGGG TGAAAATACG GAAAAGGGCA AACAGAAATT 

801 CCGGCAGGAA CTGGAGGAAA CGCATCAGTT GTTCAAGCAG TTTGTCAGTG 

851 AAAACCGCCC CGGGTTGGAT ATTGAAAAAA TAGCGACGGG CGAGCATTGG 

901 TTCGGCCGGC AGGCGTTGGC GTTGAACTTG ATTGACGAGA TTTCGACCAG 

951 TGATGATTTG TTGTTGAAAG CGTTTGAAAA CAAACAGGtt aTCGAAGTGA 

1001 AATATCAGGA GAAGCGAAGC CTGATCCAGC GCATTGGTTT GCAGGCGGAA 

1051 GCTTCCGTTG AAAAGTTGTT TGCCAAACTT GTCAACCGGC GAGCGGATGT 

1101 GATGTAG 

This corresponds to the amino acid sequence <SEQ ID 20; ORF 005 .ng>: 

g005.pep 

1 MGMDNIDMFM PEQEEIQSMW KEILLNYG IF LLELLTVFGA IALIV LAIVQ 

51 SKKQSESGSV VLTDFSENYK KQRQSFETFF LSEEETKHQE KKEKKKEKAE 

101 AKAEKKRLKE GGEKSAETQK SRLFVLDFDG DLYAHAVESL RHEITAVLLI 

151 AKPEDEVLLR LES PGGVVHG YGLAASQLRR LRERNIPLTV AVDKVAASGG 

201 YMMACVADKI V SAPFAVIGS VGWAE VPNI HRLLKKHDID VDVMTAGEFK 

251 RTVTFMGENT EKGKQKFRQE LEETHQLFKQ FVSENRPGLD IEKIATGEHW 

301 FGRQALALNL IDEISTSDDL LLKAFENKQV IEVKYQEKRS LIQRIGLQAE 

351 ASVEKLFAKL VNRRADVM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21>: 

m005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AwAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CACAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 
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301 GCAGAGAAAA A.CGTTTGAA GGAGGGTGGG GAGAAATCTG CCGAAACGCA 

351 nAAATCACGC CTTTTTGTGT TGGANNNNNN NNNNNNNNNN NNNNNNNNNN 

4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGCGAGCGG CGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGCTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTGGTGGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

701 AAC AT GAT AT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

751 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGT A 

1101 G 



This corresponds to the amino acid sequence <SEQ ID 22; ORF 005>: 

m005 . pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGA1A LIV LAIVQSK 

51 KQSXSGSWL TDFSENYKKQ RQSFEAFFLS GEEAQHQEKE EKKKEKAEAK 

1C1 AEECXRLKEGG EKSAETXKSR LFVLXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXASGGYM 

2C1 MACVADK IAS APFAIVGSVG WAEV PNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKCKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23>: 

a005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AAAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CAAAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 

301 GCAGAGAAAA AGCGTTTGAA GGAGGGTGGG GAGAAATCTT CCGAAACGCA 

351 AAAATCCCGC CTTTTTGTGT TGGATTTTGA CGGCGATTTG TATGCACACG 

4 01. CCGTAGAATC CTTGCGTCAT GAGATTACGG CGGTGCTTTT GATTGCCAAG 

4 51 CCTGAAGATG AGGTTCTGCT TAGATTGGAA AGTCCGGGCG GCGTGGTTCA 

501 CGGTTACGGT TTGGCGGCTT CGCAGCTTAG GCGTTTGCGC GAACGCAATA 

551 TTCCGCTGAC CGTCGCCGTC GATAAGGTGG CGGCGAGCGG TGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGTTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTTGTAGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

701 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

751 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGT A 

1101 G 

This corresponds to the amino acid sequence <SEQ ID 24; ORF 005. a>: 

aOOS.pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LATVOSK 

51 KQSESGSWL TDFSENYKKQ RQSFEAFFLS GEEAKHQEKE EKKKEKAEAK 

101 AEKKRLKEGG EKSSETQKSR LFVLDFDGDL YAHAVESLRH EITAVLLIAK 

151 PEDEVLLRLE SPGGWHGYG LAASQLRRLR ERNIPLTVAV DKVAASGGYM 

201 MACVADK IVS APFAIVGSVG WAEV PNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 
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301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 
351 VEKLFAKLVN RRADVM* 



moos/aoos 79.2% identity over a 366 aa overlap 



mOOS.pep 
a005 

mOOS.pep 
a005 

mOOS.pep 
a005 

mOOS.pep 
a005 

mOOS.pep 
a005 

mOOS.pep 
a005 



10 20 30 40 50 60 

MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSWL 
I M I I f M f I I Ml i I ! I | | | | | ) | M I I I M I I I I I f I I I I I | | | | | M M I Mill! 
MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSWL 

10 20 30 40 50 . 60 

70 80 90 100 110 120 

TDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKXRLKEGGEKSAETXKSR 
H I I II I I I i I I I I I i I I I I I I I: I I I I I I | | 1 | | i | | | | | | | | | | | , j || : | , ,,| 

TDFSENYKKQRQSFEAFFLSGEEAKHQEKEEKKKEKAEAKAZKKRLKEGGEKSSETQKSR 
70 80 90 100 110 120 

130 140 150 160 170 180 

LFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
till : 

LFVL DFDG DLY AHAVES LRH E I T A VLL I AK PEDE VLLRLE S ?GG WHG YG LAAS QLRRLR 
130 140 150 160 170 180 

190 200 210 220 230 240 

XXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 
: H M I I I I I I I i I I : Ml | [ | | M I I I I I I | I | | ! | | | | | | | | | | | 

ERN I PLTV A V DKV AAS GG YMMACVADK IVSAPFAIVGS VG VV AE V PN I H R L LKKH D I D VD 

190 200 210 220 230 240 

250 260 270 280 290 300 

VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 
I Nil IN I II II I N HI Mill Ml III II I Ml III IIMMII I M MM I Ml II 
VMTAGEFKRT^/TFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 

250 26C 270 280 290 300 

310 32C 330 340 350 360 

RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 
I N M I I M II II M I i M li MM M M I I I M I I M M I II II I I I I I I M i I I M I I 
RQ AL ALN LIDEISTSD DL LL KAFENKQV I E VK YQE KG S L I QR I G LQAE AS VEKLFAKLVN 

310 320 330 340 350 360 



m005.pep RRADVMX 
I I I I I I I 

a005 RRADVMX 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 005 shows 77.0% identity over a 366 aa overlap with a predicted ORF (ORF 005.ng) 
from N. gonorrhoeae: 

m005/g005 

10 20 30 4C 50 

mOOS.pep MDN I DMFMPEQEE IQSMWKE ILLNYGI FLLELLTVFGAIALI VLAIVQSKKQSXSGSV 

I I 1 M r I I I I M I I M M I M I I II I I r M M I t I I M II I I 11 ! t I I I I I II MM 
g005 MGMDN I DMFMPEQEE IQSMWKE ILLNYGI FLLELLTVFGAIALI VLAIVQSKKQSESGSV 

10 20 30 4 0 50 60 

60 70 80 90 100 110 

mOOS.pep VLTDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKKRLKEGGEKSAETXK 

' M M I I I I M M I I M : I I II M:: M M: MMMM III III III MMIIMI I 
g005 VLTDFSENYKKQRQSFETFFLSEEETKHQEKKEKKKEKAEAKAEKKRLKEGGEKSAETQK 
? 0 80 90 100 110 120 
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120 130 140 150 160 170 

SRLFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
I I I I I ! 

SRLFVLDFDGDLYAHAVESLRHEITAVLLIAKPE DEVLLRLESPGGVVHGYGLAASQLRR 
130 140 150 160 170 190 

180 190 200 210 220 230 

XXXXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGWAEVPNIHRLLKKHDID 
: I M I { I 1 I I I I I I f: i I I I I :: | | > I I I I I H I I ! I I I I I M I I 

LRERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAVIGSVGVVAEVPNIKRLLKKHDID 
190 200 210 220 230 240 

240 250 260 270 280 290 

VDVMTAGEFKRTVT FMGENTEKGKQKFRQELEETHQL FKQFVSENRPQLD IEE VATGEHW 
I M I M I I I I I i I I M I I I I I I I [ I I I I I I U I | ! | j | | | | | | [ M | MllrrlMMI 
VDVMTAGEFKRTVT FMGENTEKGKQKFRQELEETHQLFKQFVSENRPGLDIEKIATGEHW 
250 260 270 280 290 300 

300 310 320 330 340 350 

FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLOAEASVEKLFAKL 
I I M 1 I I I I I I I I I M I I I I I I I I I I II I I I I I I I I M : I I I I I I I I I I I I I i I M I I I I 
FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKRSLIQRIGLQAEASVEKLFAKL 
310 320 330 340 350 360 

360 
VNRRADVMX 

E 1 IE I I I j I 

VNRRADVMX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 25>: 

g006.3eq 

1 ATGCTGCTGG TGCTggaatt ttggctCGGc gtGtCGGCGG TGGGCatact 

51 tgCGTTGTTT TTATGGCttt TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC gcgACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GTACCGCCAT TACGGACTGG TTTCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCG 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACGGCA GCGCGGGGCA TATTTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ATGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 CI ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 

451 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 26; ORF 006.ng>: 

g006 .pep 

1 MIiLVLE FWFG VSAVGILALF L.WLLPRFAAI SENLYFRLNN SLERDNHFIR 

51 KGDERQLYRH YGLVSRLRVL ISNREAFGYL CVGAAMGILF GFAFVM MTLK 

101 GYGSAGHIYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 27>: 

m006.seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACC GGCGGCAGCT GTACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 



m005.pep 
g005 

mOOS.pep 
g005 

mOOS.psp 
g005 

mOOS.pep 
g005 

mOOS.pep 
g005 
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353 GTTTGCCATG AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 
4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 
451 GCCGGAACTTGA 

This corresponds to the amino acid sequence <SEQ ID 28; ORF 006>: 

mO06.pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLNN GLERDNHFIR 
51 KGDRRQLYRH YGLLARLRVL ISNREAFGYL CVGTAMGILF GFAFVM MTLK 
101 GYSSAGHVYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 
151 AGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29>: 

a006.soq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAAGAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GGACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

3C1 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATA AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 CI ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGAAACG GAACATCAAA 

4 51 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 30; ORF 006.a>: 

a006.pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLKN SLERDNHFIR 
51 KGDERQLDRH YGLLARLRVL ISNREAFGY L CVGTAMGILF GFAFVM MTLK 
101 GYSSAGHVYS VGTYLWMFAI SLDDVPRLVE QYSNLKDIGQ RIEWSKRNIK 
151 AGT* 

moo6/a006 96.7% identity over a 1 53 aa overlap 

10 20 30 40 50 60 

mOO 6 . pep MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLSRDNHFIRKGDRRQLYRH 
H I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I | I I I f M I I I! I I : I I I 1 I 
a006 MLLVLE FWVGVSAVG I LALFLWLLPRFAA I SENLYFRLKN S LI RDNHFIRKGDERQLDRH 

10 20 30 40 50 60 

™ 80 90 100 110 120 

m006.pep YGLLARLRVL I SNREAFGYLCVGTAMGI L FGFAFVMMT LKG YS S AG HVYS VGTYLWMFAM 

I I I I I I I I I I I I 1 1 I I I I I f I I I t I I I ^ I I I I I I I | 1 | 1 | | | | | t I I I I 1 1 I I t 1 1 I I I = 
a006 YGLLARLRVL I SNREAFGYLCVGTAMGIL FGFAFVMMT LKG YSSAGHVYS VGTYLWMFAI 

70 80 90 100 110 120 

130 140 150 

m006.pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
f I I I I I M I I I I I I I I I II I I I I I I : | | | ! | | | | 
a006 SLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 006 shows 95.4% identity over a 153 aa overlap with a predicted ORF (ORF 006.ng) 
from K gonorrhoeae: 

m006/g006 

10 20 30 40 50 60 

m006.pep MliVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLRRDNHFIRKGDRRQLYRH 
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■ I ■ 1 1 ■ 1 ■ MIIIIIIIIIIMMMIMIIIIIIIMIIII MIIMIIIII 

9006 MLLVLEFWFGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDERQLYRH 
10 20 3 0 40 50 60 

70 80 90 100 110 120 

mO 3 6 . pep YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAM 
I M-IIIIIIIIIIIM lillhlll! II I I || | II I II I hi! 11:1 II ! Ill Hill 
9006 YGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSAGHIYSVGTYLWMFAM 
70 80 90 100 110 120 

130 140 150 

m006 .pep SLDDVPRLVEQYSNLKDI GQRI EWSERNI KAGTX 

IMIIIIIMIIIIIIIIII I E E I I I I I I t I I 
g 1 1 8 SLDDVPRLVEQYSNLKDI GQRI EWSERNI KAGT 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3 1>: 

g006-l.seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCAAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCCGGTCG GACTGGAAAA CCTTTTGATG CTGGGGTATC 

101 CGGTGTTTGG CGGCTGGGCG ATTAATGCCG TGATTGCGGG GAGGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTATTT TTGATGTGGC T3GTCGGTGC 

201 GGCACGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTTGTGTTG GAACAACGGC AGCGGCAAGT CCCGCATTCA 

301 GCGGTAACTG CACGGGTTGC CCTGTCGCGT GAATTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

401 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACGAGCGG CAGCTGTACC GCCATTACGG ACTGGTTTCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCGCGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CGGCAGCGCG GGGCATATTT ATTCGGTCGG CACTTATCTG 

751 TGGATGTTTG CCATGAGTTT GGACGATGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 32; ORF 006-l.ng>: 

g006-l.pep 

1 MWKMLKHIAK THRKRLIGTF SPVGLENLLM LGYPVFGGWA INAVIAGRVW 

51 Q ALLYALWF LMWLVGAAR R IADTRTFTRI YTEIAVPWL EQRQRQVPHS 

101 AVTARVALSR EFVS FFEEHL PIA ATSWSI FGACIMLLVL E FWVGVSAVG 

151 ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDER QLYRHYGLVS 

201 RLRVLISNRE AFGY LCVGAA MGILFGFAFV M MTLKGYGSA GHIYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33>: 

m006- 1 . seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCCAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCTGGTCG GACTGGAAAA CCTTTTGATG CTGGTGTATC 

101 CGGTGTTTGG CGGCCGGGCG ATCAATGCCG TGATTGCGGG GGAGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTGCTT TTGATGTGGC TGGTCGGTGC 

201 GGTGCGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTCGTGTTG GAACAGCGGC AGCGACAAGT CCCGCATTCG 

301 GCGGTAACTG CGCGGGTTGC CCTGTCGCGT GAGTTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCCCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACCGGCGG CAGCTGTACC GCCATTACGG ACTGCTTGCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 
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651 CGGCACGGCG 

701 TCAAAGGCTA 

751 TGGATGTTTG 

801 TTCCAATTTG 

851 TCAAAGCCGG 



ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 
CAGCAGCGCG GGGCATGTCT ATTCGGTCGG CACTTATCTG 
CCATGAGTTT GGACGACGTG CCGCGATTGG TCGAACAATA 
AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 
AACTTGA 



This corresponds to the amino acid sequence <SEQ ID 34; ORF 006- 1>: 

mO 06-1. pep 

1 MWKMLKHIAQ THRKRLIGTF SLVGLENLLM LVYPVFGGRA INAVIAGEVW 
QALLYALVVL LMWLVGAV RR IADTRTFTRI YTBIAVFWL EQRQRQVPHS 
AVTARVALSR EFVSFFEEHL PI AATSWSI FGACIMLLVL EF WVGVSAVG 
ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDRR QLYRHYGLLA 
RLRVL1SNRE AFGY LCVGTA MGILFGFAFV M MTLKGYSSA GHVYSVGTYL 
WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 



51 
101 
151 
201 
251 



moo6-i/g006-i 95.5% identity in 288 aa overlap 



10 20 30 40 50 60 

m006-l.pep mwkmlkhiaqthrkrligtfslvglenllmlvypvfggrainaviagevwqallyalwl 

I H I I I I II :! I I I I I I I I I I I I I I I I I I I I i I I i i I I I [ I I I I : I ( I I I j j I [ i i : 

g oo6-i mwkm^khiakthrkrligtfspvglenllmlgypvfggwainaviagrvwqallyalwf 

10 20 30 40 50 60 

™ 80 90 100 110 120 

m006-i . pep lmwlvgavrriadtrtftriyteiavpvvleqrqrqvphsavtarvalsrefvsffeehl 
lillllliilllllll IIIIIIIIIIIIIMIIMIII IIIIIIIMIIill tlllini 
g oo6-i lmwlvgaarriadtrtftriyteiavpvvleqrqrqvphsavtarvalsrefvsffeehl 

70 80 90 100 110 120 

130 140 150 160 170 180 

m006-l.pep PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRT.NNSLERD 

i 1 1 1 i i i 1 1 i 1 1 i i i i i r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 « i r 1 1 1 1 1 1 1 1 1 1 1 f 

g006-l P I AAT S VV S I FG AC I ML LVLE FW VG V S AVG I L AL FLW L L P R FAA I S E N L Y FRLNN SLE RD 

130 140 150 160 170 180 

190 200 210 220 230 240 

m006-l.pep NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMI4TLKGYSSA 
M I I I I H : I I I I t I ! I I :: M I I I I I I I I I | | | | | | | : |( | | | l | | | | | | | | | | | | : | | 
g006-l NHFIRKGDERQLYRHYGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSA 

1?0 200 210 220 230 240 

250 260 270 280 289 

m006-l.pep GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKD1GQRIEWSERNIKAGTX 
I I : I I I I I I M I I I 1 I I I I I I I I I I I I I I I I I I I I f | I M I I I I I I | M 
g006-l G H T Y S VGT YLWMFAM S L D DV PRL VEQY S N LKO I GQR I EW S ERN I KAGTX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35>: 

a006-l.seq (partial) 

1 . . AGCCAAAACC ACCGCAAGCG ATTGATTGGC ACATTTTTTC TGGTCGGACT 

51 GGAAAACCTT TTGATGCTGG TGTATCCGGT GTTTGGCGGC TGGGCGATTA 

101 ATGCCGTGAT TGCGGGGCAG GCGTGGCAGG CGTTGCTGTA CGCTTTGGTT 

151 GTGCTTTTGA TGTGGCTGGT CGGTGCGGCG CGGCGGATTG CCGATACGCG 

201 CACGTTTACG CGGATTTATA CCGAAATCGC CGTGCCGGTT GTGTTGGAAC 

251 AGCGGCAGCG GCAAGTCCCG CATTCGGCGG TAACTGCGCG GGTTGCCCTG 

301 TCGCGTGAGT TTGTCAGCTT TTTTGAAGAA CACCTGCCGA TTGCCGCGAC 

351 ATCCGTCGTA TCCATATTCG GCGCGTGCAT CATGCTGCTG GTGCTGGAAT 

4 01 TTTGGGTCGG CGTGTCGGCG GTGGGCATAC TTGCGTTGTT TTTATGGCTT 

451 TTGCCACGTT TTGCCGCCAT CAGCGAAAAC CTGTATTTCC GCCTGAAGAA 

501 CAGCTTGGAA CGCGACAACC ACTTTATCCG AAAAGGCGAC GAGCGGCAGC 

551 TGGACCGCCA TTACGGACTG CTTGCGCGCC TGCGTGTGCT GATTTCCAAC 

601 CGCGAAGCCT TCGGCTATCT CTGCGTCGGC ACGGCGATGG GTATTTTGTT 

651 CGGCTTTGCT TTTGTGATGA TGACGCTCAA AGGCTACAGC AGCGCGGGGC 
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701 ATGTCTATTC GGTCGGCACT TATCTGTGGA TGTTTGCCAT AAGTTTGGAC 
751 GACGTGCCGC GATTGGTCGA ACAATATTCC AATTTGAAAG ACATCGGACA 
801 AC GG AT AG AG TGGTCGAAAC GGAACATCAA AGCCGGAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 36; ORF 006-l.a>: 

a006-l.pep (partial) 

1 . . SQNHRKRLIG TFFLVGLENL LMLVYPVFGG WAINAVIAGQ AWQALLYALV 

51 VLLMWLVGAA RRIADTRTFT RIYTEIAVPV VLEQRQRQVP HSAVTARVAL 

101 SREFVSFFEE HLPI AATSW SIFGACIMLL VL EF WVGVSA VGILALFLWL 

151 LPRFAAI SEN LYFRLKNSLE RDNJHFIRKGD ERQLDRHYGL LARLRVLISN 

201 REAFGY LCVG TAMGILFGFA FVMM TLKGY5 SAGHVYSVGT YLWMFAISLD 

251 DVPRLVEQYS NLKDIGQRIE WSKRNIKAGT * 



a006-l/m006-l 



a006-l.pep 



m006-l 



a006-l.pep 



m006-l 



95.7% identity in 280 aa overlap 

10 . 20 30 40 50 

SQNHRKRLIGTFFLVGLENLLMLVY PVFGGWAINAVIAGQAWQALLYALWL 
:|: IN llfll I i ! I M I 1 I II i I I I II I I I I I I I I :: I I I I I II I I I I 
MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALVVL 
10 20 30 40 50 60 

60 70 80 90 100 110 

LMWLVGAARRIADTRTFTRIYTEIAVPWLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
i r 1 1 1 1 1 = f 1 1 1 1 1 r 1 I I I I I 1 I | I 1 I | | | | I | | | I | | | 1 | I I I I I 1 I | | | | | | | | | | | | 

LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
70 80 90 100 110 120 



120 130 140 150 160 170 

a006-l.pep PIAATSWSIFGACIMLLVLEFWVGVSAVGIT.ALFLWLLPRFAAISENLYFRLKNSLERD 
' N I I I I I I I I : I I I I I I I I I I I M I I I I I I I I I I I I I I I | M | | | | | j | | | | j : | | | | | } 
mO 0 6- 1 PIAATSWS I FG AC IMLLVLE FWVG VS AVG I LALFLWLL PRFAAI S EN L Y FRLNN SLERD 

130 140 150 160 170 180 

180 190 200 210 220 230 

a006-l . pep NHFIRKGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 

i ii ii Mi: iii i in mi in tin imim mini in it mini 

mC06-l NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
190 200 210 220 230 240 



240 250 260 270 280 

a006-l . pep GHVYSVGTYLV?MFAI3LDDVPRLVEQYSNLKDIGQRIEWSKF.NIKAGTX 
III I I I E I I I I I I I : I I III I I I i I I I | I I Ml I I 1 I I I I : III I | I | | 
m006-l GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSEF.NIKAGTX 
250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 37>: 

g007 . seq 

1 atgaACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGcgC 

51 CGCcGCTTCT GCCGccgaca acAGCatcat gaCaAAAGGG CAAAAAGTGT 

101 ACGAATCcAa ctGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGtTTC CTccgctTTT CCggtcgGac tgtattatga acaAACCGCa 

201 cgTCCtgctg cacagca-gg tcaaaggcAt cgacgggaca ttcaaagtgg 

251 agcggcaaaa cctacgacgg atttatgCcc gcaaccgcca tcagcgATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 38; ORF 007.ng>: 

g007.pop 

1 MNTTRLPTAF ILCCLCAAAS AA DNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD CIMNKPHVLL HSMVKGIDGT FKVERQNLRR IYARNRHQRC 

101 GHCRRRHLYH ERL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 39>: 
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m007 . eeq 

1 ATGAACACAA CCCGACTGCC 

51 CGCCGCTTCT GCCGCCGACA 

101 ACGAATCCAA CTGCGTCGCC 

151 ACCATGTTTC CGCCGCTCTA 

201 GGTGCTGCTG CACAGCA7GG 

251 AACGGCAAAA CCTACAACGG 

301 GGACATTGCC GCCGTCGCCA 



GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 
ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
CCGCTCCGAC TTCATCATGA AAAAACCGCA 
TCAAAGGCAT CAACGGTACA ATCAAAGTC. 
ATTCATGCCC GCAACCGCCA TCAGCGATGC 
CTTATATCAT GAACGCCTTT GA 



This corresponds to the amino acid sequence <SEQ ID 40; ORF 007>: 

m007 -pep 

1 MNTTRLPTAL VLGCFCAAAS A ADN5IMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARNRHQRC 

• 101 GHCRRRHLYH ERL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41>: 

a007 . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTC. 

2 51 AACGGCAAAA CCTACAACGG ATTCATGCCC GCCACTGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 42; ORF 007. a>: 

a007.pep 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARHCHQRC 

101 GHCRRRHLYH ERL* 



m007/a007 97.3% identity over a 1 13 aa overlap 



10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I | | j | M | | | | | | | | | | 

a007 MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

70 80 90 100 110 

m007 . pep FIMKKPQVLLH SMVKGI NGT I KVXRQNLQRIHARNRHQRCGHCRRRH LYHERLX 

I I M I I t I I It f II M I II M M I I I I M I i » I I : 1 I i I I I I I I I I II I I I M 
a007 FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARHCHQRCGHCRRRHLYHERLX 

70 80 90 100 110 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from//, gonorrhoeae 

ORF 007 shows 86.7% identity over a 1 13 aa overlap with a predicted ORF (ORF 007.ng) 
from N. gonorrhoeae: 

m007/g007 



10 20 30 40 50 60 

ID007 . pep MNTT RL PT AL V LGC FC AAAS AA DN S I MT KGQKV Y ESN CVACHG K KG EGRG TM FP PL YR S D 

IMIMIII::! I : i I II I I I ( I I ! I I | | | | I : I I I I I I I I I I I I lllhlll 

g007 MNTTRLPTAFILCCLCAAASAADNSIMTKGOKVYESNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 

70 80 90 100 110 

m007 . pep FIMKKPQVLLH SMVKGI NGT I KVXRQNLQRIHARNRHQRCGHCRRRH LYHERLX 

ll:M:|lllllllll:||:|| I i 1 I : I I : I I I | | | | I I I I I I I I I I I I I I 
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g007 CIMNKPHVLLKSMVKGIDGTFKVERQNLRRIYARNRHQRCGHCRRRHLYHERL 
70 80 90 100 110 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 43>: 

g007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

1C1 ACGAATCCAA CTGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGTTTC CTCCGCTTTT CCGGTCGGAC TATATTATGA ACAAACCGCA 

2 CI CGTCCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAGGC AAAAAAAAC. 

This corresponds to the amino acid sequence <SEQ ID 44; ORF 007-l.ng>: 

g007-l.pep (paxtial) 

1 MNTTRLPTAF ILCCLCAAAS A ADNSIMTKG QKVYESNCIA CHGKKGEGRG 
51 TAFPPLFRSD YIMNKPHVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKG KKN . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 45>: 
m007-l .seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 
201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 
2 51 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 
351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAGC AAAAAAAACT 
401 AA 

This corresponds to the amino acid sequence <SEQ ID 46; ORF 007- 1> 

m007-l .pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKS KKN* 

m007-i / g007-i 91.7% identity in 133 aa overlap 

10 20 30 40 50 60 

ra007-l.pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I I I I I I I :: I I : I M I M I I M I I I I I I I I I i I I i : 1 I | M | | ( | M | I t I I : I I I 
g007-l MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 

70 80 90 100 110 120 

m007-l.pep FIMKKPQVLLHSMVKGINGT IKVNGKTYNG FMPATAISDADIAAVATYIMNAFDNGGGSV 

: I I : I I : I I I I I I I I I I I I I II I I I I ! i I I I I I t I I I I I M I i t I I I t I I I I 

g007-l YIMNKPHVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 

130 

ra007-l .pep TEKDVKQAKS KKN X 

I I I I I I I t I : I I I 
g007-l TEKDVKQAKG KKN 

130 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47>: 

a007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
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151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

2 51 ACGGCAAAAC CTACAACSGA TTCATGCCCG CCACTGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAAC AAAAAA. . 

This corresponds to the amino acid sequence <SEQ ID 48; ORF 007-l.a>: 

a007-l.pep (partial) 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKN KK. . 

m007-i/a0O7-i 98.5% identity in 132 aa overlap 

10 20 30 40 50 60 

mC07-l . pep MN T T R L PT AL V LG C FC AAAS AA DN S I MT KGQKV YE S NC V ACH G KKGEGRGTM F P P L YR S D 
\ I I N I I I I I I I I 1:1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | M | | | | | | 
a007-l MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

70 80 90 100 110 120 

m007-l.pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 

a007-l FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
70 80 90 100 110 120 

130 

Itl007-1 .pep TEKDVKQAKSKKNX 

I I I I I I I I I : I I 
a 007-1 TEKDVKQAKNKK 
130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 49>: 

g008.seq 

1 ATGAACAACA GACATTTTGC CGTCAtcgCC TTGGGCAGCA ACCTTGACAA 

51 CCCCGCACAA CAAATacgcg gcgcattaga cgcgctctcg tcccatcctg 

101 acatccggct tgaaCaggtt tcctcactgt aTatgaccgc acctgtcggt 

151 tacgAcaaTC agcccgATTT CATCaatgcc gTCTgcaccg TTTCCACCAC 

201 CtTGGACGGC ATTGcccTGC TTGCCgaACT CAAccgTATC GAAGCCGATT 

251 TCGGACGCGA aCGCAGTTTC CGCAATGCAC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGCC TTACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAATACG GAAAGGTTGT CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGACA GGTAA 

This corresponds to the amino acid sequence <SEQ ID 50; ORF 008.ng>: 

g008 .pep 

1 MNNRHFAVIA LGSNLDNPAQ QIRGALDALS SHPDIRLEQV SSLYMTAPVG 

51 YDNQPDFINA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKYGKWELS 

151 KRLGNQGIRL LPDR* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 51>: 

mOOS.seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCTCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

1C1 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT TGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 TCTGGACGGC ATTGCCyTGC TTGCCGAACT CAACCGTATC GAGGCTGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GkATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACACsCGAC TcACCtTGCC 
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351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATCCGCCCT TTGGCAGAAA 
4 01 TCCTCCCTGA TTTTGTTTTA GGAAAACACG GAAAGGTTGC CGAATTGTCA 
4 51 AAACGGyTGG GCAATCAAGG TATCCGTCTT TTACCGGACA GGTAATT 

This corresponds to the amino acid sequence <SEQ ID 52; ORF 008>: 

m008.pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLXLD 

101 IIDFDGISSD DTRLTLPHPR AHERSFVIRP LAEILPDFVL GKHGKVAELS 

151 KRLGNQGIRL LPDR* 

The following partial DNA sequence was identified in N. meningitidis<SEQ ID 53>: 

aOOB.seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCCCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT CGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 CTTGGACGGC ATTGCCCTGC TTGCCGAACT CAACCGTATC GAAGCCGATT 

251 TCGGACGCGA ACGCAGCTTC CSCAACGCGC CGCGCACATT GGATTTGGAC 

301 AT TAT CG ACT TTGACGGCAT CTCCAGCGAC GACCCCCGAC TCACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAACACG GAAAGGTTGC CGAATTGTCA 

451 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGATA AGTAA 

This corresponds to the amino acid sequence <SEQ ID 54; ORF 008.a>: 

a008 .pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDJIL GKHGKVAELS 

151 KRLGNQGIRL LPDK* 

mO08/aooe 97.6% identity over a 164 aa overlap 

10 20 30 40 50 60 

m008 . pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 
Mil tdilllltlMIEIIIIIIIIIIIM! Ill III lIMIIillllllMII HIM 
a008 MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 

10 20 30 40 50 60 

70 00 90 100 110 120 

mOOO.pep VCTVSTTLDG IALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 

1 1 1 « 1 1 1 1 1 i 1 1 i r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 1 1 r iiiiiiiiiiin iiiitiii 

a008 VCTV STTLDG IALLAELNRI EADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 

70 80 90 100 110 120 

130 140 150 160 

m008 . pep AHERSFVIRPLAE ILPDFVLGKHGKVAELSKRLGNQG IRLLPDRX 

I I M I I I I I I I I I I I I I I : I II I I 1 I | M | | | | | | | | | | | | | | : | 
a008 AHERSFVIRPLAE I LPDFI LGKHGKVAELSKRLGNQGIRLLPDKX 

130 140 150 160 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 008 shows 92.7% identity over a 164 aa overlap with a predicted ORF (ORF008.ng) 
from N. gonorrhoeae: 

zn008/g008 

10 20 30 40 50 60 

m008 . pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 

lillllllMllll|:|||||:|:|||:||||||||| :l :||MIIIIM t|:|| 

g008 MNNRHFAVIALGSNLDNPAQQIRGALDALSSHPDIRI^QVSSLYMTAPVGYDNQPDFINA 
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10 2C 30 40 50 60 

70 8C 90 100 110 120 

mO 0 8 . pep VCTVSTTLDG I ALLAE LNRIEADFGRERS FRNAPRTLXLDI IDFDG I SS DDTRLTLPHPR 

11 II I I I M I II I I I I I I ! I I I I I I I I [ I H I ] I | | | i i I I I M I I M I t M I M M I 
g008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 
70 80 90 100 110 120 

130 140 150 160 

AHERS FVI RPLAE I LP DFVLGKHGK VAELSKRLGNQGI RLLPDRX 
I I I M I I I I 11 I I I 1 1 I I : II I r I I I : ) | | | | | | 1 1 | | M I M I I 
AHERS FVI RPLAE I LPDFILGKYGKWELSKRLGNQGI RLLPDRX 
130 140 150 160 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 55>: 

g00 9 . seq 

1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC 

2 CI TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AaaaGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 56; ORF 009.ng>: 

g009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARNQSVMAV 
51 QLPLVAFSDK WVAFQAWQ AEIQVFA DGG KTWQKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 57>: 

m009 . seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC CGGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 58; ORF 009>: 

m009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPPVAFSDK WVAFQAWQ AEIQVFA DGG KTWQKP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 009 shows 97.7% identity over a 86 aa overlap with a predicted ORF (ORF 009.ng) 
from N. gonorrhoeae: 

m009/g009 

10 20 30 40 50 60 

m009 . pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
I I M II I I I I I I I I I t I I I I > I I I I I I I I I I i I I I E I M I I I I : | I | H | | | | mill 
g009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARNQSVMAVQLPLVAFSDK 
10 20 30 40 50 60 

70 80 
m009 . pep V V VAFQAV VQAE I QV FADGGKT WQK PX 

I I I I 1 1 I I I I 1 1 I I I 1 1 I 1 1 ! 1 1 I I I I 
g 0 0 9 V WAFQAVVQAE I QV FADGGKT WQK PX 

70 80 



m008.pep 
g008 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 59>: 

a009. seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CAT CATC AC A AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTC 

151 CAGCTGCCGC TCGTCGCCTT CTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTCTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 60; ORF 009.a>: 

a009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAE IA EGFAVGNQHT QARKQSVMAV 
51 QLPLVAFSDK VWAFQAVLQ AEIQVFA DGG KTWQKP* 

m009/aoo9 97.7% identity over a 86 aa overlap 

10 20 30 40 50 60 

m009 . pep M PRAAVAFE RHHH K SKAEQN T HRRAD AE I AEG FA VGN QHTQARKQ S VMAVQL P P V AFS DK 

H I I I I III I ! I II I I I II I I I I I I I I II I I I I I | 1 I I II I I I I I I | | | | | | I 

a009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPLVAFSDK 

10 «20 30 40 50 60 

70 80 
mO 0 9 . pep WVAFQAWQAEI QVFADGGKT WQKPX 

I I I I I II I : I I I I I I I [ I I I I I I I | | I 
a 0 0 9 WVAFQAVLQAEI QVFADGGKTWQKPX 

70 80 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 61>: 

g010 . seq 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

401 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA. CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAacgc 

801 cgacggcgaA cgcTTTATGG AAcgctatgc GCcgACCGta aAagaCTTGG 

851 CTTCTCGCga cgtGGTTTCA CgcgcGatgG CGatggaAAt ctatgaaggt 

901 cgcggctgTG GtaaAAAcaA agaCCacgtC TTACTGAAAA TCGACcAtAt 

951 cggtGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCagtttgc cGGTATCGAT CCGATTAAAG ACCCGATTcc cgttgTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCcg aCCAATTATC ACGGTGAAGT 

1.101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

• 1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 

1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 62; ORF 010.ng>: 

gOlO.pep 
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1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

2C1 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

3C1 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVVVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF RPTPR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 63>: 

mOlO.seq (PARTIAL) 

1 . . nTCCAATTAT CCAAATCCGG TCTGAATTGT GCCGTTTTGT CTAAAGTGTT 

51 CCCGACCCGT TCGCATACCG TAGCGGCGCA GGGCGGTATT TCCGCCTCTn 

101 TGGGTAATGT GCAGGAAGAC CGTTGGGACT GGCACATGTA CGATACCGTG 

151 AAAGGTTCCG ACTGGTTGGG CGACCAAGAT GCGATTGAGT TTATGTGCCG 

201 CGCCGCGCCT GAAGCCGTAA TTGAGTTGGA ACACATGGGT ATGCCTTTTG 

251 ACCGTGTGGA AAGCGGTAAA ATTTATCAGC GTCCTTTCGG CGGCCATACT 

301 GCCGAACACG GTAAACGCGC GGTAGAACGC GyCTGTGCGG TTGCCGACCG 

351 TACAGGTCAT GCGATGCTGC ATACTTTGTA CCAACAAAAC GTCCGTGCCA 

401 ATACGCAATT CTTTGTGGAA TGGACGGCAC AAGATTTGAT TCGTGATGAA 

451 AACGGCGATG TCGTCGGCGT AACCGCCATG GAAATGGAAA CCGGCGAAgT 

501 TTATATTTTC CACGCTAAAG CTGTGATGTT TGCTACCGGC GGCGGCGGTC 

551 GTATTTATGC GTCTTCTACC AATGCCTATA TGAATACCGG CGATGGTTTG 

601 GGTATTTGTG CGCGTGCAGG TATCCCGTTG GAAGACATGG AATTCTGGCA 

651 ATTCCAGCCG ACCGGCGTGG CGGGTGCGGG CGTGTTGATT ACCGAA 

This corresponds to the amino acid sequence <SEQ ID 64; ORF 01 0>: 

mOlO.pep (PARTIAL) 

1 . .XQLSKSGLNC AVLSKVFPTR SHTVAAQGGI SASXGNVQED RWDWHMYDTV 

51 KGSDWLGDQD AIEFMCRAAP EAVIELEHMG MPFDRVESGK IYQRPFGGHT 

101 AEHGKRAVER XCAVADRTGH AMLHTLYQQN VRANTQFFVE WTAQDLIRDE 

151 NGDWGVTAM EMETGEVYIF HAKAVMFATG GGGRIYASST NAYMNTGDGL 

201 GICARAGIPL EDMEFWQFQP TGVAGAGVL1 TE . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65>: 

aOlO.seq 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

451 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT AT AT GAT GGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CAT G ATT AAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
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1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA 

14 01 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG C3ATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGA 

This corresponds to the amino acid sequence <SEQ ID 66; ORF 010.a>; 
aOlO.pep 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 

301 RGCGKNKDHV LLKI DHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

401 TNSLLDLWF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GYFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGARA SDDHPERDDE 

551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY* 



moio/aoio 98.7% identity over a 231 aa overlap 

10 20 30 

mOlO .pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

I I I I I I I i I I I I I I I I I I I I I I I II I I I I I I I II! 
a010 MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 
10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO.pe? Q E DRW DWHM Y DT VK G S D W LG DQ D A I E FMCRAA PE AV I E LE HMGM P FDR VE S GK I YQR P FG 

1 I I t I I I II I I I I I I I I I I I I I I I I I I t I II I I I I I I I I I I I 1 I I | | | | t I I I I I I i I i 
a010 QEDRWDWHMYDTVKGSDWLGDQ DA I EFMCRAAPEAVIELE HMGMPFDRVE SGKIYQRPFG 

70 80 90 100 110 120 

100 110 120 130 140 150 

mOl 0 . pep G HT AEHG KRAVERX C A V ADRT GHAMLHT L YQQN VRANT QF EVE WT AQ D L I R DEN G DWGV 

IMMMIIIM! I I I I 1 I I I I I I 1 I I I I I I I I ! I r I f I J I I I t f I 1 I I J I I I I I I I | J 
aOlO GHTAEHGKRA VERA CAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 

130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO.pep T AMEMET GE V Y I FHAKAVM FATGGGGRI YAS STN AYMNTG DGLG I CARAG I PLE DME FWQ 

H I I I ! I t I I I I I I t I ! I I I I I I I | I I I M f I 1 I I I I I I ! I M I I I M i I j I I I I M 1 I | 
aOlO TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 
190 200 210 220 230 240 

220 230 
mO 1 0 . pep FQPTGVAGAG VLITE 

( : I I I I I I I I I I f t I 

a 0 1 0 FHPTGVAGAGVLITEG VRGEGG I LLNADGER FMERYAPTVKDLASRD WSRAMAME I YEG 

250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 010 shows 98.7% identity over a 231 aa overlap with a predicted ORF (ORF OlO.ng) 
from N. gonorrhoeae: 

m01O.pep/gO10 .pep 

10 20 30 

mO 10. pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

M N I I I I I I I I I I | | | | | | | | | | | [ | | | M t Hi 
g010 MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 
10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO.pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

Mil M INI III Mill! I llll I II I II || || MM I II I III II INI! Ill 

gOlO QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
70 80 90 100 110 120 

100 110 120 130 140 150 

mOl 0 . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 

ilMilimill M III I III MM II ! IMII III II M I I II llll II llll MM 
gOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 
130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO.pep TAMEMETGEVY I FHAKAVMFATGGGGRI YAS STNAYMNTGDGLG ICARAG I PLEDMEFWQ 

MMIf IMIMI IMIMIMM Mill MIIIMIIIIMI Ml III Ml IMIMII 
gOlO TAMEMETGEVY I FHAKAVMFATGGGGRI YAS STNAYMNTGDGLG I CARAGT PLEDMEFWQ 

190 200 210 220 230 240 

220 230 
mOl 0 . pep FQ PT GV AG AG V LITE 

I : I I I I I I I I I 1 I I I 

gOlO H'HPTGVAGAGVLITEGVRGEGGILLNADGERFNERYAPTVKDLASRDWSRAMAMEIYEG 
250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 67>: 

g010-l .seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC . GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

IC'1 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

3 CI CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

4 CI CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

4 51 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

7 01 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAACGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTGGTTTCA CGCGCGATGG CGATGGAAAT CTATGAAGGT 

901 CGCGGCTGTG GTAAAAACAA AGACCACGTC TTACTGAAAA TCGACCATAT 

951 CGGTGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTCC GGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 
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1201 ACGAACTCCC TGCTGGACTT 3GTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 68; ORF 010-Lng>: 

g0 10-1. pep 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIK EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

401 TNSLLDLVVF RPTPR* 

g010-l / P10444 

sp|P10444|DHSA_ECOLI SUCCINATE DEHYDROGENASE PLAVOPROTEIN SUBUNIT 

gnl|PID|dl015210 (D90711) Succinate dehydrogenase, flavoprotein [Escherichia coli) gi| 1786942 
(AE000175) succinate dehydrogenase flavoprotein subunit [Escherichia coli] Length = 588 

Score = 1073 (49S.6 bits), Expect = 6.7e-169, Sum P(2) - 6.7e-169 

Identities = 191/303 (63%), Positives = 238/303 (78%) 



Query: 1 MGFPVRKFDAVIVXXXXXXXXXXXXXSKSGI^CAVI^KVFPTRSHTVAAQGGISASLGNV 60 

M PVR+FDAV++ S+SG CA+LSKVFPTRSHTV+AQGGI+ +LGN 

Sbjct: 1 MKLPWEFDAWIGAGGA^IRAALQISQSGOTCALLSKVFPTRSHTVSAOGGIT-^AI^NT 60 

Query: 61 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 120 

ED W+WHMYDTVKGSD4-+GDQDAIE+MC+ PEA++ELEHMG+PF R++ G+IYQRPFG 
Sbjct: 61 HKDNWKWH^mm^GSDYIGDQDAIEYNCKTGPEAII^I^^^K2LPFSRIJ3DGRIYQRPFG 120 

Query: 121 GHTAEHGKJttVERACAVAIDRTGHA>^HTLYQQNVRANTQ 180 

G + G R A ADRTGHA+LHTLYQQN++ +T F EW A DL+++++G WG 

Sbjct: 121 GQSKNFGGEQAARTAAAADRTGHA1.LHTLYQQNLKNHTTIFSF^ALDLVKNQDGAVVGC 180 

Query: 181 TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYhN 240 

TA+ +ETGEV F A+A + ATGG GRIY S+TNA++NTGDG+G+- RAG+P++DME WQ 
Sbjct: 181 TALC I ETG EVVYFKARATVLATGGAGRI YQSTTNAHI NTGDG VGMA I RAGVP VQDMEMWQ 240 

Query: 241 FHPTGVAGAG VL I TEG VRG EGG I LLN ADGERFMERYAPTVKDLASRD VVS RAMAME I YEG 300 

FHPTG+AGAGVL+TEG RGEGG LLN GERFMERYAP KDLA RDW+R++ +EI EG 
Sbjct: 241 FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDLAGRDVVARSIMIEIREG 300 

Query: 301 RGC 303 
RGC 

Sbjct: 301 RGC 303 

Score = 249 (115.0 bits), Expect « 6.7e-169, Sum P(2) = 6.7e-169 
Identities = 53/102 (51%), Positives = 62/102 (60%) 

Query: 309 HVIjLKIDHIGAEKIMEKLPGIREISIQFAGXXXXXXXXXX 368 

H LK+DH+G E + -i-LPGI E+S FA T HYMMGGIPT G+ + 

Sbjct: 310 HAKLKLDHLGKE VLESRL PG I LE LSRTFAHVD PVKE P I P VI PTCHYMMGG I PTKVTGQAL 369 

Query: 369 VPQGD EYEVPVKG LYAAGECACAS VHGANRLGTNS LLDLWF 410 

+V V GL+A GE AC SVHGANRLG NSLLDLWF 
Sbjct: 370 TVNEKGED VW PGLFAVGE IACV S VHGANRLGGNSLLDLWF 411 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 69>: 

mOlO-l.seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCAGCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAg 

151 GGCGGT ATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCGGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 
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451 CAACAAAACG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGTCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1031 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTACC ACGGCGAAGT 

1101 TGTCGTTCCG CAAGGTGAAG ATTACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCTTGT GCTTCCGTAC ACGGTGCGAA CCGCTrGGGT 

1201 ACCAACTCCC TGTTGGACTT GGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCAGGTGA GTTGACCCGC CAACGTATCG AGCGTTTGGA CAACCAAACC 

1351 GATGGTGAAA ACGTTGATGC ATTGCGTCGC GAACTGCAAC GCTCTGTACA 

1401 ACTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTGATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATAC3CT GTACCATTCA GATATCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGATGA 

This corresponds to the amino acid sequence <SEQ ID 70; ORF 010-1>: 

xnOlO-l.pep. . 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI R DENG DWG V TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGEDYEVPVK GLYAAGECAC ASVHGANRLG 

401 TNSLLDLWF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

451 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWOTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DINTLSYKPV HTKPLSVEYI KPAKRVY* 

m010-l / g010-l 99.5% identity in 410 aa overlap 

10 20 30 40 50 60 

raOl 0- 1 . pep MGFPVRKFDAVXVGGGGAGLRAALQIjSKSCT 

i i<i nil ii tu iii ii ii in ii it inn iii i ii nun ii ii ii ii in ii in 

9010-1 MGFPVRKBTIAVIVGGGGAIJIJUVJ^LSKSGI^C^VI^ 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOl 0- 1 . pep QEDRWDWHMYDTVRGSDWI^QDAIEET^RA^ 

mi mini 1 1 1 1 1 1 1 1 1 it 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 [ 1 1 

gOlO-l QEDRTOWHMYDTVKG SDWI^GDQDAl EPMCRAXPEAVl ELEHMGMPFDRVES GKTYQRPFG 

70 60 90 100 110 120 

130 140 150 160 170 180 

m010-l. pep CTIAEH GKRAVERACAVADRTGHAMLRTLYQQNVRAN 

iiniiiiiiiiiiiimiiiiiiiiin iiiiitiiiiiiiiin tiiiini 

g010-l GHTAEHGKRAVERACAVADRTGRAMLHTLYQQNVRAKTQ 

130 140 150 160 170 180 

1^0 200 210 220 230 240 

mOlO- 1 .pep TAMBMBTGEVYIFHAKAVMFATGGGGRXXASSTNAIMNTGDGLGICARAGI PLEDMEFWQ 

1 1 1 1 S 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g010-l TAMEMBTGEVYIFHAKAVMFATGGGGRIYi^STNAYKOTGDGXGICARA^ 
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190 200 210 220 230 240 

250 260 270 280 290 300 

m010-l. pep FHPTGVAGAGVLITEGVRGEGGILLNADGERT^RYW 

I I I II II II I I I I I I I I II II I I II I I I I I I I t I I II I I I I I I I I I I I I I M I I I I I I | I 
g010-l BU PTGVAGAGVI* I TEGVRGE GGHLLNAD GERFMER YAPTVKDLASRDVVS RAMAME I TEG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m010-l .pop RGCGKNKDHVI^KIDHIGAEKlMEKLPGlREISIQFAGIDPIKDPIPVVPrTHY>lMGGIP 
I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I 
g010-l RCCGIQfXDHVIXKXDHIGAEKIKEKLPGXREZSIQFAGZDPIKDPIPVVPTTHy^lMGGIP 

310 320 330 340 350 360 



370 380 390 400 410 420 

m010-l. pep TNYHGBVVVPQGBD YBVPVKGLYAAGECACA3 VH GANRLGTNSLI^LWFGKAAGD SMIK 

I I I I I I I I I I I I :: I I I I I i I I t I I t I I i I I I I I I I I I I I I I I I [ | I I f | 
g010-l TNYHGEVVVPQGDEYEVPVKGLYAAGECAC^SVHGANFX^ 

370 380 390 400 410 



430 440 450 460 470 480 

m010-l. pep FIl^QSDWireiiPANAGELTRQRIERIJ3NQTIX3ENVDALKR^ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71>: 

a010-l . seq. . 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

4 01 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

7 01 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATC3AT CCGATTAAAG ACCCGATTCC CGTTG7GCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA 

14 01 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

17 51 AGCGCGTTTA TTGA 



This corresponds to the amino acid sequence <SEQ ID 72; ORF 01 0-1 .a>: 
a010-l.pep.. 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 
51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 
101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 
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151 
201 
251 
301 
351 
401 
451 
501 
551 



QQNVRANTQF 
ATGGGGRIYA 
VLITEGVRGE 
RGCGKNKDHV 
TTHYMMGGIP 
TNSLLDLVVF 
DGENVDALRR 
KSKVWNTARI 
NWMKHTLYHS 



FVEWTAQDLI 
SSTNAYMNTG 
GGILLNADGE 
LLKIDHIGAE 
TNYHGEVWP 
GKAAGDSMIK 
ELQRSVQLHA 
EALELDNLIE 
DANTLSYKPV 



RDENGDWGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGDEYEVPVK 
FIKEQSDWKP 
GVFRTDEILS 
VAKATLVSAE 
HTKPLSVEYI 



TAMEMETGEV 
IPLEDMEFWQ 
KDLASRDVVS 
EISIQFAGID 
GLYAAGECAC 
LPANAGELTR 
KGVREVMAIA 
ARKESRGAHA 
KPAKRVY* 



YIFHAKAVMF 
FHPTGVAGAG 
RAMAMEIYEG 
PIKDPrPWP 
ASVHGANRLG 
QRIERLDNQT 
ERVKRTEIKD 
SDDHPERDDE 



m010-l / a010-l 99.3% identity in 567 aa overlap 

10 20 30 40 50 60 

aOl 0 - 1 . pep MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

liiiiiimiiiiiiiiMii liiiiiitiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

aOl 0 - 1 MGFPVRKFDAVIVGGGGAGLKAALQLSKSGLNCA VLSKVFPTRSHTVAAQGGISASLGNV 

10 20 30 40 50 60 

70 80 90 100 110 120 

aOl 0 - 1 .pep QEDRWDWHWVTDTVKGSDWLGCODAIEFMCRAAPSAVIEI^HMGMPFDRVESGKIYQRPFG 
llllllllllltlllllllilllllilllllllillllllillllllilllllltlllM 
mOl 0 - 1 QEDRWDVH IMYDTVKG3DWLGDQDA I E FMCRAAPE AV I E LEHMGMPFDR VE SG KI YQRP FG 

70 80 90 100 110 120 

130 140 150 160 170 180 

aO 1 0 - 1 . pep GHTAEHGKRAVERACAVADRTGHAMLHTLYQONVRANTOF FVEWTAQD LI RDENGDWGV 

!ll]l!llillllltllillll!illllllllll!lllilNllllllllilll[lllll 

raO 1 0 - 1 GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFV 

130 140 150 160 170 180 

190 200 210 220 230 240 

a010-l - pep TAMEMETGEVYI FHAKAVMFATGGGGRI YASSTNAYMNTGDGLGI CARAG I PLEDMEFWQ 

1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 

m010-l TAMEMETGEVYI FHAKAVMFATGGGGRI YASSTNAYMNTGDGLGI CARAG I PLEDMEFWQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

aO 10 - 1 . pep FK PTGVAGAGVLI TEGVRGEGGI LLNADGERFMERYAPTVKDLAS RDWSRAMAME I YEG 

I II UNI lllllil II IMII IIIIIMIMI MllllliM II MM! II II IMIII 

mOlO -1 FH PTGVAGAGVLI TEG VEGEG3 1 LlWADGERFMERYAPTVlCDIiAS RDWSRAMAME I YEG 

250 260 270 280 290 300 

310 320 330 340 350 360 

a010-l .pep RGCGKNKDHVI^KIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 

llllllllllllllllilllllllM II IlilllMI II IMIIIMIMI I! Illlll 

m010-l RGCGKNKDHVI^KIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 
310 320 330 340 350 360 

370 380 390 400 410 420 

a010-l .pep TNYRGEVWPQGD E YEV P VKGLYAAGECACASVHG ANRLGTNS LLDLWFGKAAGDSMI K 

IMMMI llll-MMl I II MM Ml Mlllll M MM MIIMMI! HIMIM 
mO 10 - 1 TNYHGEVWPC^EDYEVPVKGLYAAGECACASVHGANRXGTN^ K 

370 380 390 400 410 420 



430 440 450 .460 470 480 

aO 10 - 1 . pep F IKEQSDW KPLPANAGE LTRQR I ERLDNQTDGENVDALRRELQRS VOLHAGVFRTDB I LS 

ii iiiiiMiiiiiiiiiiiii iiiiiiiiiiiiiiii iitiiiiiiiiiiiiiiiiin 

m010-l FIKEQSTJWKPLPANAGELTRQRIER1J5NQIXW3ENVDALRRELQRSVQLHAGVFRTDBILS 
430 440 450 460 470 480 

490 500 510 520 530 540 

a010-l .pep KGVREVMAI AERVKRTE IKDKSKVWNTARIEALELDNLI EVAKATLVSABARKESRGAHA 

1 1 till II INI I MIliM II II II II Mil MINI llllllilllllllllllllll 

mOlO - 1 KGVREVMAIA£RVKJ^TEIKDK5KV>Q7rARIEALEIJ)NLIEVAKATLVSAEARKESRGAHA 
490 500 510 520 530 540 

550 560 570 580 

aOl 0 - 1 . pep SDDHPERDDENWMKHTLYHSDANTLSYKPVHTKPLSVEYI KPAKRVYX 

lllllllllllllllllllll llllllllllllllllilllllllM 
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m010-l SDDHPERDDENWMKHTLYHSDINTLSYKPVHTKPLSVEYI KPAKRVYX 

550 560 570 580 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 73>: 

gOll . seq 

1 ATGAAGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAC CCGCCTTACC GAAGATATGA AAACCGCGAT GCGCGCCAAA 

151 GATCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAATGCCG CCGTCAAACA 

201 GTTTGAAGTA GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGCAAAG ACGGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGACGT 

351 GCTGCACCGC TACCTGCCGC AAATGCTCTC CGCCGGCGAA ATCCGCACCG 

401 CCGTCGAAGC AGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

451 GGCAAAGTGA TGGTCGTATT GAAAAcccGC CTCGCCGGCA AAGccgATAT 

501 GGGCGAAGTC AACAAAATCT TGAAAAccGt aCTGACCGCC tga 



This corresponds to the amino acid sequence <SEQ ID 74; ORF 01 1 .ng>: 

gOll.pep 

1 MKTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKTRLT EDMKTAMRAK 

51 DQV5LGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDGAKIYT 

101 EAGRQDLADK ENAEIDVLHR YLPQMLSAGE IRTAVEAAVA ETGAAGMADM 

151 GKVMWLKTR LAGKADMGEV NKILKTVLTA * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 75>: 

m0 1 1 . seq (partial ) 

1 ATGAGGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAT CCGCCTTACC GAAGACATGA AAACCGCGAT GCGCGCCAAA 

151 GACCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAACGCCG CCGTCAAACA 

201 GTTTGAAGTG GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGAAAAG ACAGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGAGGT 

351 ACTGCACCGC TACCTTCCCC AAATGCTTTC CGCCGGCGAA ATCCGTACCG 

401 AGGTCGAAGC TGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGTAAAGTCA TGGGGCTGCT GAAAACCCGC CTCGCAGGTA AAGCCGA. . . 



This corresponds to the amino acid sequence <SEQ ID 76; ORF 01 1>: 

mOll.pop (partial) 

1 MRTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKIRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDSAKIYT 

101 EAGRQDLADK ENAEIEVLHR YLPQMLSAGE IRTEVEAAVA ETGAAGMADM 

151 GKVMGLLKTR LAGKA 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 01 1 shows 95.8% identity over a 165 aa overlap with a predicted ORF (ORF 01 l.ng) 
from N. gonorrhoeae: 
mOll/gOll 

10 20 30 40 50 60 

mOll.pep MRTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKIRLTEDMKTAMRAKDQVSLGTIRL 

i : n 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 i 1 1 1 1 1 f 1 1 1 r i 

gOll MKTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKTRLTEDMKTANRAK DQVSLGTIRL 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOll.pep INAAVKQFEVDERTEADDAKITAILTKMVKQRKDSAKIYTEAGRQDLADKENAEIEVLHR 
I > t I I I I 1 1 I I I J I I I I I t I I | I | | | | | | I | | | | : | | | | | | | | | i | i | | | | | 1 1 i r 1 1 | | 
SOU INAAVKQFEVDERTEADDAKITAILTKMVKQRKDGAKIYTEAGRQDLADKENAEIDVLHR 
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70 80 90 100 110 120 

130 140 150 160 

mOll . pep YLPQMLSAGE I RTE VZAAVAETGAAGMADMGKVMGLLKTRLAGKA 

I I I I M I I I I ; I I I I I I I I I I I I I I I I I I I I I I : I I I I I I | I | 
g 0 1 1 YLPQM LS AGE I RT A V2 AAVAETGAAGMADMGK VMVVLKTRL AGKADMGEVN KI LKTVLTA 

130 140 150 160 170 180 

gOll X 



The following partial DNA sequence was identified in M. gonorrhoeae <SEQ ID 77>: 

g012 .seq 

1 ATGCTCGCCC GTCGCTATTT TTTCAATATC CAACCCGGGG CGGTTTTCAC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGCCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACa 

201 gGcggTGGAT ATTCGgcact tccgCcacca cacccaccga accgatgacc 

251 gcaaacggaG CGGAAACAAT TTTATCCGCc acacacgcca tcatatagcc 

301 gcCGCTTGCC GCGACCTTAT CGAcggcgac ggTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

4 01 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GG AT TCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCGCCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CTTTTTTTTC CTGATGTTTT GTCTCTTCCT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 78; ORF 012.ng>: 

g012.pep 

1 MLARRYFFNI QPGAVFTDKL LEQLMRFLQF LPEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRHFRHHTHR TDDRKRSGNN FIRHTRHHIA 

101 AACRDLI DGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPAL LQTLFLCFGF 

201 RLFLFLFFFF LMFCLFLA* 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 79>: 

ro012 . seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCn nnnnnnnnnn nnnnnnnnim iinrinminnnn nnnnnnnnnn 

351 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 51 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

501 nnnnnnnnnn nnnnnnnnnC AACACAAAAA GGCGTGATTT nTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGT TTTTCcTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTGT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 80; ORF 012>: 

m012.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

1C1 AARXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXQHKKA*F XRFGRFLPTL LQTFFLCFGF 

2C1 RLFLFLFLFF LMLCLFPA* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 81>: 
a012 .seq 

1 ATGCTCGCCC GTTGCCACTT 

51 TGACAAACTG CTTGAACAAC 

101 TTCTGTTTGC CCTTTTCCGT 

151 AAATTCGCCC GCCGTCATCA 

201 GGCGGTGGAT ATTCGGTACT 

251 GCAAACGGAG CGGAAACAAT 

301 ACCGCTCGCC GCCACCTTAT 

351 CGCGCAAACG CCTAAGCTGC 

401 CCGCCCGGAC TTTCCAATCT 

451 CAAAAGCACC GCCGTAATCT 

501 ACAAATCGCC GTCAAAATCC 

551 GAAGATTTCT CCCCACCCTC 

601 CGCCTTTTCC TTTTTCTTTT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 82; ORF 012.a>: 

a012.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLOF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLI FRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 R LFLFLFLFF LMFCLFPA * 

moi2/aoi2 64.2% identity over a 21 8 aa overlap 

10 20 30 40 50 60 

MLARCHFLN IQLRAVLADKLLEQLMRFLQFLSE FLFALFRI FTHKSNRALKFARRHH IH I 

1 1 I I I M 1 1 1 1 1 II I i I I I I I I I II I I I I I 1 1 1 1 I I 1 1 1 1 1 1 II I I I I I II i I I I I I I | I 

MLARCHFLNI QLRAVLADKLLEQLMRFLQFLSEFLFALFKI FT HKSNRALKFARRHHIHI 
10 20 30 40 50 60 

70 80 90 100 110 120 

NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I I I I I I I I I I I I I I : : ! I I I I I I I I I I : I I I I I I I I I I I : I I : 
NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
70 80 90 100 110 120 

130 140 150 160 170 180 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 
: : : : I I I I I I 

PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQKKKAGF 
130 140 150 160 170 180 

190 200 210 219 

XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I 1 1 I 1 1 I t I I I I : I I I I II I I I I IJ I I I I I I : I I I I I I 
LRFG R FL PT LLQT L FLC FG FRL FL FL FLF FLM FC L FPAX 
190 200 210 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 012 shows 58.7% identity over a 218 aa overlap with a predicted ORF (ORF 012.ng) 
from N. gonorrhoeae: 

m012/g012 

10 20 30 40 50 60 

m012 . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARHHHIHI 
MM MMII II:: I III IH Mill II I M I I I I I I I I I I I I I I I I I I I I I M I I 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIF7HKSNRALKFARRHHIHI 



CCTCAATATC CAATTGAGGG CGGTTCTCGC 
TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 
ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 
CATCCACATC AATATCATGT TTTTTCAACA 
TCCGCTACAA CACCCACCGA ACCGACAATC 
TTTATCCGCC ACACACGCCA TCATATAACC 
CGACGGCGAC GGTCAGCGGA ATATTGCGTT 
GAAGCCGCCA AACCGTAACC GTGAACCACG 
AAGCAGAACC TCATCTTCAG GCTTGGCAAT 
CATGACGCAA GGATTCTACG GCGTGTGCAT 
AACACAAAAA GGCGGGATTT TTGCGTTTCG 
CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 
CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 



m012 .pep 
a0!2 

m012.pep 
a012 

m012 .pep 
a012 

m012 . pep 
a012 
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10 20 30 40 50 50 

70 80 90 100 110 120 

m012 . pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 

I I I I I I I I I I I I : I I I I I I I I I : I I I I I : I I I I I I I I I I : I I 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLI DGDGQRN IAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m012 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

: : : : I I I I I I 

g012 PKLR S RQT VTVN HAART FQSEQN LI FRLGNQKHR RN LMTQG FYG VCI Q I AVK I QHKKAG F 

130 140 150 160 170 180 

190 200 210 219 

m012 . pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I I I I : I I I I : I I I I I I I I I I I I I: I I I I : I I I II 
g012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 83>: 

m012-l.seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CyTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 84; ORF 012- 1>: 

m012-l .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 
51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKREGSN FIRHTRHHIT 
101 AARRHLIDGD GQRN IAFAQT XKLRSRQTVT VNHAARTFQS EQNLIFRLGN 
151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 RLFLFLFLFF LMFCLFPA* 



m012-l/g012 91.7% identity in 218 aa overlap 



10 20 30 40 50 60 

m012-l.pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSMRALKFARRHHIHI 
Mil : I : I I I I I :: I I I i I I I I I I I I I I I I ! I I I II I I I I I I I I I I I I I I I I I i I 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 



70 80 90 100 110 120 

m012-l.pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 
I I I I I II II I I I : I I II I I I I I : I I I I I : I I I I I I I II I : I , i I I I I I I I I I I I I I I I 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m012-l.pep XKLRSRQTVTVNHAARTFQSEQNL I FRLGNQKHRRN LMTQG FYGVC I QIAVKI QHKKAG F 
I I I I I I I I ! I I i II I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I M I II I I I 
g012 P K L R S RQT VT VN HAART FQS EQN L I FR LGN QKH RRN LMT QG F YG VC I QI AVKI QH KK AG F 
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130 



140 



150 



160 



170 



180 



190 200 210 219 

m012-l .pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I I I I I : I I I I I I II I I I I I I I I I I : I I I I t I I ! f I 
g 0 1 2 LRFGR FLPALLQTL FLC FGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

a012-l . seq 

ATGCTCGCCC GTTGCCACTT 
TGACAAACTG CTTGAACAAC 
TTCTGTTTGC CCTTTTCCGT 
AAATTCGCCC GCCGTCATCA 
GGCGGTGGAT ATTCGGTACT 
GCAAACGGAG CGGAAACAAT 
ACCGCTCGCC GCCACCTTAT 
CGCGCAAACG CCTAAGCTGC 
CCGCCCGGAC TTTCCAATCT 
CAAAAGCACC GCCGTAATCT 
ACAAATCGCC GTCAAAATCC 
GAAGATTTCT CCCCACCCTC 
CGCCTTTTCC TTTTTCTTTT 
CGCTTAA 



identified in N, meningitidis <SEQ ID 85>: 



l 

51 
101 
151 
201 
251 
301 
351 
431 
451 
501 
551 
601 
651 



CCTCAATATC 
TGATGCGTTT 
ATTTTCACCC 
CATCCACATC 
TCCGCTACAA 
TTTATCCGCC 
CGACGGCGAC 
GAAGCCGCCA 
AAGCAGAACC 
CATGACGCAA 
AACACAAAAA 
CTTCAAACGC 
CCTCTTTTTC 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
CTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 86; ORF 012-1 .a>: 
a012-l.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 
51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 
101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 
151 QKI1RRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 RLFLFLFLFF LMFCLFPA* 



a012-l/m012-l 



a012-l.pep 



m012-l 



97.2% identity in 218 aa overlap 

10 20 30 40 50 60 

MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I I I I I M I 1 I I I I II I I I I I I I II I I I I I I I I I I I I | | || | | I | | | | | | | | | | | | | | | 
MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
10 20 30 40 50 60 



70 80 90 100 110 120 

a012-l . pep NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
I I 1 1 1 I I i I I I 1 1 I I :: I I I I 1 1 t 1 1! I : I 1 1 I I I M I I I : I I I I I : 1 1 I I I I! I I I I I I 
m012-l NIMFFQQAVDIRYFPxHKTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

a012-l.pep PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNL.MTQGFYGVCIQIAVKIQHKKAGF 

I I I I I M I I I I I I I I I I I h I I I II I I I i I I I I I I II I I I [ I I I I I I I I I I I I I 

m012-l XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 



190 200 210 219 

a012-l .pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I I II I I I ! I I I I I I I i I I I I I I I I I I It I I I I I I | I 
mC'12-1 LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 87>: 
g013.seq 

1 aTgcctttga ccatgctgtg cagcaGGAcg tGCGGTTtgt tcataataca 
51 gtCcgaccGG AAAagcggAG GAAaCGCAGT GCCGCGCCCT TCCCCTTTCT 
101 TGCCGTGGCA GGCGATGCag tTgGATTCGT ACACTTTTTG CCCTTTtGtC 
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151 atgatGCTgt tgtcggCGGC AGAAGCgGCG GcgCAGAGGC AGCACAAGAT 

201 GAAGGCGGTC GGCAGTCGGG TTGTGTtcat tGgcgTTTCC cctaatgttt 

251 tgaaaccttg ttttttgatt Ttgcctttac ggggtgaaaa gtttttTtgg 

301 cccaaatccg gaatttag 

This corresponds to the amino acid sequence <SEQ ID 88; ORF 013.ng: 

g013 .pep 

1 MPLTMLCSRT CGLFIIQSDR KSGGNAVPRP SPFLPWQAMQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQHKMKAV GSR VVFIGVS PNVLKPCFLI LPLRGEKFFW 
101 PKSGI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 89>: 

m013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAAGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTCATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT TTTGCCGAAT 

301 CAGTAG 

This corresponds to the amino acid sequence <SEQ ID 90; ORF 01 3>: 
m013 .pep 

1 MPLTMLCSST CGFFMMKSER XSGGNMVPRP SPFLPWQATQ LDSYTFCPFV 

51 MMLLSAAEAA AQKQPKTRAV GSR VVFIGVS FMFETLLLI L RSGXKIFLPN 

101 Q* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91>: 

a013.seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAGGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTAATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT CTTGCCGAAT 

301 CGGTAG 

This corresponds to the amino acid sequence <SEQ ID 92; ORF 013.a>: 

a013 .pep 

1 MPLTMLCSST CGFFMMKSER *SGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQPKTRAV GSR VVFIGVS LMFETLLLIL RSG*KIFLPN 
101 R* 

m0i3/aoi3 97.0% identity over a 101 aa overlap 

10 20 30 40 50 60 

m013.pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 

miimmmiiimiimimmmimimmimmimi mi iiiiiiim mi n 

a013 MPLTMLCS STCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 

70 80 90 100 

m013 .pep AQKQPKTRAVG S RV V F I GV S FM FET LLL I LRS GXK I FL PNQX 

III! Ill Mill I M I: III III IIMIII Mill!:) 
a013 AQRQPKTRAVGSRWFIGVSLMFETLLLILRSGXKIFLPNRX 

70 80 90 100 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 013 shows 73.3% identity over a 101 aa overlap with a predicted ORF (ORF 013.ng) 
from N. gonorrhoeae: 
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m013/gO13 

10 20 30 40 50 60 

m013.pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
I I I II I I I I I I: I : : : I : I I I I 1 I I I I I II I I I I I II I I I M I M I I I I I I I I I I 
g013 MPLTMLCSRTCGLFIIQSDRKSGGNAVPRPSPFLPWQAMQLDSYTFCPFVHMLLSAAEAA 

10 20 30 40 50 60 

70 80 90 100 

m013 .pep AQKQPKTRAVGSRWFIGVS F-MFETLLLI LR-SGXKIFLPNQX 

I I : I E : I I I I I I I I I I I I :: : : I I I I I : I I : 
g013 AQRQHKMKAVGSRVVFIGVS PNVLKPCFLI LPLRGEKFFWPKSGIX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 93>: 

g015 . seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTTTTTTCCT ACTTTGGAAA AATCCAGAAA 

131 AGCCCTTGGT CGGCTTTTGG AAAGCACTGC CCCACCTCAA CGACACGATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAG ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGCA CAAAAATCCT GCTCCTGTTC GCCTACATCG 

251 CACTGGGCAT GGTAATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCTAT GTGTTGCATC GCCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCAT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 94; ORF 015.ng>: 

g015 .pep 

1 MQYLIVKYSH Q IFVTITILV FNIRFFLLW K NPEKPLVGFW KALPHLNDTM 

51 LLFTGLWLMK ITHFSPFNAP WLGT KILLLF AYIALGMVMM RA RPRSTKFY 

101 TVYLLAMCCI ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 95>: 

mO 1 5 . seq { part ial ) 

1 . . AAAATCAGAA AAGCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA 

51 CGACACCATG CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT 

101 TCTCCCCGTT CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC 

151 GCCTATATCG CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC 

201 CAAGTTCTAC ACCGTTTACC TGCTCGCCAT GTGTTGCGTC GCCTGCATCG 

251 TTTACCTTGC CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 96; ORF 015: 

raOlS.pep (partial) 

1 . . KIRKALAGFW KALPHLNDTM LLFTGLWLMK ITHFSPFNAP WLGT KILLLL 
51 AYIALGMMMM RARPR3TKFY TVYLLAMCCV ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 97>: 

a015.seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTGTTTTCNT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA CGACACCATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC GCCTATATCG 

251 CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCCAT GTGTTGCCTC ACCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCTT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 98; ORF 015.a>: 

a015.pep 
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1 MQYLIVKYSH Q IFVTITILV ?NI RVFXL WK NPEKPLAGFW KALPHLNDTM 
51 LLFTGLWLMK ITHFSPFNAP WLGTK ILLLL AYIALGMMMM RA RPRSTKFY 
101 TVYLLAMCCL TCIVYLAKTK VLPF* 



m0i5/a0i5 96.7% identity over a 91 aa overlap 



10 20 30 

m015.pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I I I I I I I I I I I I I I I I I I I I I II I I I II I 
a015 LIVKYSHQIFVTITILVFbJrRVFXLWKNPEKPLiAGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 

40 50 60 70 80 90 

mOlS.pep FS P FN A PW LGTK I LLLLA Y I ALGMMMMRAR PR S T K FYT VY LLAMCCVACI VYLAKTKVLP 

I I I I I I I I I I I I I I I I I I [ 1 I I I I I I I I I I I I t I I I I I I I I I I I I i :: I I I I I I I I I I 1 I 
a015 FS PFNAPWLGTKILLLLAY IALGMMMMRARPRSTKFYTVYLLAMCCLT CI VYLAKTKVLP 

70 80 90 100 110 120 



m015.pep FX 
I I 

a015 FX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 015 shows 94.5% identity over a 91 aa overlap with a predicted ORF (ORF 015.ng) 
from N. gonorrhoeae: 

m015/g015 



10 20 30 

mO 1 5 . pep KI RKALAGFW KALPHLNDTMLLFTGLWLMKI TH 

I Mill III III III I I UN II I II I I 
g015 LIVKYSHQIFVTITILVF^IRFFLLWKNPEKPLVGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 



40 50 60 70 80 90 

m015 .pep FSPFNAPWLGTKILLLW\YIALGMM^ 

I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I III I I I I I I I I I I : I I I I I I I I I I I I I 
g015 FSPFTJAPWLGTKILLLFAYIAIXSMVMMRARPRSTKFYT^ 

70 80 90 100 110 120 



m015.pep FX 
II 

g015 FX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 99>: 

g018.seq 

1 atGCAGCAGG GGCagttggt tggacgcgtc gcccgcaata AAGATATGCG 

51 GAATgctggt CTGCATggtC AGCGGATCGG CAACGGGtac gccgcgcgcg 

101 tctttgTCGA TATTGATGTT TTCCAAACCG ATATtgTCAA CGTTCGGACG 

151 GCgACCTACG GCTGCCAACA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT tCtACATTGC CGTCTGCGTC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT tctTCTCCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 100; ORF 018.ng>: 

g018 . pop 

1 MQQGQLVGRV ARNKDMRNAG LHGQRIGNGY AARVFVDIDV FQTDIVNVRT 

51 ATYGCQHIFG NKY AFFAILL PMDFYIAVCV EFDLGFSIQM QFQFFSEHGF 
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101 RLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 101>: 

m018.seq 

1 ATGCAGCAGA GGCAGTTGGT TGGACGCATC GCCTGCGATG AAGATATGCG 

51 GAATACTGGT CTGCATGGTC AGCGGGTCGG CAACAGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATATT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCCACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCATC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT TCTTCGCCGA ACACGGCGTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 102; ORF 01 8>: 

m018.pep 

1 MQQRQLVGRI ACDEDMRNTG LHGQRVGNRY AARIFFDIDI FQTDIVNVRT 
51 AAHGCQHIFG NKY AFFAILL PMDFYIAVCI EFDLGFSIQM QFQFFAEHGV 
101 RLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 103>: 

a018. seq 

1 ATGCAGCAGG GGCAGTTGGT TGGACGCGTC GCCCGCAATA AAGATATGCG 

51 GAATACTGGT CTGCATAGTC AGCGGATCGG CAACGGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATGTT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCTACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCGTC GAGTTTGGCC 

251 TCGGTTTTAG CATCCAAATG CAGTTTCAAT TCTTCACCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 104; ORE 018.a>: 

a018.pep 

1 MQQGQLVGRV ARNKDMRNTG LHSQRIGNGY AARIFFDIDV FQTDIVNVRT 
51 AAYGCQHIFG NKY AFFAILL PMDFYIAVCV EFGLGFSIQM QFQFFTEHGF 
101 RLV* 

nt0i8/a0i8 86.4% identity over a 1 03 aa overlap 

10 20 30 40 50 60 

m018 . pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDIFQTDIVNVRTAAHGCQHIFG 
III 11111:1 : : I I I I I I I I : I I : I I i I I I I I I M I : I I I I I I I I I I I I : I I I 1 I M 
a018 MQQGQLVGRVARNKDMRNTGLHSQRIGNGYAAR I FFD I DVFQTD I VNVRT AAYGCQHIFG 

10 20 30 40 50 60 

70 80 90 100 

m018 . pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I I I I I I ! I I I I I I I I I I I : I I 111111111111:111 I I I I 
a018 N K Y AFFA I LL PM D FY I AVC VE FGLG F S I QMQ FQ F FTE HG FR L VX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 018 shows 84.5% identity over a 103 aa overlap with a predicted ORF (ORF 018.ng) 
from N. gonorrhoeae: 

m018/g018 

10 20 30 40 50 60 

mO 1 8 . pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAAR I FFDIDI FQTD I VNVRTAAHGCQH I FG 

III 11111:1 :: I I | I : | | | | | | : | | II II : I I I I : I I I I I I I I I I I :: I I II I I I 
g018 MQQGQLVGRVARNKDMRNAGLHGQRIGNGYAARV FVD I DV FQTD I VNVRTATYGCQH I FG 

10 20 30 40 50 60 
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70 80 90 100 

mOl 8 . pep NKYAFFAI LLPMDFYI AVCIEFDLGFS IQMQFQFFAEHGVRLVX 

I I I I I I I I I I I I i I I I I I I : I I I I I I I I I I I I I I I : I I I I I I I 
g018 NKYAFFAI LLPMDFYI AVCVEFDLGFS I QMQFQFFSEHGFR LVX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 105>: 

g019.seq (partial) 

1 ..ctgctggcgg ccctggtgct tgccgcgtgt tcttcgACAA ACAcacTGCC 
51 AGCCGGCAAG ACCCCGGCAG ACAATATAGA AActgcCgAC CTTTCGGCAA 
101 GCGTTCCCAC ccgcCCTGCC GAACCGGAAG GAAAAACGCT GGCAGATTAC 
151 GGCGGCTACC CGTCCGCACT GGATGCAGTG AAACAGAACA ACGATGCGGC 
201 AGCCGCCGCC TATTTGGAAA Acgcaggaga cagCGcgatg gcGGAAAatg 
251 tccgcaagga gtgGCTGa 

This corresponds to the amino acid sequence <SEQ ID 106; ORF 019.ng>; 

g019.pep (partial) 

1 . .LLAALVLAAC SSTNTLPAGK TPADNIETAD LSASVPTRPA EPEGKTLADY 
51 GGYPSALDAV KQNNDAAAAA YLENAGDSAM AENVRKEWL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 107>: 

m019.seq (partial) 

1 ATGTACCTAC CCTCTATGAA GCATTCCCTG CCGCTGCTGG CGGCCCTGGT 

51 GCTTGCCGCG TGTTCTTCGA CAAACACACT GCCAGCCGGC AAGACCCCGG 

131 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCCGCCCT 

151 GCCGAACCCG AAAGAAAAAC GCTGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCACGCAG ACAGTGGACG CTGTTTGCAC AGGAATACGC 

351 CAAACTCGAA CCGGCAGGGC GCGCCCAAGA AGTCGAATGC TACG CCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCTG AACTGGTCAA AAATACGGGC 

4 51 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

6 01 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGOGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 CGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCCTTGCGC GCCCGACGTT GGGACGAGCT GGCCTCCGTT 

901 ATCTCGCATA TGCCCGAAAA ACTGCAAAAA AGCCCGACCT GGCTCTACTG 

951 GCTGGCACGC AGCCGCGCCG CAACGGGCAA CACGCAAGAG GCGGAAAAAC 

1001 TTTACAAACA GGCGGCAGCG ACGGGCAGGA ATTTTTATGC GGTGCTGGCA 

1051 GGGGAAGAAT TGGGTCGGAA AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAAAC AGCGTCCGCC GCATGGCGGA AGACGGTGCA GTCAAACGCG 

1151 CACTGGTACT GTTCCAAAAC AGCCAATCTG CCGGTGATGC AAAAATGCGC 

1201 CGTCAGGCTC AGGCGGAATG GCGTTTTGCC ACACGCGGCT TTGACGAAGA 

12 51 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

13 01 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 
1351 CGCTATATTT CGCCGTTTAA AGACACGGTA ATCCGCCACG CGCAAAATGT 

14 01 TAATGTCGAT CCGGCTTGGG TTTATGGGCT GATTCGTCAG GAAAGCCGCT 
14 51 TCGTTATAGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 
1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 
1551 ACAACTTTAC ACCGCCGACG GG. . . 

This corresponds to the amino acid sequence <SEQ ID 1 08; ORF 019>: 

m019.pep (partial) 

1 MYLPSMKHSL PLLAALVLAA CSSTNT LPAG KTPADNIETA ELSASVPTRP 
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51 AEPERKTLAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LFAQEYAKLE PAGRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRS FAWGVL 

251 GHYQSQNIiNV PAALDYYGKV ADRRQLTDDQ IEWYARAALR ARRWDELASV 

301 ISHMPEKLQK SPTWLYWLAR SRAATGNTQE AEKLYKQAAA TGRNFYAVLA 

3 51 GEELGRKIDT RNNVPDAGKN SVRRMAEDGA VKRALVLFQN SQSAGDAKMR 

4 01 RQAOAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
451 RYISPFKDTV IRKAQNVNVD PAWVYGLIRQ ESRFVIGAQS RVGAQGLMQV 
501 MPATAREIAG KIGMDAAQLY TADG. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 109>: 

a019.seq 

1 ATGTACCCAC CCTCTCTGAA GCATTCCCTG CCGCTGCTGG TGGNCCTGGT 

51 GCTTGCCGCG TGTTCTTNGA CAAACACACT GTCAGCCGAC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCNGCCCT 

151 GCCGAACCCG AANGAAAAAC GTNGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCGCGCAG ACAGTGGACG CTGTNTGCAC ANGAATATGC 

351 NAAACTCGAA CCGGCANGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCCG AACTGGTCAA AAATACGGGC 

4 51 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACSCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG C-GGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 NGGCAAGGTT GCCGACCQCC C-CCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCNNTNNGC NNNCGNNGTT NGNANGANNT GGCNNCCGNN 

901 ANCNCGNNNN TGCNNGANAA ACNNNNNNAN AGNCNNANNT NGNTNNANTG 

951 NNTGGCACGC AGCCGCGCCG CNACGGGCAA CACGCAANAN GCGGATJAAAC 

1001 TNTACAAACA GGCGGCAGCA NCGGGCANGA ATTTTTATGC NGTGCTGNCN 

1051 GGGGAAGAGT TGGGGCGCAN AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAANC AGCGTCCTCC GTATGGCGGA AGACGGCGCG ATTAAGCGCG 

1151 CGCTGGTGCT GTTCCGAAAC AGCCGAACCG CCGGCGATGC GAAAATGCGC 

1201 CGTCNGGCTC AGGCGGAATG GCGTTTCGCC ACACGCGGCT TCGATGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

1351 CGCTACATTT CGNNNNNTNA NGACACGGTA ATCCGCCACG CGCAAAATGT 

1401 TAATGTCGAT CCGGCGTGGG TTTACGGGCT GATTCGTCAG GAAAGCCGCT 

1451 TCGTTATGGG CGCGCAATCC CGCGTAGGCG CGCAGGG3CT GATGCAGGTT 

1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCG3TA TGGATGCCGC 

1551 ACAACTTTAC ACCGCCGACG GCAATATCCG TATGGGGACG TGGTATATGG 

1601 CGGACACCAA ACGCCGCCTG CAAAACAACG AAGTCCTCGC CACCGCAGGC 

1651 TATAACGCCG GTCCCGGCAG GGCGCGCCGA TGGCAGGCGG ACACGCCCCT 

1701 CGAAGGCGCG GTATATGCCG AAACCATCCC GTTTTCCGAA ACGCGCGACT 

1751 ATGTCAAAAA AGTGATGGCC AATGCCGCCT ACTACGCCTC CCTCTTCGGC 

1801 GCGCCGCACA TCCCGCTCAA ACAGCGTATG GGCATTGTCC CCGCCCGCTG 

1851 A 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 019.a>: 

a019 .pep 

1 MYPPSLKHSL PLLVXLVLAA CSXTNT LSAD KTPADNIETA DLSASVPTXP 

51 AEPEXKTXAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LXAXEYAKLE PAXRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRS FAWGVL 

251 GHYQSQNLNV PAALDYXGKV ADRRQLTDDQ IEWYARAAXX XRXXXXXAXX 

301 XXXXXXKXXX XXXXXXXXAR SRAATGNTQX AXKLYKQAAA XGXNFYAVLX 

351 GEELGRXIDT RNNVPDAGKX SVLRMAEDGA IKRALVLFRN SRTAGDAKMR 

4 01 RXAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
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451 RYISXXXDTV IRHAQNVNVD PAWVYGLIRQ ESRFVMGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADGNIRMGT WYMADTKRRL QNNEVLATAG 

551 YNAGPGRARR WQADTPLEGA VYAETIPFSE TRDYVKKVMA WAAYYASLFG 

601 A PHI PLKQRM GIVPAR* 



moi9/aoi9 88.9% identity over a 524 aa overlap 



10 20 30 40 50 60 

m019.pep MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD 
I I I I : I I I I I I I : I I I I It I I I I I I I I I I I I I II I I M I I I I I I I I I II II 
a019 MYPPSLKHSLPLLVXLVLAACSXTNTLSADKTPADN1ETADLSASVPTXPAEPEXKTXAD 
10 20 30 40 50 60 

70 80 90 100 110 120 

m019.pep YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 

I II II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I MINI 
a019 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLXAXEYAKLE 

70 80 90 100 110 120 

130 140 150 160 170 180 

mOl 9 . pep PAGKAQEVECifADSSKNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

II I I I I I I I I I I I I II I I II I 1 1 II I M M I I I I II I I I II 1 I II I I I II I I I I I I I I I 

a019 PAXRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 
130 140 150 160 170 180 

190 200 210 220 230 240 

m019.pep LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 
Mill I lllll llllltllllilMlillllllllllllll IIIMIIIIIM Ilill II 
a019 LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m019.pep EQRSFAWGVLGHYQSQNLNVPAALDYYGKVADRRQLTDDQIEWYARAALRARRWDELASV 

I II I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I f I 
a019 EQRSFAWGVLGHYQSQNLNVPAALDYXGKVADRRQLTDDQIEWYARAAXXXRXXXXXAXX 
250 260 270 280 290 300 

310 320 330 340 350 360 

mOl 9 .pep ISHMPEKLQKSPTWLYWLARSRAATGNTQEAEKLYKQAAATGRNFYAVLAGEELGRKIDT 
I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I Ml 

aO 1 9 XXXXXXKXXXXXXXXXXX7VRSRAATGNTQXAXKLYKQAAAXGXNFYAVLXGEELGRXIDT 

310 320 330 340 350 360 

370 380 390 400 410 420 

mO 1 9 . pep RNN VPDAGKN S VRRMAE DGAVKRAL VL FQN SQ SAG DAKMRRQAQAE WR FATRG FDE DKLL 

I I I I I I I I I II I I II I I I : I I I i 1 1 I : I I :: II I I I I I I I I I I II I I I I I I I I I I I I 
a019 RNN VPDAGKXSVLRMAEDGAIKRALVLFRNSRT AG DAKMRRXAQAEWR FATRG FDEDKLL 

370 380 390 400 410 420 

430 440 450 460 470 460 

mOl 9 .pep TAAQT A F DHG FY DMAVN S AE RT DRKLN Y T LR Y I S P FKDT V I RH AQN VN V DP AW VY G L I RQ 

I I I I I I M I I II I II I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | I | | J 
a019 TAAQTAFDHGFYDMAVWSAERTDRKLNYTLRYISXXXDTVIRHAQNVNVDPAVIVYGLIRQ 

430 440 450 460 470 460 



490 500 510 520 

mOl 9 . pep ESRFVI GAQS RVGAQGLMQVMPATAREI AGKI GMDAAQLYTADG 

I II I I : I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I 
a019 ESRFVMGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADGNIRMGTWYMADTKRRL 

490 500 510 520 530 540 



a019 



QNNEVLATAGYNAGPGRARRWQADTPLEGAVYAETIPFSETRDYVKKVMANAAYYASLFG 
550 56C 570 580 590 600 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 019 shows 95.5% identity over a 89 aa overlap with a predicted ORF (ORF 019.ng) 
from N. gonorrhoeae: 
g019/m019 

10 20 30 40 49 

g 0 1 9 . pep LLAALVLAACSSTNTLPAGKTPADNI ETADLSASVPTRPAEPEGKTLAD 

i I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I || | I M 
m019 OTLPSMKHSLPLIAALVLAACSSTNTLPAGKTPADNIBTADIjSASVPTRPAEPERKTLAD 
10 20 30 40 50 60 



50 60 70 80 89 

g 0 1 9 . pep YGGYPSALDAVKQNNDAAAAAYLENAGDSAMAENVRKEWL 
I I I I I I I I I I I I I : I I I h I I I I I I I I I I I I I I I I I : M I 
m019 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 
70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 1>: 
g023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 AATGCAGCGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCCCTG CCTAAAGAAT ATCCGGCATG GCAGGCATTT 

151 TTTAGTCAAG CTTGGGTAAA AGTATTTACC CAAGTGAGCT TTATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT TGtctGGCTG 

301 GTCGGCTGCC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 023 ,ng>: 

g023 .pep 

1 MVERKLTGAH YGLRDWVMQR ATAVIMLIYT VALLWLFAL P KEYPAWQAF 

51 FSQAWVKVFT QVSFIAVFLH AWVGTRDLWM DYIKPFGV RL FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 
m023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 GATGCAACGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTTCCCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTTTGGCTG 

301 GTCGGCTGTC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 023>: 

m023 .pep 

1 MVERKLTGAH YGLRDWVMQR ATA VI MLIYT VALLWLFSL P KEYSAWQAF 

51 FSQTWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGVR L FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1 15>: 
a023.seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GGGATTGGGC 

51 GATGCAACGT GCGACCGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCTCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 
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201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG G ACT AT AT N A 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTCTGGCTG 
301 GTCGGCTGCT TGGTGTATTC AATTAAAGTA ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 023.a>: 

a02 3 .pep 

1 MVERKLTGAH YGLRDWAMQR ATA V IMLIYT VALLVVLFAL P KEYSAWQA F 

51 FSQTWVK VFT QVS FIAVFLH AWVGI RDLWM DYXKPFGVR L FLQVAT I VWL 

101 VGCLVY SIKV IWG* " ' 

m023/a023 96.5% identity over a 1 1 3 aa overlap 

10 20 30 40 50 60 

m02 3 . pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLWLFSLPKEYSAWQAFFSQTWVKVFT 
I I I I I I I I I I I I I I I ! : I I II I I I I I I I I I I I II I I I I : I! I I I I I I I I I I ! I I I I I I I | 
a023 MVERKLTGAHYGLRDWAMQRATAVIMLIYTVALLVVLFALPKEYSAWQAFFSQTWVKVFT 
10 20 30 40 50 60 

70 80 90 100 110 

m023.pep QVS F I AV FLH AWVG IRDLWMDYIKP FG VRL FLQVAT I VWL VGC LV Y SVKV I WGX 

I I I I I i I II I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I M I : I I I I I I 
a023 QVSFIAVFLHAWVGIRDLWMDYXKPFGVRLFLQVATIVWLVGCLVYSIKVIWGX 

70 80 90 100 110 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 023 shows 97.3% identity over a 1 13 aa overlap with a predicted ORF (ORF 023 .ng) 
from N. gonorrhoeae: 

g023/m023 

10 20 30 40 50 60 

g02 3 . pep MVER KLTGAH YGLRDWVMQRATAV IML I YTVALLWLFAL PKE YP AWQ AFFS Q AWVKVFT 

I I I I I I i I I II I II I I I I I I I I I I I I I I I I t I I I I I I = = I I I I I I I I I I I t I : I I I I t I 
m02 3 MVERKLTGAH YGLRDVrVMQ RAT AVI ML I YTV ALL WLFSLPK^YS AWQ AFFS OTWVKVFT 

10 20 30 40 50 60 



70 80 90 100 110 

g023 . pep QVS F I AVFLHAWVG I RDLWMD Y I KP FGVRL FLQVAT I VWLVG CLVY SVKV I WGX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I - I I I I I I I I t I I I I I I | 
m023 QVS F I AVFLHAWVG I RDLWMD Y I KPFGVRL FLQVAT I VWLVG CLV Y SVKVI WGX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 17>: 
g025 . seq 

1 ATGTTGAAAC AAAcgACACT TTTGGCAGCT TGTACCGCCG TTGCCGCTCT 

51 GTTGGGCGGT TGcgCCACCC AACAGCCTGC TccTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTATCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 

151 ACGCCGTACA ATGCCGCTCC TGCCGCCAac gatgcGCCgT ATGTGCCGCC 

201 CGTGCAAact gcgccggttT ATTCGCCTCC TGCTTATGTT CCGCcgtCTG 

251 CACCTGCCGT TTCGGgtaca tatgtTCCTT CTTACGCACC CgtcgACATC 

3 01 aacgCGGCGa cgCataCTAT TGTGCGTGGC GACACgGtgt acaACATTTc 

3 51 caaAcgCtac CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA 

4 01 CCGACAATAC GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCaggA 
4 51 TATGCCGCAC CGAAAACCG C AGCCGTAGAA AGCAGGCCCG CCGTACCGGC 
501 TGCCGCGCAA ACCCCTGTGA AACCCGCCGC gcaACCGCCC GTTCAGTCCG 
551 CGCCGCAACC TGCCGCGCCC GCTGCGGAAA ATAAAGCGGT TCCCGCCCCC 
601 GCGCCCGCCC CGCAATCTCC TGCCGCTTCG CCTTCCGGCA CGCGTTCGGT 
651 CGGCGGCATT GTTTGGCAGC GTCCGACCCA AGGTAAAGTG GTTGCOGATT 
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701 TCGGCGGCGG CAACAAGGGT GTCGATATTG CCGGCAATGC CGGACAACCC 

751 GTTTTGGCGG CGGCTGACGG CAAAGTGGTT TATGCCGGTT CAGGTTTGAG 

BOl GGGATACGGA AACTTGGTCA TCATCCAGCA CAATTCCTCT TTCCTGACCG 

851 CGTACGGGCA CAACCAAAAA TTGCTGGTCG GCGAAGGTCA GCAGGTCAAA 

901 CGCGGTCAGC AGGTTGCTTT GATGGGTAAT ACCGATGCTT CCAGAACGCA 

951 GCTTCATTTC GAGGTGCGTC AAAACGGCAA ACCGGTTAAC CCGAACAGCT 

1001 ATATCGCGTT CTGA 

This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 025.ng>: 

g025 .pep 

1 MLKQTTLLAA CTAVAALLGG CATQQPAPVI AGNSGMQTVS SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQT APVYSPPAYV PPSAPAVSGT YVPSYAPVDI 

101 NAATHTIVRG DTVYNISKRY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG 

151 YAAPKTAAVE SRPAVPAAAQ TPVKPAAQPP VQSAPQPAAP AAENKAVPAP 

201 APAPQSPAAS PSGTRSVGGI VWQRPTCGKV VADFGGGNKG VDIAGNAGQP 

251 VLAAADGKW YAGSGLRGYG NI.VTIQHNSS FLTAYGHNQK LLVGEGQQVK 

301 RGQQVALMGN TDASRTQLHF EVRQNGKPVN PNSYIAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 19>: 

m025.seq (partial) 

1 . . GTGCCGCCGG TGCAAAGCGC GCCGGTTTAT ACGCCTCCTG CTTATGTTCC 

51 GCCGTCTGCA CCTGCCGTTT CGGGTACATA CGTTCCTTCT TACGCACCCG 

101 TCGACATCAA CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC 

151 AACATTTCCA AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA 

201 CGGCATGACC GACAATACGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC 

251 CGGCAGGATA TGCCGCACCG AAAGCCGCAG CCGTAAAAAG CAGGCCCGCC 

301 GTACCGGCTG CCGCGCAACC GCCCGTACAG TCCGCACCCG TCGACATTAA 

351 CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC AACATTTCCA 

401 AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA CGGCATGACC 

451 GACAATATGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC CGGCAGGATA 

501 TGCCGCACCG AAAACCGCAG CCGTAGAAAG CAGGCCCGCC GTACCGGCTG 

551 CCGTGCAAAC CCCTGTGAAA CCCGCCGCGC AACCGCCTGT GCAGTCCGCG 

601 CCGCAACCTG CCGCGCCCGC TGCGGAAAAT AAAGCGGTTC CCGCGCCCGC 

651 CCCGCAATCT CCTGCCGCTT CGCCTTCCGG CACGCGTTCG GTCGGCGGCA 

701 TTGTTTGGCA GCGTCCGACG CAAGGTAAAG TGGTTGCCGA TTTCGGCGGC 

751 AACAACAAGG GTGTCGATAT TGCCGGTAAT GCGGGACAGC CCGTTTTGGC 

801 GGCGGCTGAC GGCAAAGTGG TTTATGCCGG TTCAGGTTTG AGGGGATACG 

851 GAAACTTGGT CATCATCCAG CATAATTCTT CTTTCCTGAC CGCATACGGG 

901 CACAACCAAA AATTGCTGGT CGGCGAGGGG CAGCAGGTCA AACG CGGTCA 

951 GCAGGTTGCT TTGATGGGCA ATACCGATGC TTCCAGAACG CAGCTTCATT 

1001 TCGAGGTGCG TCAAAACGGC AAACCGGTTA ACCCGAACAG CTATATCGCG 

1051 TTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 025>: 

m025 -pep (partial) 

1 . .VPPVQSAPVY TPPAYVPPSA PAVSGTYVPS YAPVDINAAT HTIVRGDTVY 

51 NISKRYHISQ DDFRAWNGMT DNTLSIGQIV KVKPAGYAAP KAAAVKSRPA 

101 VPAAAQPPVQ SAPVDINAAT HTIVRGDTVY NISKRYHISQ DDFRAWNGMT 

151 DNMLSIGQIV KVKPAGYAAP KTAAVESRPA VPAAVQTPVK PAAQPPVQSA 

201 POPAAPAAEN KAVPAPAPQS PAASPSGTRS VGGIVWQRPT QGKWADFGG 

251 NNKGVDIAGN AGQPVLAAAD GKWYAGSGL RGYGNLV1IQ HNSSFLTAYG 

301 HNQKLLVGEG QQVKRGQQVA LMGNTDASRT QLHFEVRQNG KPVNPNSYIA 

351 F* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

a025 . seq 

1 ATGTTGACAC CAACAACACT TTAGGTAGCT TGTACCGCCC TTGCCGCTCA 

51 GTTGGGCGGA TGCCCCACCC AACACCCTTC TCCTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTACCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 
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151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
831 
851 
901 
951 
1031 
1051 
1101 
1151 
1201 



ACGCCGTACA 
GGTGCAAAGC 
CACCTGCCGT 
AACGCGGCGA 
CAAATGCTAC 
CCGACAATAC 
TATGCCGCAC 
TGCCGCGCAA 
CGCATACTAT 
CATATCTCTC 
GTT GAG CATC 
CGAAAGCCGC 
ACCCCTGTGA 
TGCCGCGCCC 
CTCCTGCCGC 
CAGCGTCCGA 
GGGTGTCGAT 
ACGGCAAAGT 
GTCATCATCC 
AAAATTGCTG 
CTTTGATGGG 
CGGCAAAACG 



ATGCCGCTCC 
GCGCCGGTTT 
TTCGGGTACA 
CC CAT ACT AT 
CATATCTCTC 
GTT GAG CATC 
CGAAAGCCGC 
CCGCTCGTAC 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AACCCGCCGC 
GCTGC GGAAA 
TTCGCCTTCC 
CGCAAGGTAA 
ATTGCAGGAA 
GGTTTATGCA 
AGCATAATTC 
GTCGGCGAAG 
CAATACCGAG 
GCAAACCGGT 



TGCCGCCAAC 
ATANGCCT CC 
TACGTTCCTT 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AGTCCGCACC 
GACACGGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
GCAACCGCCT 
ATAAAGCGGT 
GGCACGCGTT 
AGTGGTTGCC 
ATGCGGGACA 
GGTTCCGGTT 
TTCCTTCCTG 
GCCAGCAGGT 
GCTTCTAGAA 
TAATCCGAAC 



GATGCGCCGT 
TGCTTATGTT 
CTTACGCANC 
GACACCGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
CGTCGACATC 
ACAACATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
GTGCAGTCCG 
TCCCGCGCCC 
CGGTCGGCGG 
GATTTCGGCG 
GCCCGTTTTG 
TGAGGGGATA 
ACCGCATACG 
CAAACGCGGG 
CGCAGCTTCA 
AGCTATATCG 



ATGTGCCGCC 
CCGCCGTCTG 
CGTCGACATC 
ACAAGATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
AACGCGGCGA 
CAAACGCTAC 
CCGACAATAC 
TATGCCGCAC 
TGCCGTGCAA 
CGCCGCAACC 
GCCCCGCAAT 
CATTGTTTGG 
GCAACAACAA 
GCGGCGGCTG 
CGGCAATTTG 
GGCACAACCA 
CAGCAGGTCG 
TTTCGAGGTG 
CGTTCTGA 



This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 025. a>: 

a025.pep 

1 MLTPTTL*VA CTALAAQLG G CPTQHPSPVI AGNSGMQTVP SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQS APVYXPPAYV PPSAPAVSGT YVPSYAXVDI 

101 NAATHTIVRG DTVYKISKCY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG 

151 YAAPKAAAVK SRPAVPAAAQ PLVQSAPVDI NAATHTIVRG DTVYNISKRY 

201 HISQDDFRAW NGMTDNTLSI GQIVKVKPAG YAAPKAAAVK SRPAVPAAVQ 

251 TPVKPAAQPP VQSAPQPAAP AAENKAVPAP APQSPAASPS GTRSVGGIVW 

301 QRPTQGKWA DFGGNNKGVD IAGNAGQPVL AAADGKWYA GSGLRGYGNL 

351 VIIQHNSSFL TAYGHNQKLL VGEGQQVKRG QQVALMGNTE ASRTQLHFEV 

401 RQNGKPVNPN SYIAF* 



m025/a025 



97.4% identity over a 351 aa overlap 



10 20 30 

m025.pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

I I I I I! I I II : I I I I I I I I I I I I I I I I I I I 
a025 GMQTVPSAPVYNPYGATPYNAAPAANDAPYVPPVQSAPVYXPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

m025.pep YAPVDINAATHTIVRGDTVYN I SKRYH I SQDDFRAWNGMTDNTLSI GQIVKVKPAG YAAP 

II I I I I I I I I I I I I II I I I : I I I I I I I I I I I I I I I I I I I I I i I I I I I I I II I I I I I I I 
a025 Y AXVD I N AAT HTIVRGDTVYK I SKCYH I SQDDFRAWNGMTDNTLSI GQIVKVKPAG YAAP 

100 110 120 130 140 150 

100 110 120 130 140 150 

m025 . pep KAAAVK S R P AV PAAAQ P P VQ S AP V D I N AAT HT I VRG DTVYN1SKRYHISQDD FRAWNGMT 

1 1 1 1 e 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 i 1 1 m r 1 1 e 1 1 1 1 1 1 r 1 1 i 1 1 1 1 1 1 1 1 1 1 

a025 KAAAVK SRPAV PAAAQ PLVQSAP VD I NAATHT I VRG DTVYN I SKRYH I SQDD FRAWNGMT 

160 170 180 190 200 210 

160 170 180 190 200 210 

m025.pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 
II I I I I I I t I I I I I I I I ill : I I I : Mi I I I I I I I I | | I | | | I | | | | | [ | | | f I I I I I I 
a025 DNTLSIGQIVKVKPAGYAAPKAAAVKSRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 
220 230 240 250 260 270 



220 230 240 250 260 270 

m025 . pep KA V P A P A PQ S P AA S PS G TR S VG G I VWQR PTQG KW A DFGGNN KG V D I AG NAG Q P VLAAA D 

Mllllll llllll lltllllllMMIIIIIIIllllllllillli III llllllllll 
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a025 



KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAAAD 
280 293 300 310 320 330 



280 290 300 310 320 330 

m025.pep GKVVYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQC'VKRGQQVALMGNTDASRT 
I I I i I I I I I I I I II Ml I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M : | | | | 
a025 GKVVYAGSGLRGYGNLVIIOHNSSFLTAYGHNQKLLVGEGQCVKRGQQVALMGNTEASRT 
340 350 360 370 380 390 



340 350 
mU25 .pep QLHFEVRQNGKPVNPNSY1AFX 
I I i I I I I I I I I I I I I I I I I I I I 
a 02 5 QLH FE VRQNGKPVNPNS Y I AFX 

400 410 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 025 shows 75.6% identity over a 353 aa overlap with a predicted ORF (ORF 025 .ng) 
from N. gonorrhoeae: 

m025/g025 

10 20 30 

m02 5 . pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

llllhlllhll II HI i III II I Mil I 

g025 GMQTVSSAPVYN PYGATP YNAAPAANDAP YVPPVQTAPVYS PP AYVP PS APAVSGTYVPS 

40 50 60 70 80 90 

40 50 60 70 80 90 

m02 5 . pep YAPVD I NAATHT I VRGDTVYNI S KRYHISQDDFRAWNGMTDNTLS IGQI VKVKPAGYAAP 

II IMIIII IIIMII MIIIIIIIIIMII I III III IIMIIIIIMIMIIIIIIM 

g02 5 YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 

100 110 120 130 140 150 

m025 . pep KAAAVKSRPAVPAAAQPPVQ3APVDINAATHTI VRGDTVYNI S KRYHI SQDDFRAWNGMT 

I 

g025 K 



160 170 180 190 200 210 

m02 5 . pep DNMD S I GQ I VKVKPAG YAAP KT AAVE SRP AV PAAVQTPVKPAAQP P VQS APQ P AAPAAEN 

Ml MM I II MM IMIIIII HIM I II IIMMMI 

g025 TAAVESRPAVPAAAQTPVKPAAQPPVQSAPQPAAPAAEN 

160 170 180 190 

220 230 240 250 260 

m025 . pep KAVPAPAP - - QSPAASPSGTRSVGGIVWQRPTQGKWADFGGMNKGVDI AGNAGQPVLAA 

IMIIIII M MM IMIMIM llllll IMMMIIIMMIMI II ill Ml III 

g025 KAVP APAPAPQS PAAS PSGTRS VGGI VWQRPTQGKWADFGGGNKGVDI AGNAGQPVLAA 

200 210 220 230 240 250 

270 280 290 300 310 320 

mO 2 5 . pep ADGKW YAGS GLRG YGNLV I IQHNSS FLTAYGHNQKLLVGEGQQV KRGQQVALMGNTD AS 

lllllllllllllllllllllllllllllllllllll IIIIIIIIIIIIMIIIIIIII 

g02 5 ADGKWYAGSGLRGYGNLVI IQHNSSFLTAYGHNQKLLVGEGQOVKRGQQVAIiMGNTDAS 

260 270 280 290 300 310 



330 340 350 

m025.pep RTQLH FE VRQNGKPVN PNS Y I AFX 
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RTQLHFEVRQNGKPVNPNSYIAFX 
320 330 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 13>: 

g031 . seq 

1 ATGGTGTCCC TCCGCTTCAG ATTCGGCAAC CACTTTAAAC GCCGACATTC 

51 TGACAATTTC CTTTTCCGCC AGCCAAATAT CATGCGTATC TTTCGGTTCG 

101 GGCTTGTTGG GCATGGCAAC CTTCAACAGC CGCGCCATCA CAGGAATCGT 

151 CGTTCCCTGA ATCAGCAGCG ACAGCACCAC CACGGCAAAC GCCACATCAA 

201 ACAGCAGGTG CGAATTGGGA ACGCCCATCA CCAGCGGCAT CATCGCCAGC 

251 GAAATCGGTA CGGCTCCTCG CAAGCCCAAC CAACTGATAT ACGCCTTTTC 

301 ACGCAGGCTG TAATTGAATT TCCACAAACC GCCGAACACT GCCAGCGGAC 

351 GCGCGACCAG CATCAGGAAC GCCGCAATCG CCAAGGCTTC CGCCGCCCTG 

4 01 TCCAACACGC CGG CGGGAGA AACCAGCAGA CCGAGCATGA CGAACAAAGT 

4 51 TGCCTGCGCC AGCCAAGCCA AACCGTCCAT CACACGCAAA ACGTGTTCCG 

501 TcgcACGGTT GCGCTGGTTA CCGACAATGA TGCCGGCAAG GTAAACCGCC 

551 AAAAAGCCGC TGCCGCCTAT GGTATTGGTA AACGCAAACA CAAGCAGCCC 

601 GCCCGACACA ATCATCAGCG CGTACAGACC TTCCGtacac acctccaatt 

651 cccaatcaac gtcatagctg tctcccgtgt taaaatgttc ttcacttcag 

701 aatccccccc ttcttcccag cccgaaacct tcatgtgtta naccctgggg 

751 tgccccaacg gatttag~aa cctcccaatg actctgcttg tcgccccctt 

801 cgcccgcttt ctccttccgg gaaaacttgt tgtccccgtc ttacattaa 

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 031.ng>: 

g031.pep 

1 MVSLRFRFGN HFKRRHSDNF LFRQPNIMRI FRFGLVGHGN LQQPRHHRNR 

51 RSLNQQRQHH HGKRHIKQQV RIGNAHHQRH HRQRNRYGSS QAQPTDIRLF 

101 TQAVIEFPQT AEHCQRTRDQ HQERRNRQGF RRPVQHAGGR NQQTEHDEQS 

151 CLRQPSQTVH HTQNVFRRTV ALVTDNDAGK VNRQKAAAAY GIGKRKHKQP 

201 ARHNHQRVQT FRTHLQFPIN VIAVSRVKMF FTSESPPSSQ PETFMCXTLG 

251 CPNGFSNLPM TLLVAPFARF LLPGKLW PV LH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 15>: 

m031 . seq (partial) 

1 . . .CGCCTGAAGC ACGGTGTCGG ACTGCATTTC TATTCGGCTA TACGCCTTTT 
51 CACGCAGGCT GTAATTGAAT TTCCACAAAC CGCCGAACAC TGCCGACGGA 

101 CGCGCGACCA GCATCAGGAA CGCCGCAATC GCCAAgGCTT CCGCCGCCCT 

151 GTCCAACACG TTGGCAGGAG AAACCAGCAG CAAAGGCATT CCCAAACGTG 

201 CGGACAAAGT GGTCGAAACC ACGCTCAGAA ACAACAGTGC GCCACCCGGC 

251 AG 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 03 1>: 

m031 . pop (partial) 

1 . . .RLKHGVGLHF YSAIRLFTQA VIEFPQTAEH CRRTRDQHQE RRNRQGFRRP 

51 VQHVGRRNQQ QRHSQTCGQS GRNHAQKQQC ATRQ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

a031.seq 

1 ATACGCCTTT TCACGCAGGC TGTAATTGAA TTTCCACAAA CCGCCGAACA 

51 CTGCCGGCGG ACGCGCGACC AGCAT CAGGA ACGCCGCAAT CGCCAAGGCT 

101 TCCGCCGCCC CGTCCAACAC GTTGG CAGGA GAAACCAGCA GCAAAGGCAT 

151 TCCCAAACGT GCGGACAAAG TGGTCGAAAC CACGCTCAGA AACAACAGTG 

201 CGCCACCCGG CAG 

This corresponds to the amino acid sequence <SEQ ID 118; ORF 03La>: 

a031 . pep (partial } 

1 IRLFTQAVIE FPQTAEHCRR TRDQHQERRN RQG FRRPVQH VGRRNCQQRH 

51 SQTCGQSGRN HAQKQQCATR Q 
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n03i/a03i 100.0% identity over a 71 aa overlap 

10 20 30 40 50 60 

m031 . pep RLKHGV G LH FY SAI RLFTQAV IE FPQTAEHCRRTRDCHQERRNRQG FRRPVQHVGRRNQQ 

M I I I I I I I II I I I I I I I I I I 1 1 I I M I I I II 1 1 I I I I I 1 1 | | 1 1 | | 
a031 I RL FTQAV IE FPQTAEHCRRTRDCHQERRNRQG FRR PVQHVGRRNQQ 

10 20 30 40 



70 80 
m031 . pep QRHSQTCGQSGRNHAQKQQCATRQ 
I M 1 1 I 1 1 I I I I I I 1 1 I II I M 1 1 
a 0 3 1 QRH S QTCGQS GRNH AQKQQCAT RQ 

50 60 70 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 03 1 shows 60.0% identity over a 85 aa overlap with a predicted ORF (ORF 03 1 .ng) 
from N. gonorrhoeae: 
m031/g031 

10 20 30 

m031 .pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEH 

I : : I = : I I I I i I I I I i I I I I I 1 I 
g031 NQQRQHHHGKRHIKQQVRIGNAHHQRHHRQRNRYGSSQAQPTDIRLFTQAVIEFPQTAEH 
60 70 80 90 100 110 

40 50 60 70 80 

m031.pep CRRTRDQHQERRNRQGFRRFVQHVGRRNQQQRHS-QTCCQSGRNHAQKQQCATRQ 
I : I I I I I I I I i I I I I I I I I I I i I : I I I I I : I : I : I : : : : : : I : I : 
g031 CQRTRDQHQERRNRQGFRRFVQHAGGRNQQTEHDEQSCLRQFSQTVHHTQNVFRRTVALV 
120 130 140 150 160 170 



g031 



TDNDAGKVNRQKAAAAYGIGKRKHKQPARHNHQRVQTFRTHLQFPINVIAVSRVKMFFTS 
180 190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 19>: 

g032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTA TTGCGCCGCC CACGATTCGA 

51 GGCGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAAGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGACGC TGCTTGCGCC 

201 CTTTGCCGGT AACGTGTACC CACGCTTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGCTC 

301 GAACAGCGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAACAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

401 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGCGCATCAG 

451 CCCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCACGCC GACAGCTTGC 

501 GCGCCAGCGT CCGACCGTCC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GccgTAAAAT CGCGCCGCGA CAAGTCCTGC GGCACGCcgc CtgcaTCTTC 

601 AGACGGCATT TGTGCCAACA GTGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCAAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATacgccgC AAACccgTCC AAAACCATAA CCGTCCCACA 

751 CAAATATCAA AAAACCAGTG A 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 032.ng>: 

g032.pep 

1 MRRNVPAVAV LRRPRFEAFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLTLLAPFAG NVYP RFVQIY IICIQAVYLA HA OTAAVHQL 
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101 EQRWAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGAHQ 

151 PAFDQPGAIL PPRRQLARQR PTVQTALRQP PQRRRKIAPR QVLRHAACIF 

201 RRHLCQQCKQ FFQIAPVCRN RVLRLALAHD VFQISVKIRR KPVQNHNRPT 

251 QIGKNQ* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 ll>: 

m032 . seq (partial) 

1 ATGCGGCGAA ACGTGCmTGC raGTCGCCGTT kTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGACCAGCGG CACCTGCCGC TgTT . GCGCC 

201 CTTTGCCGAT AAcGTGTACC CACGCyTCGT CCAAATAGAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGGGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGTGCATCAG 

451 GCCGCGCTTT ACCAGCCAAA CGCAATACTG CCGCCAAGAC GAAAGCTTGC 

501 GAGCCAGCGT CCGTTCCCCC AAACCGCG. . . 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 032>: 

m032.pep (partial) 

1 MRRNVXAVAV XRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFADQR HLPLXAPFAD NVYPR XVQID IICIQAVYLA HA QTAAVHQF 

101 EQGWAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGVHQ 

151 AALYQPNAIL PPRRKLASQR PFPQTA. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

a032. seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTT TTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGCCGC TGCTTGCGTC 

201 CTTTGCCGGT AACGTGTACC CACGCCTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGCGCG TGATCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG TATGCAGCAG 

4 51 ACCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCAAGAC GACAGCTTGC 

501 GCGCCAGCGT CCGCGCATTC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GCCGTAAAAT CGCGCTGCGA CAAGCCCTGC GGCACGCCGC CTGCATCTTC 

601 AGACGGCATT TGTGCCAACA GCGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCCAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AAT GCGCCGC AAACCCGTCC AAAACCATAA CCGCCCCACA 

751 CAAATATCAA AAAAACAGTG A 



This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 032.a>: 

a032.pep 

1 MRRNVPAVAV LRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLPLLASFAG NVYP RLVQIY IICIQAVYLA HA QTAAVHQF 

101 EQRVIAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGMQQ 

151 TAFDQPGAIL PPRRQLARQR PRIQTALRQP PQRRRKIALR QALRHAACIF 

201 RRHLCQQRKQ FFQIAPVCRH RVLRLALAHD VFQISVKMRR KPVQNHNRPT 

251 QISKKQ* 



m032/a032 88. 1 % identity over a 1 76 aa overlap 



10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR 
Mill I I I I I I I I I I I i I I I I I I I I I I II I II | | | | | | | | | | | | M | | | | | | | || !| 
a032 MRRNVPAVAVLRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m032 . pep HLPLXAPFADNVYPRXVQIDI I C I QA V Y LAHAQT AAV H Q FEQG V V AH RQR VAA VHG QI QH 

: I I I I I I Mill ill I I I I I I (I I I I I I I I I I I I I I ; I : I I I I I I I | I | I | | | | 
a032 NLPLLASFAGNVYPRLVQIYIICIQAVYLAHAQTAAVHQFEQRVIAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 ISO 1.60 170 

m032 . pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
I I I I I I I I II I I I I I I II I I I I I I I I I : : I : I : I I : I I I I I I I : I I I I I III 
a032 PVQPFLRQGFGYALGLLRRFDVGGRVGMQQTAFDQPGAILPPRRQLARQRPRIQTALRQP 

130 140 150 160 170 180 

a032 PQRRRKI ALRQALRHAACI FRRH LCQQRKQFFQIAPVCRHRVLRLALAH DVFQI S VKMRR 

190 200 210 220 230 240 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 032 shows 86.4% identity over a 176 aa overlap with a predicted ORF (ORF 032.ng) 
from N. gonorrhoeae: 

*032/g032 



10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQG FAVRCP.LTQRQI VFQG FHAFADQR 

I I I I I I I I I III : : I I I I I I I I I I I I I I I I I I I I I I I I I || I | | | | | || | || | || 
g0 3 2 MRRNVPAVAVLRR PRFEAFL DLALAQARAV PAGKQG FAVRCRLTQRQI VFQGFHAFAGQR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m032.pep HLPLXAPFADNVYPRXVQIDI ICIQAVYLAHAQTAAVHQFEQGVVAHRQRVAAVHGQIQH 

: I i I I I I Mill III I I I I I I I I I I I I I I I I I I I : I I I I II I I i ! I I I I I I I I I 
g032 NLTLLAPFAGNVY PRFVQI Y 1 1 CIQAVY LAHAQT AAVHQLEQRVVAHRQRVAAVHGQI QH 

70 80 90 100 110 120 

130 140 150 160 170 

m032 . pep PVQ P FLRQG FG Y ALG L LRR FDVGG R VG VHQ AAL YQ PN AI L P PRRKL ASQR P F PQT A 

I I I I I I I I I I I I I I I I I I I I M I I I I I : I I I : I I : I I I I I I I : I i III III 
gC32 PVQ P FLRQG FG Y ALG L LRR F D VGG R VG AHQ P A FDQPG A I L P P RRQL A RQR PT VQT ALRQ p 

130 140 150 160 170 180 

g032 PQRRRKIAPRQVLRHAACIFRRHLCQQCKQFFQIAPVCRNRVLRLALAH DVFQI SVKIRR 

190 200 210 220 230 240 



The following partial DNA sequence was identified in M. gonorrhoeae <SEQ ID 1 1 5>: 

g033 . seq 

1 ATGGCGGCGG CGGACAAACT CTTGGGCGGC GACCGCCGCA GCGTCG CCAT 

51 CATCGGAGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGTTG CCCAAATATC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAAcgg 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGagtTTGC CCAAAAAGTC 

301 GAACAcaaaA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 GCTGTCGCTG TTTGAAAATT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

401 GACACAACGT CGAGAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 

451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGACCCcg tcaAATACCA CGCCGTCGCc aACCTGCcta 

551 AAGAAGGCGG GGCGCAAATg ccGTCTGAAA AAGAACCCAA GCCCGCCgCc 

601 aaaccgACCT ATACCCAAGT ATTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 



WO 99/57280 



PCT/US99/09346 



213 



701 GACTGGTGGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

751 ATCGCCGAGC AGCACGCCGT tacCTTTGCC GGCGGTTTGG CGTGCGAAGG 

801 CATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGTG CGGGCATCGT CGGCGCGGAC GGTCCGACCC ATGCCGGCTT 

951 GTACGATTTG AGCTTCTTGC GCTGTGTGCC GAACATGATT GTTGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCG 

1051 GATGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCC TTcatTGCCT TCGGCAGTAT GGTCGCCACC 

1201 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTt 

1251 cgtcaaacCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCAcg 

1301 aCCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

1351 GCGGTCTTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

1401 TTTGGGCGTT GCCGATACCG TAACCGAACA CGGCGATCCG AAAAAACTTT 

14 51 TGGACGATTT GGGTTTGAGT GCCGAAGCGG TGGAACGCCG GGTGCGCGAG 

1501 TGGCTGCCGG ACCGTGATGC G3CAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 033.ng>: 

g033 .pep 

1 MAAADKLLGG DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVtiDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSbSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKEGGAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVE7EQ RFPDRYFDVG 

251 IAEQIiAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

3 51 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGE KTA FIAFGSMVAT 

401 ALAVAE KLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

451 AVLEVLAKHG ICKPVLLLGV ADTVTEHGDP KKLLDDLGLS AEAVERRVRE 

501 WLPDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

m033 . aeq 

1 ATGGCGGCGG CAGACAAACT CTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCaG.CGA TATGGATGTr GATTTGCTrG TCGTCCTCAA C3ACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGCTG CCGAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GCCTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGG CGCGA TGGAGTTTGC CCAAAAAGTC 

3 01 GAACACAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 
351 GCTGTCTTTG TTTGAAAACT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 
4 51 AAAGG CCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 
501 ACTCGCCGAA AACGACCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 
551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 
601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
7 01 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 
751 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 
801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 
851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTACCCGT TTTGTTTGCC 
901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTCT 
951 GTACGATTTG AG CTTTTTGC GCTGCGTGCC GAACATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGTTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCC 

12 01 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 
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1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAG3CGGC 

1351 GCGGTGCTGG AAGTATTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

1401 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

1451 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 118; ORF 033>: 

m033 .pep 

1 MAAADKLLGS DRRSVAIIGD GAMTAGOAFE ALNCAXDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI P GAME FAQ KV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEOHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAP AAVRY PR GTGTGAPVSD GMETVEIGKG IIRREGE KTA FIAFGSMVAP 

4 01 ALAVAEKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDD LGLS AEAVERRVRA 

501 WLSDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 19>: 

a033 . seq 

1 ATGGCGGCGG CGGACAAACA GTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGTCA GGCGTTTGAA GCCTTGAACT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGTGCGTTG CCCAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACATAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 ACTGTCTTTG TTTGAAAACT TCGGCTTCCG CTATACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTCGATGTAT TGGAAGACCT GCGCGGACGC 

4 51 AAAGGCCCGC AGCTTCT5CA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGATC CCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 

551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 

801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTTT 

951 GTACGATTTA AGCTTTTTGC GCTGCATTCC GAATATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAATGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGG3TA CGGGCGTGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCT 

1201 GCATTGGCGG TCGCCGGAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCAGC 

1351 GCGGTGCTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTCTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

1451 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 120; ORF 033.a>: 

aQ33 .pep 

1 MAAADKQLGS DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLEDLRGR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 
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301 V DRAG I VGA D GP7HAGLYDL SFLRCIPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGVPVSD GMETVEIGKG IIRREGEK TA FIAFGSMVAP 

4 01 ALAVAGKLNA TVADMRFVKP IDEEL1VRLA RSHDRIVTLE ENAEQGGAGS 

451 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 

501 WLSDRDAAN* 

m033/a033 98.4% identity over a 509 aa overlap 

10 20 30 40 50 60 

m033 .pep MAAADKLLGS DRRS VAI I G DGAMTAGQAFEALNCAXDMDVDLLWLN DNEM5 1 S PNVGAL 

I I 1 I I I I I I I I I I I I I I I I I I I I 1 1 I I M I II I I I I II I I I I I I I I I I I | | | | | | | | | 
a033 MAAADKQLGS DRRS VAI I GDGAMTAGQAFEALNCAGDMDVDLLVVLNDNEMS IS PNVGAL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m033 . pet> PKYLASNVVRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I II I I I I I I | | | | 
a 033 PKYLASNVVRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m033 . pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
I I I I I I I I I I M I I I It I I I I I I I : I II : I I I I t I I I I I I I I I I I I I I I I I I I II I I ! I I 
a033 FENFGFRYTGPVDGHNVENLVDVLEDLRGRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m033.pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 
I I I I I I I I I I I I I I I I II I I I I II I I I I I I II I I I I I I I I I I I I I I M I II I I I I I I I I | 
a033 NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

m033 . pep R F P DR Y FD VG I AEQHAVT FAGGL ACEGMK P W A I Y S T FLQRA Y DQL VH D I ALQN L P VLFA 

I i I I 1 1 I 1 1 I 1 1 I I I I II 1 1 I II 1 1 1 I II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I I I 1 1 ! 1 1| 1 1 1 1 1 

a033 RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 
250 260 270 280 290 300 

310 320 330 340 350 360 

m033 . pep V DRAG I VG ADG PTHAG L Y DL 3 FLRCV PNM I V AA P S DEN E CRL L LS T C YQ ADAP AA VR Y PR 

I I I I II I I I I I I I I M I II I I I I I I : I I I I I I II I I I ! I I I I I | I | M I I I ! II I I I I I I 
a033 VDRAGIVGADGPTHAGLYDL3FLRCIPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

m03 3 . pep GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 
I I I I I : I I I I I I I I I I I I I I II I II I I I I II I I I I II I I I I I I I I || | | | | | | | | | | | | 
a033 GTGTGVPVSDGMETVEIGKG I IRREGEKTAFIAFGSMVAPALAVAGKLNATVADMRFVKP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m033 . pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 
I I II I I I I I M I I II I I I II I I II I I I I I: I I II I I 1,1 III || (H M I I I I I I I II I II 
a033 IDEELIVRLARSHDR3VTLEENAEQGGAGSAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

430 440 450 460 470 480 

490 500 510 

m033 . pep KKLL DDLGLSAEAVERRVRAWLS DRDAANX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 0 3 3 KKLL DDLGLSAEAVERRVRAWLS DRDAANX 

490 500 510 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 033 shows 98.4% identity over a 509 aa overlap with a predicted ORF (ORF 033.ng) 
from N. gonorrhoeae: 

m033/g033 



m033 .pep 
g033 

m033 .pep 
g033 

m033 -pep 
g033 

m033 -pep 
g033 

m033 .pep 
g033 

m033 .pep 
g033 

m03 3 .pep 
g033 

m03 3 . pep 
g033 

m033 .pep 
9033 



MAAADKLLGSDRRSVAI IGDGAMTAGQAFEALNCAXDMDVDLLWLNDNEMS ISPNVGAL 

IIIMIIIhllllllllllllllMIIIIIIMI II Illlll IIIIIIIIMI 

MAAADKLLGGDRRSVAI I GDGAMTAGQAFEALNCAGDMDVDLLVVLNDNEMS I S PNVGAL 



FEN FGFRYTGP VDGHNVENLVDVLKDLRSRKGPQLLHV I TKKGNGYKLAEND P VKYHAVA 

IIHIIIIIIIIIIIMIIIIIIIIIinilMIIIIIIIIIIIIMIIIIIIIIIIIII 

FENFGFRYTGPVDGHl^ENLVEVLKDI^SRKGPOLLHVITKKGNGYKLAENDPVKYHAVA 
N L P KE S AAQMP S EKE P KP AAKP TYTQVFGKWL CD RAAADS RLVA I TP AMREGS GLVE FEQ 

1 1 1 1 1 : : 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

NLPKEGGAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 
RFPDRYFDVGIAEQHAVTFA3GLACEGMKPWAIYST7LQRAYDQLVHDIAL0NLPVLFA 

II I M III MM III IMMMIMM MM III III Illlll I III MUM II III 

RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 
VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

1 1 1 1 f i 1 1 1 f 1 1 1 1 1 f 1 1 f I i 1 1 r 1 1 1 1 1 1 1 1 i 1 1 1 1 1 r i e 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 f 1 1 1 

VDRAG I VGADG PTHAGLYDLS FLRCVPNMI VAAPSDENECRLLLSTCYQADAPAAVRYPR 
GTGTGAP VSDGME TVE I G KG 1 1 RREGE KTAF I AFGSMVAPALAVAE KLNATVADMRFVKP 

lllllllllllllll M MMIIM M I Ml Ml III I I i f 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 r I 

GTGTGAP VSDGMETVEIGKG 1 1 RREGE KTAF I AFGSMVATALAVAE KLNATVADMRFVKP 



60 



60 



120 



PKYLiASNWRDMHGLLSTVKAOTGKVLDKI PGAMEFAOKVEHKI KTLAEEAEHAKQSLSL 

1 1 1 MM Ml Ml I M MM Ml MUM II I M I II 1 1 1 M 1 1 1 1 1 1 IIIIIIIIMI 

PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAEEAEHAKQSLSL 12 0 



180 



ieo 



240 



240 



300 



3 00 



360 



360 



420 



420 



I DEEL I VRLARSHDR I VTLEENAEQGGAGGAVLEVLAKHG I CKPVLLLGVADTVTGHGDP 4 80 

1 1 i i r 1 1 1 1 1 f p 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r r 1 1 1 j i r 1 1 r 1 1 1 1 1 1 mm 

I DEELI VRLARSHDR I VTLEENAEQGGAGGAVLEVLAKHG I CKPVLLLGVADTVTEHGDP 480 



510 



KKLLDDLGLS AEAVERRVRAWLS DRD AANX 

MMMMMIMM I || Mill 

KKLLDDLGLS AEAVERRVRENLPDRDAANX 510 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121>: 

g034.seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTA7CC ATGCGCCAAC 

101 TGCTTGACCA CGCCGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGC AAA TACGcggGCG 

2 51 CGCCGTTTTT GCGCCACCTG ATTCTGGCGG CAGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTgtgCCA 

351 ACGCTCCATC CAACTGGGCT TCTCCTCCGT GATGATGGAC GG CTCTTTGC 

4 01 TCGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACC 

4 51 CGTACCGTCG TCAACTTCTC CCACGCCTGC GGCGTGTCCG TCGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGAAG CAGGC GAAGAAGACG 

551 GAGTGGGCGC GGCAGGCAAA CTCTCACACG ACCAAATGCT CACCAGCGTT 

601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 

651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 
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701 


GCGACGTATT 


GCGTATCGAC 


751 


AATACACACA 


TCGTGATGCA 


801 


GAAAGTCATC 


AACGAATACG 


851 


CGGTTGAAGA 


AATCGTCGAA 


901 


ATCGATACCG 


ACCTGCGCCT 


951 


TGCCGAAAAC 


CCGTCCGACT 


1001 


TTGAAGCGAT 


GAAGCAAATC 


1051 


GAAGGTCAGG 


CAGGCAAAAT 


1101 


CCGTTATGCC 


AAGGGCGAAT 



CGCATCAAGG AAATCCACCA AGCCCTGCCC 
CGgctCCAGC TCCGTTCCGC AAGAatgGCT 
GCGGCAATAT CGGCGAAACC TACGGCGTGC 
GGCATCAAAC ACGGCGTGCG CAAAGTCAAC 
CGCTTCCACC GGCGCGGTAC GCCGCTACCT 
TTGATCCGCG CAAATACTTG GGCAAAACCA 
TGCCTCGACC GTTATCTTGC GTTCGGTTGC 
CAAACCTGTT TCGTTGGAAA AAATGGCAAG 
TGAACCAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 122; ORF 034.ng>: 

g034 .pep 

1 MSRLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHAA E NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 IPWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLLEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAAGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL GKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMASRYA KGELNQIVK* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 123>: 

m034.soq (partial) 

1 ATGAGCTGTT TATGGTTTTT TGCTGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA wACAGCTACG GCyTGCCGGC GTTCAACGTC 

151 AACAACCTCG wACAGATGCG CGCCATCATG GAGGCTGCAG ACCAAGTCGA 

201 CGCCCCCGTC ATCGTACAGG CGAGTGCCGG TGCGCGCAAA TATGCGGGTG 

251 CGCCGTTTTT ACGCCACCTG ATTTTGGCGG CTGTCGAAGT ATTTCCACAC 

301 ATCCCCGTCG TCATGCACCA AGACCACGGC GCATCACCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TCTCCTCTGT AATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACA 

451 CGTACCGTGG TTAACTTCTC CCACGCTTGC GGCGTATCCG TTGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGATGCAGGC GAAGAAGACG 

551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT GACCAGCGTC 

601 GAAGATGCCG TATGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCTAT 

651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 

701 GCGATGTATT ACGTATCGAC CGCATCAAAG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA C. . . 



This corresponds to the amino acid sequence <SEQ ID 124; ORF 034>: 

m034.pcp (partial) 

1 MSCLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHAA E XSYGLPAFNV 

51 NNLXQMRAIM EAADQVDAPV IVQASAGARK YAGAPFLRHL ILAAVEVFPH 

101 IPWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGDAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVCFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMH... 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 125>: 

a034.seq 

1 ATGAGCCGTT TATGGTTTTT T3CCGCAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGCGGGCG 

251 CGCCGTTTTT GCGCCACCTG ATTTTGGCGG CTGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TTTCCTCCGT GATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTATG AATACAACGT CAACGCCACC 
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4 51 CGTACCGTGG TTAATTTCTC CCACGCCTGC GGCGTATCCG TTGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC TCGAAACTGG CGAAGCCGGC GAAGAAGACG 

551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT CACCAGCGTC 

601 GAAG AT GCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 

651 TGCCGTCGGC ACCAGCCACG GCGCGTACAA ATTCACCCGT CCGCCCACAG 

701 GCGACGTGTT GCGTATCSAC CGCATCAAAG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA CGGCTCCAGC TCCGTTCCGC AAGAATGGCT 

801 GAAAGTCATC AACGAATACG GCGGCAATAT CGGCGAAACC TACGGCGTGC 

851 CGGTTGAAGA AATCGTCGAA GGCATCAAAC ACGGCGTGCG TAAAGTCAAC 

901 ATCGATACCG ACTTGCGCCT TGCTTCCACC GGCGCGGTAC GCCGCTACCT 

951 TGCCGAAAAC CCGTCCGACT TCGATCCGCG CAAATATTTG AGCAAAACCA 

1001 TTGAAGCGAT GAAGCAAATC TGCCTCGACC GCTACCTCGC GTTCGGTTGC 

1051 GAAGGTCAGG CAGGCAAAAT CAAACCGGTT TCCTTGGAAA AAATGGCAAA 

1101 CCGTTATGCC AAGGGCGAAT TGAACCAAAT CGTCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 126; ORF 034.a>: 

a 03 4 .pep 

1 MSRLWFFAAK NIIIRLIYLL PKETQMALVS MRQLLDHA AE NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV I VQ AS AG ARK YAGAP FLRHL ILAAVEEFPH 

101 IPVVMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTVVN FSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL SKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMANRYA KGELNQIVK* 

m034/a034 96.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

m034 .pep MSCLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFWNNLXQMRAIM 
It I M I I : I I I I I I I I II I I I I I I I I I I I I I I I i I I I I I M I I I I I I I I I I | | | | | | 
a034 MSRLWFFAAKNIIIRLIYLLPKETQMALVSMRQLLDHAAENSYGLPAFNVN>3LEQMRAIM 

10 20 30 40 50 60 

70 80 90 100 110 120 

ra034 . pep E AA DQ V DA P V 1 VQAS AG ARK Y AGA P FLRHL I LAA V E V F PH I PWMHQ DHGAS P D V CQR S I 

I I I I I I : I I I I I 1 I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | 
a034 EAADQVNAPVIVQASAGARKYAGAPFLRHLILAAVEEFPHIPWMHQDHGASPDVCQRSI 

70 80 90 100 110 120 

130 140 150 160 170 100 

ra034.pep QLG FS S VMM DG S LME DG KT PS S YE YNVN AT RT WN FS HAC G VS VEG E I G VLGN LETG DAG 

I I I I I I I I I I I II I I I I I M I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I | | | | |: | | 
a034 QLGFSSVMMDGSLMEDGKT PSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGEAG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m034 .pep EE DGVGAVGKL S HDQMLTSVEDAVCFVKDTGVDALA I AVGTSHGAYKFTR PPTGDVLRID 

I I II I I I t I I I I I I I I I I I I I I I I II I I I I I f I I I I I I I t I I 1 i I I I I I I I II I I I ! I | 
a034 EEDGVGAVGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

190 200 210 220 230 240 

250 

m034 .pep RIKEIHQALPNTHIVMH 
I I I I I I I I I I I I I I I I I 

a034 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 
250 260 270 280 290 300 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 034 shows 96.5% identity over a 257 aa overlap with a predicted ORF (ORF 034.ng) 
from N. gonorrhoeae: 
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m034/g034 

m034 .pep 
g034 

m03 4 .pep 
9034 

m034 -pep 
g034 

m034 .pep 
g034 

m034 .pep 
g034 



MSCLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 

II IIIIMIIIIIIIIIMIIIIMIIIIIIIIMIMI IIIMIMIMI llllll 

MSRLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 



QI^FSSVMMDGSLMEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGDAG 

IIIIIIMIIIIhllllMIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIhll 

QLGFSSVMMDGSLLEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVIjGNLETGEAG 



RIKEIHQALPNTHIVMH 

II III llllll Mill I 

RI KEIHQALPNTHI VMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEI VEGI KHGVRKVN 



60 



60 



120 



EAADQVDAPVI VQASAGARKYAGAPFLRHLILAAVEVFPHI PWMHQDHGASPDVCQRS I 

1 1 M 1 1 H Ml MM Ml I 111 Ml II III 1 1 1 II I lllllllllllllllllllllll 

EAADQVNAPVIVQASAGARKYAGAPFLRHLILAAVEEFPHIPVVMHQDHGAS PDVCQRS I 120 



180 



180 



240 



EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

I Millie III III II II II 1 1 II Mill lllllllllllllllllllllll 1 1 III II 

EEUGVGAAGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLR1D 240 



257 



300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 127>: 

g036.seq 

1 ATGCTGAAGC CGTGTTTGGT ATACAGTGCC TGTGCGGCGG cgttgcCTGC 

51 GCGGACTTCG AGCAGCAGGC GTTGCGTGCC TTCGGGCAGA TGTGCGTACC 

101 AATATTCGAG CAGGGCGGAC GCAACGCCCC GTCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCTGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

2 51 GCGAAACAAG CGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGTAT CGAGCGCGGC CAGTGCGGCG CAGTCGGACG GTGAGGCTGG 

351 GCGGATGTTC ATGTTCGTGC CTTCCGTTCC GCCTGTTCTT TGGCAGTCAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAGTTC GGCGTGTGCC GCGCCAGTTG 

4 51 CGGGATAGCC GCCGCCGAGG GCGAGCGCGA GAAAATCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGTG CGAACGCACT 

551 GCCGATGCCG TCTGAAAAGA CGTACCCCTC GGGGAGGGCA ATGTCTGCCG 

601 CCCTACCGAC TTGATAATCG CTCAAACGGC GGCGGT TCAG CGTGTCGAAC 

651 CACGCATAAA ACACTTCGCC CATACGCGCG TCCGCAGCGG CGAGTATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGTG GGGATGCCGA 

751 TTAAAGGCGT GTCGAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 128; ORF 036.ng>: 

g036.pep 

1 MLKPCLVYSA CAAAL PARTS SSRRCVPSGR CAYQYSSRAD ATPRRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA ILPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTVSSAASAA QSDGEAGRMF MFVPSVPPVL WQSGRFCCGR RAVRRVPRQL 

151 RDSRRRGRAR ENRRRSAYRV CLRRADGFPV RTHCRCRLKR RTPRGGQCLP 

201 PYRLDNRSNG GGSACRTTHK TLRPYARPQR RVCSFAAAAA RRRHRAWGCR 

251 LKACRTALPN LAPRRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 129>: 

m036 . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCg.CGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGC3GC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 

401 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG 

4 51 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 

501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGAGCG TCCGTAGCGG CAAGGATGCA 

701 GCTTTGCGGC GGCGGCA3CG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

751 TTAAGGGGGT ATCAAAC3GC GTTGCCAAAC CCTGAGCTAC ACCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 130; ORF 036>: 

m036.pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPXSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GS3AYRTMHK TLRPYERP*R QGCSFAAAAA RRRHRARVRR 

251 LRGYQTALPN PELHRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 131>: 

a036.seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGCAT CGAGCGCGGC GAGTGCGGCG CAATCGGCAT AAACGGCGCG 

351 GCGGATGTTC ACAGGCGCGC CCTCCGTTCC GCCTGTTCTT TGGCAGTCAA 

401 GGCGATTTTG TTGCGGACGT AGAGCAGCTC GGCGTGTGCC GCAGCGACGG 

4 51 CGGGAAAACC GCCTTCAGCC GCCAGATTGA GGAAGTCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGCG CGAACGCATT 

551 GCCGATGCCG TCTGAAAAGG CGCATCCTTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGACCGAC CTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGTGCG TCCGCAGCGG CAAGGATGCA 

7 01 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

751 TTAAAGGAGT ATCAAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 1 32; ORF 036.a>: 

a036.pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASAA QSA*TARRMF TGAPSVPPVL WQSRRFCCGR RAARRVPQRR 

151 RENRLQPPD* GSRRRSAYRV CLRRADGFPA RTHCRCRLKR RILPAAGCLP 

201 PDRPDNRSNG GGSACRTMHK TLRPYVRPQR QGCSFAAAAA RRRHRARVRR 

251 LKEYQTALPN LAPRRCRYAV P* 

m036/a036 85.6% identity over a 270 aa overlap 

10 20 30 40 50 60 

m036 . pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
a036 MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGA VAIRCSSDSS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m036.pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I t I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I | | | | | M I I I :: I I I I I I I I I 
• a 03 6 GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASAAQSAXTARRMF 

70 80 90 100 110 120 
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130 14C 150 160 170 160 

m036 . pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
III II II I I I I I I I I I I I I III : I : : I I I I I I : I I | I I | | | : : | | 
a036 TGAPSVPPVLWQSRRFCCGRRAARRVPQRRRENRLQPPDXGSRRRSAYRVCLRRADGFPA 

130 140 150 160 170 160 

190 200 210 220 230 240 

m036.pep RTRCLCRLKRRIPPAAGCLFPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
11:1 I I I I I I I I I I I I I I I 111111111:11 I I I I I I I I I I || ( I | I | | | | M I 
a036 RTHCRCRLKRRILPAAGCLPPDRPDNRSNGGGSACRTMHKTLRPYVRPQRQGCSFAAAAA 

190 200 210 220 230 240 

250 260 270 

m03 6 . pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 

I I I I I I I M M : I I I I I I I : I I I I I I 
a 0 3 6 RRRHRARVRRLKE YQTALPNLAPRRCRYAVPX 

250 260 270 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 036 shows 74.9% identity over a 271 aa overlap with a predicted ORF (ORF 036.ng) 
from N. gonorrhoeae: 

m036/g036 

10 20 30 40 50 60 

MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I M I I I I I I I I I : I I I I I I I I I I I I Mil: I I I I 1 I I I I I I | | | | | | | | | | | I [ | 
MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 

70 80 90 100 110 120 

GRFCQTIKAAI PXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I : I I I | | : : I I | | ] j 
GRFCQTIECAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 
70 80 90 100 110 120 

130 140 150 160 170 180 

TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: II I I I I 1 I I I ! I I I I I I III : :|:| lUlhlll Mil I: :|: 

MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 
130 140 150 160 170 180 

190 200 210 220 230 240 

RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTI4HKTLRPYERPXRQGCSFAAAAA 
11:1 I I M I I I : : I I I |. I I I I I I I I : I I I I I I I I I I I ! I I : I I f I I j I | 
RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 
190 200 210 220 230 240 

250 260 270 

RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 
I II I I I II:: : I I I I I : I I I I I I I I 
RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 
250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 133>: 

m036-l.seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 



m036.pep 
g036 

m036.pep 
g036 

m036 .pep 
g036 

m036 .pep 
g036 

m036.pep 
g036 
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351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG 

4 51 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 

501 ATATCCGGTC TGCCTGA3AA CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGAGCG TCCGTAG 

This corresponds to the amino acid sequence <SEQ ID 134; ORF 0036-l>: 

m036-l .pep 

1 MLKPCAVYSA CAAVL PARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 
51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 
101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 
151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 
201 PARPDNRSNG GSSAYRTMHK TLRPYERP* 

m036-i/g036 76.8% identity in 228 aa overlap 

10 20 30 40 50 60 

MLK PC AVY S ACAAVL P ART S S S RRC VS S GRCVNQY S SRAD A I PWRRH S GAV A I RC S S DSS 
Mill I I I I I I I : I I I I I I I 1 I I I I I I I I: I I I I I | | | | t I I I ! I t t I I I I I I I I 
MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 

70 80 90 100 110 120 

GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I 1 1 I 1 I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I : t I M I :: I I I ill 
GRFCQTI KAAI LPS FSARKTCS DGETSADSNWRCVHADGLQTVS SAASAAQS DGEAGRMF 
70 80 90 100 110 120 

130 140 150 160 170 180 

TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: II I I i I I I I I I I I I I I I Ml : lt:||:|M ill |: :|: 

MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 
130 140 150 160 170 180 

190 200 210 220 229 

RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPX 
11:1 I II I I I i : : I I I I I I I I I I I I : I I II I I I I I I I I I 
RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 
190 200 210 220 230 240 

RRRHRAWGCRLKACR7ALPNLAPRRCRYAVRX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 135>: 

g038 .seq 

1 ATGACTGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTC AACGACGGCG CGTCCACGCT GCAACTGGCA 

151 AAATTCTATG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC GCGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG AC GTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCAATCAA AC T GATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

451 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTAAATTGT CCGCCGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATCCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GCACCTACCG CCGGCAGTAC GGCGTAGAAT AA 



m03 6-l .pep 
g036 

m036-l .pep 
g036 

m036-l .pep 
g036 

m036-l .pep 
g036 

g036 
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This corresponds to the amino acid sequence <SEQ ID 136; ORF 038.ng>: 

gC38.pep 



1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGASTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDRGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GKLSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRTYRRQY GVE* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 137>: 

m038 .seq 



1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAGGC AGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGTCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCcGC CGGTGTCGCC 

451 ATCGC GCTCG ATCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAr AAACAATACG GkCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 



This corresponds to the amino acid sequence <SEQ ID 138; ORF 038>: 

m038.pep 



1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYKRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 139>: 

a038.seq 



1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAAT7CA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTC?TT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA , TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

4 51 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

5C1 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

6C1 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 



This corresponds to the amino acid sequence <SEQ ID 140; ORF 038.a>: 

a038.pep 



1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF N3GLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

1C1 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 



m038/a038 



100.0% identity over a 213 aa overlap 



m038 



pep 



10 20 30 40 50 60 

MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
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1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 

a038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
10 20 30 40 50 60 

70 80 90 100 110 120 

iti038 . pep GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 

Minn iMiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiimiiiii 

a038 GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
70 80 90 100 110 120 

130 140 150 160 170 180 

m038 . pep 1 IDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIEIIIillllltlllilMII 
a038 IIDDVISAGTSVRESIKLIEAEGATPAGVAIAXDRMEKGTGELSAVQEVEKQYGLPVAPI 

130 140. 150 160 170 180 

190 200 210 

m038 .pep ASLN DLFI LLQNNPEFGQFLS PVRAYRRQYGVEX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
a 0 3 8 ASLN DLFI LLQNNPEFGQFLE PVRAYRRQYGVEX 

190 200 210 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 038 shows 98.1 % identity over a 213 aa overlap with a predicted ORF (ORF 038.ng) 
from N. gonorrhoeae: 

m038/g038 

10 20 30 40 50 60 

m038 . pep MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I M I I I I I I I I I I M I I I I I 
g038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGASTLQLAKFYAQSIIES 

10 20 30 40 50 60 

70 80 90 100 110 120 

m038 . pep G I R F DM L FG PAYKG 1 1 LAAAT AMMLAEKG VNV P F A YNR KE AKDHG E GG V LVG A PLKGRV L 

M ItlMllllilllllllll llillllllllllllll ||||[:|||| HUM III III 
g038 GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDRGEGGVLVGAPLKGRVL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m038 . pep IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 
I J I t I I 1 1 I I I I I 1 I 1 1 I I 1 1 1 1 I I 1 1 I 1 I f I 1 1 I 1 1 I 1 1 I : I I I t i I t ! I 1 1 1 I 1 1 I 1 1 
9038 1 1 DDVI SAGTSVRES I KLI EAEGATPAGVAIALDRMEKGTGKLSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

m038 . pep ASLN DLFI LLQNNPEFGQFLE PVRAYRRQYGVEX 

I I I I I I I I I I I I I I I I I I I I I I I I : I I I I IN I I 
g038 AS LN DL FI LLQNN PE FGQFLE PVRT YRRQ YGVEX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 141>: 

g03 9.seq 

1 ATGCCGTCCG AACCACCTGC CGCTTCAGAC GGCATCAAAC CGACACACAC 

. 51 CGAGAAAACA TCATGCCCGC CTGTTTCTGT CCGCACTGCA AAACCCGCCT 

101 CTGGGTCAAA GAAAcccagC TCAAcytCgC ccaagGCTTC GTCGTCTgcc 

151 aaAAAtgcga agGGCTgttt aaAgccaaaG accAtctggc aaGcacGAAA 

201 gaacctatat tcaacgattg gcccgaagct gtttcgggat gTcaaaCTCG 
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251 TCcaccgcaT cggcacgcac gccattagca aGAaacagat gtcccgcgac 

301 gaaatCgccg atatcctcaa cggcggtaca acCCTGCACG ATACGCCGCC 

351 CGCAACCGCC GCTGCCGCac ctGCCGCCGC ACCGCaggTT TCCGTACCGC 

401 CCGCCCGTCA GGAAGGGCTC AACTGGACTA TTGCAACCCT GTTCGCACTT 

4 51 ATCGTCCTCA TTATGCAGCT TTCCTACCTC TTCATCCTAT GA 

This corresponds to the amino acid sequence <SEQ ID 142; ORF 039.ng>: 

g03 9.pep 

1 MPSEPPAASD GIKPTHTEKT SCPPVSVRTA KPASGSKKPS STSPKASSSA 

51 KNAKGCLKPK TIWQARKNLY STIGPKLFRD VKLVHRIGTH AISKKQMSRD 

101 EIADILNGGT TLHDTPPATA AAAPAAAPQV SVPPARQEGL NWTIATLFAL 

151 IVLIMQLSYL FIL * ~ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 143>: 

xn03 9 . seq 

1 ATGCCGTCCG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CGnnnnnnnn nnnnnnnnnn 

151 rmnnnannnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnCCC GAGGCTGTTT 

251 CGGATGTCAA ACTCGTTCAC CGTATCGGCA CGCGCGCCAT CGGCAAGAAA 

301 CAGATTTCCC GTGACGAAAT CGCCGGCATC CTCAACGGCG GTACAACCCA 

351 GCCCGATATT CCGCCCGCAA CCGCCGCCAC CCCTGCTGCC GCACCGCAGG 

4 01 TTACCGTACC GCCCGCCGCG CCCGCCCGTC AGGATGGGTT CAACTGGACG 

451 ATTGCAACCC TGTTTGCCCT TATCGTCCTC ATTATGCAGC TTTCCTACCT 

501 CGTCATCCTA TGA 

This corresponds to the amino acid sequence <SEQ ID 144; ORF 039>: 

m039 .pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP EAVSDVKLVH RIGTRAIGKK 

101 QISRDEIAGI LNGGTTQPDI PPATAATPAA APQVTVPPAA PARQDGFNWT 

151 IA TLFALIVL IMQLSYLVIL * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 145>: 

a039.seq 

1 ATGCCGTCTG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CCAAGGCTTC GTCGTCTGCC 

151 AAAAATGCGA AGGAATGTTT AAAGCCAAAG ACCATCTGGC AAGCACGAAA 

201 GAACCCATAT TCAACGATT. TGCCCGAAGC TGTTTCGGAT GTCAAACTCG 

251 TTCACCGCAT CGGCACGAGC GCCATCGGCA AGAAACAGAT TTCCCGTGAC 

301 GAAATCGCCG GCATCCTCAA CGGCGGCACA ACCCAGCCCG ATATTCCGCC 

351 CGCAACCGCC GCCACCCCTG CTGCCGCACC GCAGGTTACC GTACCGCCCG 

4 01 CCGCGCCCGC CCGTCAGGAT GGGTTCAACT GGACGATTGC AACCCTGTTT 

4 51 GCCCTTATCG TCCTCATTAT GCAGCTTTCC TACCTCGTCA TCCTATGA 

This corresponds to the amino acid sequence <SEQ ID 146; ORF 039.a>: 

a039.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPKASSSA 

51 KNAKECLKPK TIWQARKNPY STIXPEAVSD VKLVHRIGTS AIGKKQISRD 

101 EIAGILNGGT TQPDIPPATA ATPAAAPQVT VPPAAPARQD GFNWTIATLF 

151 ALIVLIMQLS YLVI L* 

m039/a039 79.4% identity over a 1 70 aa overlap 

10 20 30 40 50 60 

m039 . pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

I I I I I M I I I I I M I 1 1 I I I II II I I I | | | | | | | | | | | | | | | | 
a039 MPSEPPYAS DGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPKASSSAKNAKECLKPK 
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10 



20 



30 



40 



50 



60 



70 80 90 100 HO 120 

m039 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDErAGILNGGTTQPDI 
• : I I I I I I I I I I i I I I I I I I I I I II I I I | | | | M | | | | II || | 

a039 TIWQARKNPYSTIX PFJWSDVKLVHRIGTSAIGKKQISRDEIAGILNGGTTQPDI 

70 80 90 100 no 

130 14C 150 160 170 

m039 .pep PPATAATPAAAPQVTVPPAAPARQDGFTIWTIATLFALIVLIMQLSYLVILX 
I I I I I I I I I 1 1 1 1 f i I I I I 1 1 I II I 1 1 I II I I I i 1 1 i I I I I I | | 1 1 ! | j I | 
a039 PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 
120 130 140 150 160 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 039 shows 60.8% identity over a 171 aa overlap with a predicted ORF (ORF 039.ng) 
from N. gonorrhoeae: 

m039/g039 

10 20 30 40 50 60 

m03 9.pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

MUM MMMI \ Mill: Mlhlllllhl II 

903 9 MPSEPPAASDGIKPTHTEKTSCPPVSVRTAKPASGSKKPSSTSPKASSSAKNAKGCLKPK 
10 20 30 40 50 60 

70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 

M IIIMMMhlhllhlMIM MMMI I 

9039 TIWQARKNLYSTIG PKLFRDVKLVHR I GTHAI S KKQMS RDE I AD I LNGGTTIiHDT 

70 80 90 100 110 



130 140 150 160 170 

m03 9 . pep PPATAAT- PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 

M MM-- || mil: |M I Ihhl Ml MMMI II I Ml III Ml 

g039 P PATAAAAPAAAPQVS VP PA RQEGLNWTIATLFALIVLIMQLSYLFILX 

120 130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 147>: 

g040.seq 

1 ATGAACGCGC CCGACAGCTT T3TCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATGCGCGGCA CGACACTGGT CGCCGGCATA GAcggCCGCC 

101 TGCTCGAAGG CGGCACC7TA AATAAGCTCG CCGCCGACAT C3GGCTGTTG 

151 TCGCAACTGG GCATCCGACT CGTCCTCATC CACGGCGCGT ACCACTTCCT 

201 CGAccgCCTC GCCGCCGCGC AAGgccGCAC GCCGCATTAT TGCCGgggtt 

251 tGCGCGTTAC CGACGaAACc tcGctcgGAC AGGCGCAGCA GtttGCCGGC 

301 AccgTCCGCA GCCGTTTTGA agcCGCATTG tgcggcagCG tttcaggatt 

351 cgcgCGCGCG CCTTCCGTCC CGCTCGTAtc gggcaacttc ctgacCGCCC 

401 GTCcgatggg cgtgattgac ggaACCGata tggaatacgc gggggttatc 

4 51 cgcaaaaccg ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGATG CCGCCGCTCG GGCATTCCTA CGGCGGCAAA ACCTTCAATC 

551 TCGATATGGT GCAGGCCGCC GCTTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGCCAGCGA AACCCGACGA CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

751 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGGGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATC 

901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCGTCCTAT TGCACCGCAG 

951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 
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1001 ACGGCGACCT GTACGGCTGT GCCGCACTCA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGg 

1101 ctACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

12 01 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGCTGCCCG AAACGCGGCG 

1251 CAAAGACTAC CGCAGCAACG GACGAAACCC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 148; ORF 040.ng>: 

g040.pep 

1 MNAPDSFVAH FREAAPYIRQ MRGTTLVAGI DGRLLEGGTL NKL AADIGLL 

51 SQLGIRLVLI HGAYHFLDRL AAAQGRTPHY CRGLRVTDET SLGQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPMGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWM PPLGHSYGGK TFNLDMVQAA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAASETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GVLLHRSREY LENHISEFSI LEHDGDLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERG FQTASED ELPETRRKDY RSNGRNPH1L VRRLHR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 149>: 

m040 . seq 

1 ATGAGCGCGC CCG ACCTCTT TGTCGCCCAC TTCCGCGAAG CCGTCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCTC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAgCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AG CCG CAT TG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

451 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCTATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGCCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGGCA AACGCGACGG CTGATTTCGT CCGCCGAACT CTTCACCCGC 

751 AACGGCATCG GCACGTCCAT TGCCAAAGAA GCCTTCGTCT CCATCCGGCA 

801 rGCGCAywgG G.CGACATCC CGCACATCGC CGCCCTCATC CGCCCGCTGG 

851 AAGAACAGGG CATCCTGCTG CACCGCAs.c GCGAATACCT CGAAAACCAC 

901 ATTTCCGAAT TTTCCATCCT CGAACACGAC GGCAACCTGT ACGGTTGCGC 

951 CGCCCTGAAA ACCTTTGCCG AAGCCGATTG CGGCGAAATC GCCTGCCTTG 

1001 CCGTCTCGCC GCag.cACAG GACGGCGGCT ACGGCGAACG CnTGCTTGCC 

1051 CACATTATCG ATAAGGCGCG CG G CAT AG GC ATAAGCAGGC TGTTCGCACT 

1101 GTCCACAAAT ACCGGCGAAT GGTTTGCCGA ACGCGGCTTT CAGACGGCAT 

1151 CGGAAGACGA GTTGCCCGAA ACGCGGCGCA AAGACTACCG CAGCAACGGA 

1201 CGGAACTCGC ATATTCTGGT ACGTCGCCTG CACCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 150; ORF 040>: 

m040.pep 

1 MSAPDLFVAH FREAVPYIRQ MRGKTLVAGI DDRLLEGDTL NK LAADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFYLDMLQTA ASAAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAGGQTRR LISSAELFTR 

251 NGIGTSIAKE AFVSIRQAHX XDIPHIAALI RPLEEQGILL HRXREYLENH 

301 ISEFSILEHD GNLYGCAALK TFAEADCGEI ACLAVSPQXQ DGGYGERXLA 

351 HIIDKARGIG ISRLFALSTN T3EWFAERGF QTASEDELPE TRRKDYRSNG 

4 01 RNSHILVRRL HR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 151>: 

a040 . seq 
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1 ATGATCGTGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGTTCG CCGCCGACAT CGGGCTTTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCGC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAGCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCCATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGTA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGCGA AACGCGACGG CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

7 51 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGAGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATT 

901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCATCCTGC TGCACCGCAG 

951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CT CG AACACG 

1001 ACGGCAACCT GTACGGTTGC GCCGCCCTGA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TSCCGTCTCG CCGCAGGCAC AGGACGGCGG 

1101 CTACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGTTGCCCG AAACGCGGCG 

1251 C AAAGAC T AC CGCAGCAACG GACGGAACTC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 152; ORF 040.a>: 

a04 0.pep 

1 MIVPDLFVAH FREAAPYIRQ MRGKTLVAGI DDRLLEGDTL NKFAADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFHLDMLQTA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAV TLSAQEAQSL AEHAGGETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

3G1 AALIRPLEEQ GILLHRSREY L2NHISEFSI LEHDGNLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 CI ERGFQTASED ELPETRRKDY R3NGRNSHIL VRRLHR* 

m040/a04O 91.5% identity in 436 aa overlap 

10 20 30 40 50 60 

MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 
I : I I I I I I I I I I I : I I I I I I I I I I I I I M I I I I I I I I I I I I: II I I I M I I I I II II I I 
MIVPDLFVAH FREAAPYIRQMRGKTLVAGIDDRLLEGDTLNKFAADIGLLSQLGIRLVLI 
10 - 20 30 40 50 60 

70 80 90 100 110 120 

HGARH FL DRHAAAQGRT PH YCRG LRVT DET S LEQAQQFAGT VR SR FEAALCG S VS G FARA 
I i 1 1 I 1 1 I I I I I I I I 1 1 It I I I 1 1 I I I ! I I I I IN 1 1 I I II I I I II I I I I I I 1 1 I 1 1 1 1 I 
HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 
70 80 90 100 110 120 

130 140 150 160 170 180 

PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 
I I I I I II i I I I I I 1 1 1 1 I 1 1 1 I I I I I I I I I I I IM 1 1 I 1 1 I 1 1 I 1 1 I I I I I 1 1 1 I i I 1 1 I 
PSVPLVSGNFLTARPIGVIDGTDMEYAGV1RKTDTAALRFQLDAGN IVWLPPLGHSYSGK 
130 140 150 160 170 180 

190 200 210 220 230 240 

TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 
M • I I I I I M I I : I I I I! I I II I I I I I I I M I I I I I I I I I | | I I I | I I I I I I ! I I: I | i 
TFHLDMLQTAASVAVSLQAEKLVYLTLSDGISRPDGTLAVTLSAQEAQSLAEHAGGETRR 



m04 0.pep 
a040 

m040 .pep 
a040 

m040 .pep 
a040 

mO 4 0 . pep 
a040 
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190 200 210 220 230 240 

250 260 270 

LISSA ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI 

inn 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 ii tun 

LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 
250 260 270 280 290 300 

280 290 300 310 320 330 

AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I j | | | 
AALIRPLEEQGILLHRSREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 

310 320 330 340 350 360 

340 350 360 370 380 390 

PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

II I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I I I I I I I I I I I 
PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

370 380 390 400 410 420 

400 410 
RSNGRN SH I LVRRLHRX 
I I I I I I I I I I ! I I I I I I 
RSNGRN SHI LVRRLHRX 
430 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae . 

ORF 040 shows 88.3% identity over a 436 aa overlap with a predicted ORF (ORF 040.ng) 
from N. gonorrhoeae: 

m040/g040 



m04 0 .pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 60 

I : I I I I II I I I I I : I I I I II I I II I I I I I NIM I I I I I I I I I I I I II I I I I I I I I 
g040 MNAPDSFVAHFREAAPYIRQMRGTTLVAGIDGRLLEGGTLNKLAADIGLLSQLGIRLVLI 60 

m040 .pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I II I I I I I I I I II I I 1 I I I | 
g040 HGAYHFLDRLAAAQGRTPHYCRGLRVTDETSLGQAQQFAGTVRSRFEAALCGSVSGFARA 120 

m04 0.pep P S V PL V SGN FLT AR P I GV I DGT DME Y AG V I RKTDT AALR FQLD AGN I VWL PPLGHSYSGK 180 

I I I I I I I I I I I 1 1 I l: I 1 1 I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I : I I I I I I |: I I 

g04 0 PSVPLVSGNFLTARPMGVIDGTDMEYAGVI RKTDT AALRFQLDAGNIVWMPPLGHSYGGK 180 

m04 0 . pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 240 

II I II : I: I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I : :: I I I 
g040 T FN LDMVQ AAA S VA V S LQ AEKL VY LT L S DG I S R P DGT L AE T LS AQE AQ S LAEHAAS ET RR 240 

m040.pep LISSA ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI 27 6 

I I I I I I I I I I II I I I I I I I I I I I I I I II I Mill 

g04 0 LI SSAVAALEGGVHRVQI LNGAADGSLLQELFTRNGIGTS I AKEAFVSIRQAHSGDI PHI 300 

m04 0 .pep AALI RPLEEQG I LLHRXRE YLENH I SE FS I LEHDGNLYGCAALKTFAEADCGEI ACLAVS 326 

I I I I I I I I I I I : I I I I I i I I I I I I I I I I I I I I I I: I I I I I I I I I I I I M I I I I I I I I I I 

g04 0 AALIRPLEEQGVLLHRSREYLENHISEFSILEHDGDLYGCAALKTFAEADCGEI ACLAVS 360 

m04 0 . pep PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 3 S 6 

II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I | I I I I I I I I I I | | | | | | | | ! | | 
g040 PQAQDGGYGERLLAH 1 1 DKARG IG I SRLFALSTNTGEW FAERG FQTASE DE LPETRRKDY 4 20 

m040.pep RSNGRNSH I LVRRLHRX 413 

I I I I I I I I I I I II I I I 
g04 0 RSNGRN PHI LVRRLHRX 437 



mC4 0.pep 
aC40 

m040.pep 
a040 

m040.pep 
a040 

m040 .pep 
a040 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 153>: 

g041 .seq 

1 ATGAGTTCGC CCAAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGCCT 

51 GATTACCGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGTGCGCTGG 

101 TGTGCGAAGT ACCGCTGACC GATATGATCC GTTATCCGCT GCTGTCCGCC 

151 GGTTCAAGTT GGACGGACGA ATACGGCAAT CCGCAGAAAT ACGAAGCCTG 

201 CAAACGCCGG CTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCACTC ATTACCACCA GCCTCAGCGA CGACCGCGTC 

301 CATCCCGCCC ACGCGCTCAA ATTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 31 CCCAACGCGA ATCCGCCGAC AAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

451 GAATTTTTGG GATAA 

This corresponds to the amino acid sequence <SEQ ID 154; ORF 041 .ng>: 

g041.pep 

1 MSSPKHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

31 GSSWT DEYGN PQKYEACKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQRESAD KLACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 155>: 

m041.seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC C-ACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCCGC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC GAACTCGCCT GCGTCTTGCT GTTTTTGAAA 

. 451 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 156; ORF 04 1>: 
m041.pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCXRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSAQSW LYSPDGGGHT GNGTQRESAD ELACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 157>: 
a041.seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATA GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

401 CGCAGCGCGA AGCCGCCGAC GAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 1 58; ORF 041 .a>: 

a 041 .pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQREAAD ELACVLLFLK 
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151 EFLG* 

m04i/a04i 98.7% identity over a 154 aa overlap 

10 20 30 40 50 60 

ra041.pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

I I I I 1 1 I I I I I I I I 1 1 I I I I I I I I II I I I I II I I I I II I I I M I II II I I I I I I I 

a04l ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 

70 80 90 100 110 120 

n\041.pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I I f I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I ! I I I I M I | | | | | | | Mi 
a041 PQK YE VCKRR LG E LSPYHNL3DGIDYP PAL I TT S L S D D R VH P AHAL K F Y AK LR E T S PQ S W 

70 80 90 100 110 120 

130 140 150 

m04 1 . pep L Y S PD GGG HTGNG T CRE SADE LAC VL L FLKE FLG X 

I I i I I I I I I I I I I I I I 1 : I II I I I I I 1 i I I I I I I I 
a04 1 LYSPDGGGHTGNGTCREAADELACVLLFLKEFLGX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 041 shows 96.8% identity over a 154 aa overlap with a predicted ORF (ORF 041. ng) 
from N. gonorrhoeae: 

m041/g041 

10 20 30 40 50 60 

m041.pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
: I I i: I I I I I I I I I I I I I I I I I III I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g041 MSSPKHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 

70 80 90 100 110 120 

ra041 . pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I I I I I = I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I II I | || | IN 
g041 PQKYEACKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 8C 90 100 110 120 

130 140 150 

mO 4 1 . pep LYS PDGGGHTGNGTQRESADELACVLLFLKEFLGX 

M I I I I I I I I I I I I I I I I I I : I I I I I I I I It I I I I 
g04 1 LYSPDGGGHTGNGTQRESADKLACVLLFLKEFLGX 
130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 59>: 
g041-l .seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCG CGCGCACTTT CAGACGGCAT TTTGAATCAA 

151 ATGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CA7CAGAATG CGGAATATCC GAAGGGCGTG TACCGCATGT 

251 GTACGGCGGC GACCTACCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGATGA GTTGCTCGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAGC AGCCCAACCG CGCGCTGCTG ACTTTGAACA 

401 AATCGGGCGG CGATACGGCG TATACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAATTGGTAG AGGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAAG TGTGGCTGGT GGAACGCGGC 
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601 AAGAGTTTCG AGGAAAGCCT GCCGGCGTAC CAAATCGATA AAGGCGCGAT 

651 GATGGTAAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

701 TGATTGAAGC GTCGGACGGT TTTTACACCA AG AC GT ATTT GCAGGTGTCG 

751 TCCGAAGGCG GGGCGAAACC GTTGAACCTG CCTAATGATT GCGATGTGGT 

801 CGGCTATCTG GCGGGACATC TTTTGCTGAC GCTGCGCAAG GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCGT TGGTGGCGGT GAAACTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCAAGCCTGC 

1001 TGGAGAATGT ACAAGGCCGT CTGAAAGCGT GGCGGTTTGC CGACAGCAAA 

1051 TGGCAGGAAG CCGAGTTGCC GCACCTGCCC TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCG TGGGGCGGCG ACGTGGTTTA TCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC TCCAGCCGCA GCAGTTTGTT TCAGACGGCA TCGAAGTGCG 

1251 GCAGTTTTGG GCGGTGTCGT CCGACGGCGA ACGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACACGCCGA CCTTAGTCTA TGCTTACGGA 

1351 GGTTTCGGCA TTCCTGAATT GCCGCATTAT CTGGGCAGCG TCGGCAAATA 

1401 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCAAACATC CGCGGCGGCG 

1451 GAGAATTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAC 

1501 AAAAGCGTTG ATGATTTGTT GGCAGTCGTG CGTGATTTGT CCGAACGCGG 

1551 CATGAGTTCG CCCAAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGCC 

1601 TGATTACCGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGTGCGCTG 

1651 GTGTGCGAAG TACCGCTGAC CGATATGATC CGTTATCCGC TGCTGTCCGC 

1701 CGGTTCAAGT TGGACGGACG AATACGGCAA TCCGCAGAAA TACGAAGCCT 

1751 GCAAACGCCG GCTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1001 ATCGATTATC CGCCCGCACT CATTACCACC AGCCTCAGCG ACGACCGCGT 

1851 CCATCCCGCC CACGCGCTCA AATTCTACGC CAAACTGCGC GAAACCTCGC 

1931 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CAAACTCGCC TGCGTGTTGC TGTTTTTGAA 

2031 AGAATTTTTG GGATAA 

This corresponds to the amino acid sequence <SEQ ID 160; ORF 041-l.ng>: 

g041-l.pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILNQ 

51 MQDTRQIPFC QEHRARMYHF HQNAEYPKGV YRMCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLNKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KS FEES L PAY QIDKGAMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 SEGGAKPLNL PNDCDWGYL AGHLLLTLRK DWHRAKQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADSK 

351 WQEAELPHLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

4 01 VMRLQPQQFV SDGIEVRQFW AVSSDGERIP YFHVGKNAAP DTPTLYYAYG 

4 51 GFGIPELPHY LGSVGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAW RDLSERGMSS PKHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEACKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA KALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQRESADKLA CVLLFLKEFL G* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 161>: 

m041-l. seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAGAAAA CGACAAGGCG CGCGCGCTTT CAGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAGGACG CGGAGTATCC GAAGGGCGTG TACCGCGTGT 

251 GTACCGCGGC GACGTATCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTTGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

401 AATTGGGCAG CGATACGGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAGTTGGTCG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG AACGAACGCC 

551 AGTTGACCCA ATCGGGCTAT CCGCGCGAAG TATGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT C-CCTGTGTAT CAAATCGGCG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ATCTCGATCC GCAGGGTTCG CCGATTGATT 

7 01 TGATTGAAGC GTCGGACGGT TTTTACACCA AAACCTATTT GCGGGTCTCA 
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751 GCCGAAGGCG AGGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTGGT 

801 CGGCTATCTG GCGGGGCATC TTTTGCTGAC GCTGCGCAAG GACTGGAACC 

851 GCGCGAACCA AAGCTATCCG AGCGGCGCGC TGGTGGCGGT GAAGCTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCGAGCCTGT 

1001 TGGAGAACGT ACAAGGCCGT CTGAAAGCAT GGCGGTTTGC CGACGGCAAA 

1051 TGGCAGGAAG TCGAATTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCT TGGGGCGGCG ACGTGGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

1401 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 

1451 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG CGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

1751 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 AT CG ATT AT C CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCCG 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CGAACTCGCC TGCGTCTTGC TGTTTTTGAA 

2001 AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ ID 162; ORF 041-1>: 

m041-l.pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLENDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKLGSDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW NERQLTQSGY PREVWLVERG 

201 KSFEESLPVY QIGEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLRVS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWNRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADGK 

351 WQEVELPRLP SGALEMTDQP WGGDWYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGI PELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HOAAOGI SKH 

501 KSVDDLLAVV RDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSAQSWLYS PDGGGHTGNG 

651 TQRESADELA CVLI.FI.KF.FT.. G* 

m041-l/g041-l 94.6% identity in 671 aa overlap 

10 20 30 40 50 60 

MKSYPDPYRHFENLD5AETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 

I t I I 1 I I I I I I I I I I I I i I I I I 1 I I I I I I 1: I I I I I I I I I I I I I : M II I I I I I 

MK S Y PD P YRH FEN L DS AETQN FAAE AN AE TRAR FLNNDKARAL S DG I LN QMQ DTRQ I P FC 
10 20 30 40 50 60 

70 80 90 100 110 120 

QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I I I I I M I I I I: I I I II I I I I :| I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
QEHRARM YH FHQN AE Y PKG V Y RMCT AAT YR SG Y PEW K I L FS V A D FDE LLG DD V YLGGV S H 
70 80 90 100 110 120 

130 140 150 160 170 180 

LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I M I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I | | | | | | 
LVEQPNRALLTLNKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
130 140 150 160 170 180 

190 200 210 220 230 240 

NERQLTQSGY PRE VWLVERGKS FEE S L P V YQ I GE DGMMVNAWR Y LD PQG S PI DLIEASDG 
•Mill: I I I I I I i I I I I 1 I I i : I I I : : I I I t I I I I I I M I I i I I I I I I I M 



m041-l.pep 
g041-l 

m041-l.pep 
g041-l 

m041-l.pep 
g041-l 

mO 4 1-1 .pep 
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g041-l DERQLTESGYPREVWLVERGKSFEESLPAYQIDKGAKMVNAWRYLDPQGSPIDLIEASDG 
19Q 200 210 220 230 240 

250 260 270 280 290 300 

m04 1-1 . pep F YT KT YLR VS AEG EAKP LNL PN D C D WG YL AG H L LLT LRK DWN RAN QS Y P SG ALV A VKLN 
I I I I I I I: I I : I I I 1 I I i I I M I I I i I i [ I i I I I I II I I I I : I I I I I I I I I i | | | | | | | 
g041-l FYTKTYLQVSSEGGAKPLNLPNDCDWGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m04 1-1 . pep RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 
I I I 1 1 I II I I I I I I I I II I 1 1 I I I I I I I II I I I I I I I I I I I I M II I I :! I I I: I I I: I I 
g041-l RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADSKWQEAELPHLP 

310 320 .330 340 350 360 

370 380 390 400 410 420 

mO 4 1- 1 . pep SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 1111:1:111 
g041-l SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRLQPQQFVSDGIEVRQFW 
370 380 390 400 410 420 

430 44C 450 460 470 480 

m041-l.pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
:: I : I I I I 1 1 I I I I I 1 I I I I I I I I I I ! I I I I I I I I I I i I I I I : I I i I I I' I I I I I I I I I I 
• g041-l AVSSDGERIPYFHVGKNAAPDTPTLVYAYGGFGIPELPHYLGSVGKYWLEEGNAFVLANI 
430 440 450 460 470 480 

490 500 510 520 530 540 

m04 1-1 . pep RGGGE FGPRWHQAAQ3 I SKHKSVDDLLAVVRDLSERGI S S PEHI GLQGGSNGGLI TAAAF 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I : I I I : I I I I I I I I I I I I I I I | | | 
g041-l RGGGE FGPRWHQAAQGISKHKSVDDLLAWRDLSERGMSSPKHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 

550 560 570 580 590 600 

m04 1-1 . pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I : I I I I I II I I I I I I I | | | 
g041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEACKRRLGELSPYHNLSDG 

550 560 570 580 590 600 

610 620 630 640 650 660 

m041-l . pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPCGGGHTGNGTQRESADELA 
I I I I I I I I I I i I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I : i I 
g041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPCGGGHTGNGTQRESADKLA 

610 620 630 640 650 660 

670 

m04 1-1 . pep CVLLFLKEFLGX 
I I I I I I I I I I I I 
g U 4 1 - 1 CV LLFLKE FLGX 

670 

m041-l/P55577 

splP55577|Y4NA_RHISN PROBABLE PEPTIDASE Y4NA >gi|2182536 (AE000086) Y4nA IRhizobium sp. 
NGR234J Length = 726 
Score = 370 bits (940), Expect = e-101 

Identities = 217/682 (31%), Positives = 331/682 (47%), Gaps = 22/682 (3%) 

Query: 2 KSyPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFCQ 61 

K DP + +D + + N T + ++ + L LO T +1 

Sbjct: 42 KDASDPRAYLNEI DGDKAMTWVEAHNLSTVDKLSKDPRYSEYQADALTI LQATDRI ACPS 101 

Query: 62 EHRARMY-HFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 120 

R M +F QD + +G++R T +YRSG P+W+ + V + G G 
Gbjct: 102 FARDGMIDNFWQDGTHVQGLWRRTTWESYRSGNPQWRTILDVDALSKAEGKTWVFEGGDC 161 



Query: 121 LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSKRDENSVWVCPAW 100 
L N L+ LS G D E D+ GE V+ GF P GK V+W DEN+*+V W 
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Sbjct: 


162 


Query: 


181 


Sbjct: 


222 


Query: 


233 


Sbjct: 


282 


Query : 


292 


Sbjct : 


338 


Query: 


34 8 


Sbjct : 


398 


Query : 


408 


Sbjct : 


458 


Query: 


465 


Sbjct: 


518 


Query: 


525 


Sbjct: 


578 


Query: 


585 


Sbjct: 


637 


Query: 


645 


Sbjct: 


697 



162 LPPTSNLCLIRLSDGGKDADVVREFDI AKGEFVKEGFVLPEGKQSVTWVDENTI YVTREW 221 



++T SGY 



FY 



+V+RG+S +++4 +++ 



L LP 



-IGEDGMM--VNAWRYLDPQGSPI 232 
E G++- ++ +D + 



GY G -f L+ DW A + + + 



LF P+E Q++ 



TK +V S+L NV 



G W 



L +T 



D +++ + F P TLF D 



+FD+ G+ QQFW TS DG + H-PYF V * 



GK WLE-H3 A+ LANIRGGGEFGP+WH A 



PT++YAYGGF IP p Y 



++ + DD AV +DL 



++S H+ 



G+ GGSNGGL+ 



L +SPYHN+ G+ YP 



A+V +VPL DM+ + +SAG+SW EYG+P 



TS DDRV P HA K A 



GH 



The following partial DNA sequence was 

a041-l.seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA 

101 TTTTAAACAA CGACAAGGCA CGCGCATTGT 

151 TTGCAGGACA CGCGGCAAAT TCCGTTTTGT 

2 CI GTACCATTTC CATCAAGATG CGGAATATCC 

251 GTACCGCGGC GACTTACCGT TCGGGCTATC 

3C1 TCGGTGGCGG ATTTCGACGA ATTGCTCGGT 

351 CGTGTCGCAC CTGGTGGAAC AGCCCAACCG 

401 AATCGGGCGG CGATACCGCG TACACGCTGG 

451 GAGTTGGTAG AAGGCGGTTT TCACTTTCCG 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAGG 

601 AAGAGTTTCG AGGAAAGCCT GCCGGTGTAC 

651 GATGGTGAAC GCGTGGCGTT ACCTCGATCC 

701 TGATTGAAGC GTCTGACGGT TTTTACACCA 

7 51 GCCGAAGGCG AAGCGAAAC C GTTAAACCTG 

801 CGGCTATCTG GCCGGACATC TTTTGCTGAC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCAT 

901 CGCGGCGAAT TGGGCGCGGC GCAGCTTTTG 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG 

1001 TGGAAAACGT ACAGGGTCGT CTGAAAGCGT 

1051 TGGCAGGAAA CCGAGTTGCC GCGCCTGCCT 

1101 CGACCAACCG TGGGGGGGCG ACGTAGTTTA 

1151 CCACGCCGCT GACGCTGTTT GCATTGGATT 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT 

1401 TTGGCTGGAA GAGGGCAATG CCTTTGTATT 



identified in N. meningitidis <SEQ ID 163>: 



TTTGAAAACC 
TGCCGAAACG 
CTGACGGCAT 
CAGGAACACC 
GAAAGGCGTG 
CTGAGTGGAA 
GACGATGTAT 
CGCGTTGTTA 
AAGTGGATTT 
GCAGGCAAAA 
TCCGGCTTGG 
TGTGGCTGGT 
CAAATTGCTG 
GCAGGGTTCG 
AAACCTATTT 
CCCAACGATT 
CTTGCGTAAA 
TGGTAGCAGT 
TTTGCGCCCA 
TTTTGTCGTG 
GGCGTTTTAC 
TCGGGCGCGT 
CCTTGCCGCC 
TGAACGTGAT 
TCAGACGGCA 
GCGCATTCCT 
CGCTGGTCTA 
CTGGGCAGCA 
GGCGAACATC 



TCGATTCCGC 
CGCGCGCGTT 
TTTGGCGCAG 
GCGCGCGGAT 
TACCGCGTGT 
AATCCTGTTT 
ATCTAGGCGG 
ACACTGAGCA 
GGAAGCAGGG 
ACCATGTGTC 
GACGAACGCC 
GGAACGCGGC 
AAGACGGCAT 
CCGATTGATT 
GCAGGTCTCA 
GCGACGTAGT 
GACTGGCACC 
AAAATTAAAC 
ATGAAACGCA 
GCGAGCCTGC 
TGATGGCAAA 
TGGAAATGAC 
AGCGATTTCA 
GGAACTGACC 
TTAACGTGCA 
TATTTCCACG 
TGCCTACGGC 
TTGGCAAATA 
CGCGGCGGCG 



WO 99/57280 



PCT/US99/09346 



236 



1451 GCGAGTTCGG CCCGCGCTGG 

1501 AAAAGCGTTG ATGATTTATT 

1551 TATCAGTTCG CCCGAACACA 

1601 TGATTACTGC CGCCGCCTTC 

1651 GTGTGCGAAG TGCCGCTGAC 

1701 CGGTTCAAGC TGGACAGACG 

1751 GCAAACGCCG GTTGGGCGAA 

1801 ATCGATTATC CGCCCGCGCT 

1851 CCATCCCGCC CACGCGCTCA 

1901 CGCAATCTTG GCTCTACTCG 

1951 ACGCAGCGCG AAGCCGCCGA 

2001 AGAGTTTTTG GGCTAA 



CATCAGGCGG CGCAGGGAAT CAGCAAACA7 
GGCAGTCGTG AGCGATTTGT CCGAACGCGG 
TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 
GTGCGCGAAC CGCAAAGCAT AGGCGCGCTG 
CGACATGATC CGTTATCCGC TGCTCTCCGC 
AATACGGCAA TCCGCAAAAA TACGAAGTCT 
TTGTCGCCGT ATCACAATCT TTCAGACGGC 
CATTACCACC AGCCTGTCCG ACGATCGCGT 
AGTTCTACGC CAAACTGCGC GAAACCTCGC 
CCTGACGGCG GCGGCCATAC CGGCAACGGC 
CGAACTCGCC TGCGTGTTGC TGTTTTTGAA 



This corresponds to the amino acid sequence <SEQ ID 164; ORF 041-l.a>: 

a041-l.pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILAQ 

51 LQDTRQI PFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSRSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPVY QIAEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLC'VS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPNETQALE SVETTKRFVV ASLLENVQGR LKAWRFTEGK 

351 WQETELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERI P YFHVGKNAAP DMPTLVYAYG 

4 51 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV SDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQREAADELA CVLLFLKEFL G* 



a041-l/m041-l 97.9% identity in 671 aa overlap 

10 20 30 40 50 60 

a041-l .pep MKSYPDPYRH FEN L DS AET QN FAAE AN AE T RAR FLNN DKARAL S DG I LAQLQDTRQ I P FC 

I I I I I I I I I I I I I I I I I I I II II I I II I I I : M I I I I I I I I I I I I | ! I | | | ! [ | | 

m04 1-1 MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 

10 20 30 40 50 60 



70 80 90 100 110 120 

a04 1-1 . pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

I I I I I I I M I I I I I I I I M 1 1 I I Ml II || | 1 1 | | | | | | 1 1 1 1 | | | M | n I I I I 1 1 Ml 
m04 1- 1 QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 8C 90 100 110 120 

130 14C 150 160 170 180 

a041-l.pep L VE QPN RAL LT LS K S GG DT A YT LEV DLE AGE LVEGG FH FPAGKN H V S WR DE N S VW VC PAW 

II I II II I I I I I II I: I I I I I I I I I I II I I I I I I I I M M I I I I M I I I I I I I M I II I 
mO 4 1- 1 LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

a041-l . pep DERQLTESGY PRE VWLVERGKS FEESLPVYQIAEDGMMVNAWRYLDPQGSPIDLIEASDG 
> N I I I : I M I I II M II I I II I M I II I I I I : II II II M M M M 1 M II I I I M M I 
m041-l NERQLTQSGYPRJBWLVERGKSFXESLPVYQIGEDG^WVNAWRYLDPQGSPIDLIEASDG 

190 200 210 220 230 240 

250 260 270 280 290 300 

a04 1-1 . pep FYTKTYLQVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 
N I II II : M M M I II I II M I M II I M M M II II I I M : M II I II M M I M II I 
m041-l FYTKT Y LR V S AEG E AK P LN L PN DC DWG Y LAG H LL LT LRKD WNRANQ S Y PS G ALV AVK LN 

250 260 270 280 290 300 

310 320 330 340 350 360 

a04 1-1 . pep RGELGAAOLLFAPNETQALESVETTKRFVVASLLENVQGRLKAWRFTDGKWQETELPRLP 

lllinilMlllrllMMMIMMIIMMMMIM : I I I I I I : I I I I I I 

m041-l RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 

3 10 320 330 340 350 360 
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370 380 390 400 410 420 

a04 1-1 . pep SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMElRQPQQFDSDGINVCX^rw 
I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I || | | | | | | | || | 
m041-l SGALEMTDQPWGGDWYLAAS DFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

a041-l.pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
I I I I I I I I I II I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I M | | M || | | | 
m041-l TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 

430 440 450 460 470 480 

4 90 500 510 520 530 540 

a041-l.pep RGGGEFGPRWHQAAQGISKHKSVDDLLAWSDLSERGISSPEHIGLQGGSNGGLITAAAF 
I I I I I I I I I I I I I I I I I 1 II I I I 1 I I I I I I I I II I I I I I I I I I I I | | | | | l| | | | | | | | 
m041-l RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 

550 560 570 580 590 600 

a041-l.pep VRE PQS IGALVCEV PLT DMIRYPLLSAGS SWT DEYGNPQKYE VCKRRLGELS PYHNLS DG 

I M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 M ll 1 1 1 1 1 1 1 1 1 1 1 1 

m041-l VRE PQS IGALVCEV PLT DMIRYPLLSAGS SWT DEYGNPQKYE VCKRRLGELS PYHNLS DG 

550 56C 570 580 590 600 



610 62C 630 640 650 660 

a041-l.pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQREAADELA 
I I I I I I I I I I I I I II I II I I I I I I I I I ! I I I I I I I II I I I I I I I I II I I I I | | : | | | | | 
m041-l I D Y P PAL I T T S LS D DR VH PAHALK FY AK LRE T S AQ S WL Y S P DGGGH TGN GTQRE S A DE LA 

610 620 630 640 650 660 



a041-l .pep 



m041-l 



670 

CVLLFLKEFLGX 
I I I I I I I I I I I I 
CVLLFLKEFLGX 
670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 165>: . 

g042.seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAagcG TTCGTGCCGC ATACCAGCGC 

51 GTTATCCAAC ACTTCCACGG CAGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TGCGGTCGAT GATGAAAATC CAGCCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACGG GCTGCCCGTG CCCTTCGTTG CGTAAAGATT CGTCCACGGG 

201 CGGCAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GATTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCTGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG cggCTTCGCG CTTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 ATTCAATTCC GCCACGCGCG CTTCCTTACC GAAAATCCGC GACAGGGTCT 

401 CCATCTGCTT CTCGCCGCTG GTGCGGATAT TGCCGTT3TC CACCGTCAAA 

451 TCTATGgtgG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCACCCGG 

501 CCCGCCGGTA ATGACAAACT GCGGATTGTG GCGGTGCAGG GATTCGCAAT 

551 CGGGCTCAAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAG 

This corresponds to the amino acid sequence <SEQ ID 1 66; ORF 042.ng>: 

g042 .pep 

1 MTMICLRFQA FVPHTSALSN TSTAAGPSCP MAAVRSMMKI QPGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRFWA NSASICAFNS ATRASLPKIR DR VSICFSPL VRILPLSTV K 

151 SMWAFFANC SYASAPGPPV MTNCGLWRCR DSQSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 167>: 
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XHUU^ . Bcv] 








A?GACGATGA 


TTTGCTTGCG 


51 


GTTATCCAmT 


ACTTCGACAG 


101 


TACGGTCGAT 


GATGAAAATC 


151 


AAGGAAACAG 


GCTGCCCGTG 


201 


CGGTAGGCCG 


ATGTCGCCGT 


251 


CGAAGGCGGA 


CACCTTGTTG 


301 


TTGCCTTTGG 


CGGCTTCGCG 


351 


CTTCAATTCC 


GCCGCGCGCG 


401 


CCATCTGCTT 


TTCGCCGCTG 


451 


TCTATGGTGG 


TCGCGTTTTT 


501 


CCCGCCGGTA ATGACAAGCT 


551 


CGGGCTCGAA 


CAGCGTCCCC 


601 


AAATAA 





CTTCCAAGCG 
CCGcCGGCCy 
CAATCGGGGT 
CCCCTCGTTG 
GTATCCAACT 
CCCGTAACCG 
CGTTTGGGCG 
CTTCCTTGCC 
GTGCGGATAT 
CGCTAACTGT 
GAGGATTGTA 
ACCGTTGCCG 



TTCGTGCCGC 
TTCyTGCCCG 
TTTTCTCTTT 
CGTAAAGATT 
TGCCAACCGC 
ACAGCACCAG 
AACAGCGCGT 
GAAAATCCGC 
TGCCGTTGTC 
TCATACGCTT 
GCGGTGCAGG 
CCTTGTCAAA 



GTACCAGCGC 
ATGGCGGCGG 
GATGTATTCG 
CGTCTACAGG 
GACTGCGTGC 
CCCGCGTCCT 
CAATCTGCGC 
GCCAAGGTCT 
CACCGTCAGA 
CCGCGCCCGG 
GCTTCGTAAT 
TGCAGGCTGC 



This corresponds to the amino acid sequence <SEQ ID 168; ORF 042>: 

m042 .pep 

1 MTMICLRFQA FVPRTSALSX TSTAAGXSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 

151 SMWAFFANC SYASAPGPPV MTSXGLXRCR ASXSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 169>: 

a042. seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 

551 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 170; ORF 042.a>: 
a042.pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTVR 

151 SMWAFFANC SYASAPGPPV M?S*GL*RCR AS*SGSNSVP TVAALSNAGC 

201 K* 

m042/a042 99.0% identity over a 201 aa overlap 

10 20 30 40 50 60 

MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I I I I I M I I I I I I ( I MINI I I I I M I I I I I I I I I I I I I I I II I I I I II I It I 
MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
10 . 20 30 40 50 60 

70 80 90 100 110 120 

RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II I I 11 I I I II I II I I I I I I II II I I 
RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
70 80 90 100 110 120 

130 140 150 160 170 180 

AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSXGLXRCR 



m042.pep 
a042 

m042 .pep 
a042 

mO 4 2 . pep 
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I 1 1 I I I I I 1 1 M I I I I I i I I 1 1 1 1 II I I 1 1 1 1 1 I I I I I I I I I I I 1 1 i 1 1 I | | | 1 1 | | || | 

aC4 2 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 
130 140 150 160 170 180 

190 200 
m042.pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I II I ! II I I I I II I I I 
a04 2 ASXSGSNSVPTVAALSNAGCKX 

190 200 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 042 shows 93.0% identity over a 201 aa overlap with a predicted ORF (ORF 042.ng) 
from N. gonorrhoeae: 

m042/g042 

10 20 30 40 50 60 

MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFS LMYSKETGCPCPSL 
I i I I I I I I I I I I I : I I I I I I I 1 I I I I II I I I I I I I [ I I I I I I I I I | | | | | | || | | | | 
MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFS LMYSKETGCPCPSL 
10 20 30 40 50 60 

70 80 90 100 110 120 

RKDSSTGGRPMSPCIQLAigRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I 1 I I I II I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I 
RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 
70 80 90 100 110 120 

130 140 150 160 170 180 

AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 
1:11111111 I I M I M I I I I I I I I I : I I ! I I I I I I I I I I I I ! t I I I I I : || III 
ATRASLPKIRDRVSICFSPLVRILPLSTVKSMWAFFANCSYASAPGPPVMTNCGLWRCR 
130 140 150 160 170 180 

190 200 
ASXSGSNSVPTVAALSNAGCKX 
i I I I I I II I I I I I I I I I I I I 
DSQSGSNSVPTVAALSNAGCKX 
190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 171>: 

m04 2-l .seq 

1 ATGACGATGA TTTGCTTGCG 

51 GTTATCCAAT ACTTCGACAG 

101 TACGGTCGAT GATGAAAATC 

151 AAGGAAACAG GCTGCCCGTG 

201 CGGTAGGCCG ATGTCGCCGT 

251 CGAAGGCGGA CACCTTGTTG 

301 TTGCCTTTGG CGGCTTCGCG 

351 CTTCAATTCC GCCGCGCGCG 

4 01 CCATCTGCTT TTCGCCGCTG 

4 51 TCTATGGTGG TCGCGTTTTT 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 172; ORF 042- 1>: 

m04 2-l . pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
1C1 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 
151 SMWAFFANC SYASAPGPPV MTS* 



m042 .pep 
g042 

m04 2 .pep 
g042 

m042.pep 
g042 

m04 2 .pep 
g042 



CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 
CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 
CAATCGGGGT TTTTCTCTTT GATGTATTCG 
CCCCTCGTTG CGTAAAGATT CGTCTACAGG 
GTATCCAACT TGCCAACCGC GACTGCGTGC 
CCCGTAACCG ACAGCACCAG CCCGCGTCCT 
CGTTTGGGCG AACAGCGCGT CAATCTGCGC 
CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 
GTGCGGATAT TGCCGTTGTC CACCGTCAGA 
CGCTAACTGT TCATACGCTT CCGCGCCCGG 



m042-l/g042 95.4% identity in, 173 aa overlap 
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10 20 30 40 50 60 

m042-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I I I II I II i : I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | • | 
g042 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 4 0 50 60 

70 80 90 100 110 120 

m042-l . pep RKDSSTGGRPMS PC IQLANRDCV PKADTLL PVT DSTS PR PLPLAASRVWANSAS ICAFN S 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I | | | 
g042 RK DS STGGRPM S PC IQL ANR DC V PKA DTLL PVT D ST S PR PL PLAAS R FW AN S AS I C AFN S 

70 80 90 100 110 120 

130 140 150 160 170 

m04 2-l.pep AARASLPKI^KVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSX 

I : I I I II I I I •: I I I I I I I I I I I I I I I M : I I I I I I I I I I I I I I I I I I I I I I : 
g042 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMWAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

g04 2 DSQSGSNSVPTVAALSNAGCKX 
190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 173>: 

a042-l.seq 

1 ATGACGATGA TTTGOTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

451 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 174; ORF 042-1. a>: 

a042-l.pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCV PKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR 
151 SMWAFFANC SYASAPGPPV MTS* 



m042-l/a042-l 100.0% identity in 173 aa overlap 

10 20 30 40 50 60 

m042-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I I I I I I I I I I I I I I I I I I I I I ( I I I 1 I II I I i I I I I I i I I I I I I I ) I i I I I I I I I I 
a042-l MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 4 2-1. pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I M I II 1 I I I I I I I I I I I I I I I I 
a042-l RKDSSTGGRPMSPC IQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m042-l.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYA5AFGPPVMTSX 
I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I IN I II I t I 
a042-l AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 175>: 
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g043 . seq 




1 


ATGGTTGTTT 


51 


TCACATCCGA 


101 


CGTCCCGCGT 


151 


GCCCGATTCG 


201 


GTCCGGCGAC 


251 


CATTCGTTCA 


301 


GGCGAATTCG 


351 


GGTAGCCGTT 



CAAATCAAAA TATCTATGCC 
AGGCAAAAAT CCGTAATGCC 
ggcggtagcc gcAAAAGTGC 
ATGAGGGcga gcGCGTGTTC 
GGTTTCGCCG GTTTGCGCTT 
GCCCGACGCG GCGGGCGATT 
CTGTTCAAAA TATCGGCGGC 
GTCGTGGCGG CGGAAGGTGA 



GTCGGCCCAT CAGCACTTTT 
GCCTGAACGC TTCgttgaAC 
ATcGCGGCTT GGATGSTGCT 
CAGCCGCAGG CGGCGCAGGC 
TGAAATAGCG TTTCAGGTAG 
TCGGCGATGG 7CAGCGGGCG 
TTCGTCTATG CGCCGGCGGC 
AGCGTAA 



This corresponds to the amino acid sequence <SEQ ID 176; ORF 043.ng>: 

g043.pep 

1 MVVSNQNIYA VGPSALFHIR RQKSVMPPER FVEPSRVAVA AKVHRGLDGA 

51 ARFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQPDA AGDFGDGQRA 

101 GEFAVQN IGG FVYAPAAVAV WAA EGEA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 177>: 

m043 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 

151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAgGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 178; ORF 043>: 

m04 3.pep 

1 MWSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQDVGG FVYAPTAVTV WAAEGEAQ* 

Computer analysis of the amino acid sequences gave the following results: 

Homology with a predicted ORF from N. meningitidis menA with menB 

ORF 043 shows 89.8% identity over a 128 aa overlap with a predicted ORF (ORF043.a) from 

N. gonorrhoeae: 

m043/g043 

10 20 30 40 50 60 

m043 .pep MWSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I I I I I II I I I: I I I I I : I I I I I I I I I I I I i I I I I I I I I I II I I I I I I I I I I I I I I I I 
g04 3 MWSNQNIYAVGPSALFHIRRQKSVMPPERFVEPSRVAVAAKVHRGLDGAARFDEGERVF 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 4 3 . pep QPQAAQASGDGFAGLRFEI A FQVAFVQSDAAGDFG DGQRTGEFVLQDVGG FVYAPTAVTV 

I I I I I I I I I I I 1 I I I I I t II M I I I I I I I I I I I I I I I ! : I I I : : t : : I I I I I I I : | | : | 
g043 Q PQAAQASGDG FAG LRFEI AFQVA FVQP DAAG D FG DGQRAGE FAVQN I GG FVYAPAAVAV 

70 80 90 100 110 120 



130 

mO 4 3 . pep WAAEGEAQX 

I I I I I I I I 
g04 3 WAAEGEAXX 
130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 179>: 
a043.seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 
51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 
101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 
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151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

2 31 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 
251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 
301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

3 51 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 



This corresponds to the amino acid sequence <SEQ ID 180; ORF 043.a>: 

a043 .pep 

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQD VGG FVYAPTAVTV WAA EGEAQ* 



ra043/a043 100.05 identity in 129 aa overlap 



10 2C 30 40 50 60 

m043.pep MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I I I I I I I M I I I I I I I I I I I I I I I 1 I I I t I ! I I I I i I I I I I I 1 I I 1 I | I I I | | | | I | | | | 
a04 3 MVVSNQN I YAAGPSALLH I RRQKSVMPSER FVE PSRVAVAAKVHGGLDGAAG FDEGERVF 

10 20 30 40 50 60 



70 80 90 100 110 120 

. m04 3 . pep QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 
I I I I M I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I t I I I I I I I I M I | | 
a04 3 QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 
70 80 90 100 110 120 

130 

m04 3 . pep WAAEGEAQX 
I I I I I I I I I ! 
aC4 3 WAAEGEAQX 
130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 81>: 

g044.seq 

1 ATGCTGCCCG ACCAGAGCGT CGAGTTCTTG CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGCCC TTCGATAACG GCGGTCAGCT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 CGGCTGCCGT AGCGCATTAA 



This corresponds to the amino acid sequence <SEQ ED 1 82; ORF 044 .ng>: 

g044.pep 

1 MLPDQSVEFL P QWVFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQP FDNGGQLHAV VGGLRFAAEK FFFAAAVAH* 

The following partial DNA sequence was identified in AT. meningitidis <SEQ ID I83>: 

m044.seq 

1 ATGCCGTCCG ACTAGAGCGT OGAGTTCTTT CCAGAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGAGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCATTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCAGTCAGTT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTACCGT AGCGCAyTAa 



This corresponds to the amino acid sequence <SEQ ID 184; ORF 044>: 

m044 .pep 

1 MPSDXSVEFF PE VWFDGLF GGGFPAVAL P TVYPVFHAIF DVLRVGADDD 
51 GAAAFERFQS FDDGSQFHAV VGGLRFAAEK FFFVATVAH* 



WO 99/57280 



PCT/US99/09346 



243 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 185>: 

a044 .seq 

1 GTGCCGTCCG ACCAGCGCGT CGAGTTCTTT CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCGGTCAGTT 

201 CCATACGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 186; ORF 044.a>: 

a044 . pep 

1 VPSDQRVEFF PQ VWFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQS FDDGGQFHTV VGGLRFAAEK FFFVAAVAH * 

m044/a044 91.0% identity over a 89 aa overlap 



10 20 30 40 50 60 

m04 4 .pep MPSDXSVEFFPEVWFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 
: I i I II II I : I! I I I i I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I I I I 
aO 4 4 VPSDQRVEFFPQWVFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQS 
10 2C 30 40 50 60 

70 8C 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
II I I : I I I : M I I I I II I I I I I I I I : I ! I I 
a04 4 FDDGGQFHTWGGLRFAAEKFFFVAAVAHX 

70 80 90 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 044 shows 86.5% identity over a 89 aa overlap with a predicted ORF (ORF 044.ng) 
from N. gonorrhoeae: 

m044/g044 

10 20 30 40 50 60 

m044.pep MPSDXSVEFFPEWVFDGLFGGGFPAVALPTVYPVFHAI FDVLRVGADDDGAAAFERFQS 

I I III M MlilMMIIIMIIIIII III MIM llllll III III llllli 

g0 4 4 MLPDQS VEFLPQVVVFDGLFGGGFPAVAIiPTVYPVFHAVFDVLRVGADDDGAAAFERFQP 

10 20 30 40 50 60 



70 80 90 

mO 4 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 

I hhhl III 111 lllll I I Ihhllll 
g044 FDNGGQLHAWGGLRFAAE KFFFAAAVAHX 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 87>: 

g046.seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGC CCGCCGCgcc gCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC gaATATGGAA AGGCTGCCGt TTTcGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TtcgctGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGagaGCGCG AGcagcaagt cggcatcttC 

351 CgcgccggcG Cgttataatg tgAAGGGGGA TGCGccgttg ccgaAAACGG 

4 01 TTTGGacatc gaggcggctg CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

451 TCGATAAcgg TTACGTCGTT GTTGGTGATG GCGGCAAGGT TTTGCGCGAC 
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501 GGTAGAACCT ACCTGCCCGT TGCCTAAAAT GAGGATTTTC ACGGTATGGG 

551 TCGCCGGGTG A 

This corresponds to the amino acid sequence <SEQ ID 188; ORF 046.ng>: 

g046 .pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RYNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLVM AARFCATVEP TCPLPKMRIF TVWVAG* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 189>: 

mC46 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAsT CCGC.r.sGC gCGcCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAAC3AG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTG3AA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

401 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

451 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 190; ORF 046>: 

m046.pep 

1 MSAMLRPTSX PXXRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 S I TVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 191>: 

a046. seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGT CCGCCGCGCC GCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

401 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

451 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 192; ORF 046.a>: 

a046.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SI TVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 

»046/a046 98.4% identity over a 186 aa overlap 

10 20 30 40 50 60 

m04 6.pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGOSIRPASCSVTSCSGLMVSVMPNME 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
a04 6 MSAMLRPTSS PPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
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10 



20 



30 



40 



50 



60 



70 80 90 100 110 120 

m04 6 . pep RLPFSLFSSL3LRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 
I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | 
a046 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 6 .pep RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 
I M I I I I I I I I I I I I I i I I I I I I II I I I I I I I I I I I 1 I I I I I I I I I I I I I I I (I I i I ! i I 
a04 6 RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6.pep 
a046 



TVWVAEX 
I I I I I I I 
TVWVAEX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 046 shows 97.3% identity over a 185 aa overlap with a predicted ORF (ORF 046.ng) 
from N. gonorrhoeae: 
m046/g046 

10 20 30 40 50 60 

m04 6 .pep MSAMLRPTSXPXXRACh^IRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

lllllllll I MIMIIIIIIIIIIIMIIMIIMIIIIIIIIIIIIIIIIIMII 

g046 MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
10 2C 30 40 50 60 

70 80 90 100 110 120 

m046.pep RL P FS L FSSLGLRYS RYSLERTRAMRPGMLNRSAATLQATMLVS SLRES AS S KSAS SAP A 

lllllll! Ill lllllllllllll lllllllll IIIIMIIIII lllllllll lllllll 

g046 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 
70 80 90 100 110 120 

130 140 150 160 170 180 

m04 6 .pep RSNVKGI^LPKTWTSRRLPVSCNAFSSMSITVTSLLGMAAJ^FCATVEPTCPLPKMRIF 

I IIIIIIIIIIIIIIIIIIIMMIIIIMIMMII lllllllllllllllllim 

g046 RYNVKGDAPLPKTVWTSRRLPVSCNAFSSMS I TVTSLLVMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6 .pep 



g046 



TVWVAEX 

Mill 
TVWVAGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 193>: 

g047 .seq 

ATGGTCATCA TACAGGCGcg gcGCGGCGGG CTGCTTGTCG GACGCAGCAT 
TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 
CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 
ATCGAAGGCG ACGAAATCCT GTTTGCCGCC GCCGCCG AAA ACATCGGGGC 
GGTCATACCC gaATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 
TTGCCGGCGG CGGCAACATc tgctACCGCC TCGCCAAGCA GCTCGAACAC 



1 
51 
101 
151 
201 
251 
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301 GCATAcaacG TCAAAATCAT CGAATGCCGG CCGCGCcgtg ccgaATGGAT 

351 AGCCGAAAAC ctcgAcaaCA CCCTCGTCCT GCAAGGTTCG Gcaaccgacg 

401 aAaccctgct cgAcaacgaa tacatcgacg aaatcgaCGT ATTCTGCGCC 

451 CTGACCAACG ACGACGAAAG CAACATT A C g tCCGCCCTTT TGGCGAAAAA 

501 CCTcggcgCG AAGCgcgtca tcggCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CG3CGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCGCACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGTGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAGAAACT C 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 194; ORF 047.ng>: 

g047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI CYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA . 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK 1D IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

3 01 IQVKMGFFG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 195>: 

xn047.seq 

1 ATGGTCATCA TACAGgCGcG C . . syGCGGA sTGCTTGTCG GACG CAG CAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CCAAAGAAAC CAGCcCmgmm 

251 GcATCATGAT TkCCGGCGGC GGCAACATCG GCTACCGTCT CGCCAAGCAG 

3 01 CTCGAACACG CATACAACGT yAAAATCATC GAATGCCGGC CGCGCCGTGC 

3 51 CGAATGGATA GCCGAAAACC TCGACAACAC CCTCGTCyTG CAAGGTTCGG 

4 01 CAACCGACGA AACCCTGCTC GACAACGAAT ACATCGACGA AATCGACGTA 
451 TTCTGCGCCC TGACCAACGA CGACGAAAGC AACAT TATGT CCGCCCTTTT 
501 GGCGAaAAAC CTCGGCGCGA AGCGCGTCAT CGGCATCGTC AACCG CTCAA 
551 GCTACGTCGA TTTGCTCGAA GGCAACAAAA TCGACATCGT CGTCTCCCCC 
601 CACCTCATCA CCATCGGCTC GATACTCGCC CACATCCGGC GCGGCGACAT 
651 CGTTGCCGTC CACCCCATCC GGCGCGGCAC GGCGGAAGCC AT CGAAGTCG 
701 TCGCACACGG CGACAAAAAA ACTTCCGCCA TCATCGGCAG GCGCATCAGC 
751 GGCATCAAAT GGCCCGAAGG CTGCCACATT GCCGCCGTCG TCCGCGCCGG 
801 AACCGGCGAA ACCATTATGG GACACCATAC CGAAACCGTC ATCCAAGACG 
851 GCGACCACAT CATCTTTTTC GTCTCGCGCC GGCGCATCCT GAACGAACTG 
901 GAAAAACTCA TCCAGGTCAA AATGGGCTTT TTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 196; ORF 047>: 

m047 .pep 

1 MVIIQARXXG XLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETQRN QPXXIMIXGG GNIGYRLAKQ 

101 LEHAYNVKI I ECRPRRAEWI AENLDNTLVL QGSATDETLL DNEYIDEIDV 

151 FCALTNDDES NIMSALIAKN LGAKRVIGIV NRSSYVDLLE GNKI DIWSP 

201 HLITIGSILA HI RRGDIVAV HPIRRGTAEA IEWAHGDKK TSAIIGRRIS 

251 GIKWPEGCHI AAWRAGTGE TIMGHHTETV IQDGDHIIFF VSRRRILNEL 

301 EKLIQVKMGF FG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 197>: 

a047 .seq 

1 ATGGTCATCA TACAGGCGCG GCGCGGCGGA CTGCTTGTCG GACGCAGCAT 
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51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

2 31 GGTCATACCC GAATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATC GGCTACCGTC TCGCCAAGCA GCTCGAACAC 

331 GCATACAACG TCAAAATCAT CGAATGCCGG CCGCGCCGTG CCGAATGGAT 

351 AGCCGAAAAC CTCGACAACA CCCTCGTCCT GCAAGGTTCG GCAACCGACG 

4 31 AAACCCTGCT CGACAACGAA TACATCGACG AAATCGACGT ATTCTGCGCC 

4 51 CTGACCAACG ACGACGAAAG CAACATTATG TCCGCCCTTT TGGCGAAAAA 

501 CCTCGGCGCG AAGCGCGTCA TCGGCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCACACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

7 51 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GG AC AC CAT A CCGAAACCGT CATCCAAGAC GGCGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAAAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 198; ORF 047. a>: 

a047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI GYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 

m047/a047 96.5% identity over a 3 12 aa overlap 

10 20 30 40 50 60 

m047.pep MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA 
M I I I i I I I I I I I I I I I I II I I I I i I I I ! I I I I I I I I I I II I I i I I I I I I I I I I I I I 
a C 4 7 MVI IQARRGGLLVGRS I ADIAQDLPDGADCQI CAVYRNNRLI VPAPQTVI I EGDE ILFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

mC4 7.pep MENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 
I I II III! II i II I IN: : III I I I I I! II I I I I I I I I 1 I I I I I I I I I I M I I I 
aC47 AAENIGAVIPELRPKETSTRR IMIAGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 

70 80 90 100 110 



130 140 150 160 170 180 

m0 4 7 . pep AENLDNTLVLQGSATDETLLDNEYI DEI DVFCALTNDDESN IMSALLAKNLGAKRVIGIV 

I i I I I I I i I i : 1 1 I I I I 1 1 I I I I I I 1 I I I M 1 1 I I I I I 1 1 I 1 1 I I I I I I I 1 1 I 1 1 I M I I 

a047 AENLDNTLVLQGSATDETLLDNEYI DEI DVFCALTNDDESN IMSALLAKNLGAKRVIGIV 

120 130 140 150 160 170 

190 200 210 220 230 240 

m047.pep NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHFIRRGTAEAIEVVAHGDKK 

I I If 1 1 I I I I ! 1 1 I I I I I I 1 1 I I I I II I I II I I I I I I I 1 1 I 1 1 I I I I I I I 1 1 I 1 1 I I I I I 
a047 NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAISWAHGDKK 

180 190 200 210 220 230 



250 260 270 280 290 300 

m047 .pep TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIC'DGDHIIFFVSRRRILNEL 

I I 1 1 I I II 1 1 I 1 1 I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I 1 1 I I M 1 1 I I I M | I | 
a04 7 TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 

240 250 260 270 280 290 



m04 7 .pep 



310 

EKLIQVKMGFFGX 
I I I I I II I II I I I 
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a 0 4 7 EKL I QVKMG FFGX 

300 310 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 047 shows 96.2% identity over a 3 12 aa overlap with a predicted ORF (ORF 047.ng) 
from K gonorrhoeae: 

m047/g045 

m04 7 .pep MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA 6 0 

III MM I I II 1 1 II I! I M I M I M M I M 1 1 1 1 1 1 M M M 1 1 M M M M I M I 

g04 7 MVI IOARRGGLLVGRSI ADI AQDLPDGADCQICAVYRNNRL:VPAPQTVI IEGDEILFAA 6 0 

m047 .pep AAENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 120 

I I I I II I I I I I II I II I • : III Mill IIIIIIIIIIIIIIIIIIIIIMMI 
g047 AAENIGAVIPELRPKETSTRR- - - IMIAGGGNICYRLAKQLEHAYNVKIIECRPRRAEWI 117 

m04 7 . pep AENLDNTLVLQGSATDETIiLDNEYIDEIDVFCALTNDDESNIMSAIjLAKNLGAKRVIGI V 130 

I I i 1 1 M M M I f 1 1 1 1 1 1 1 M 1 1 1 i II > M 1 1 M I [ II 1 1 f 1 1 1 11 1 M II I M 1 1 1 1 1 

g047 AENLDNTLVLQGSATDETLLDNEYIDE IDVFCALTMDDESNIMSALLAKNLGAKRVIGI V 177 

m047 .pep NRSSYVDLLEGNKIDIWSPIILITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 240 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 ! I 

g047 NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 237 

m047 .pep TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 300 

I M 1 1 1 II 1 1 1 1 M 1 1 tl 1 1 II I M 1 1 i 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 i I f 1 1 1 1 [ 1 1 1 1 1 1 1 . 

g047 TSA I IGRR I SG I KW PEGCH I AAWRAGTGETI MGHHTETVI QDGDH 1 1 FFVS RRRI LNEL 297 

m047.pep EKLIQVKMGFFGX 313 

I I I I I I I I I I II I 
g047 EKLIQVKMGFFGX 310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 199>: 
g048 . seg 

1 ATGCTCGACA AAGGCGA3GA GTTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTc ggcCCcgTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCCG 

101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTTACCCG CCAAATGCTC 

151 AAACAAACCG GCCTCTTGGG CATGATCGGC AAATCCGagc gcgGcgcggc 

201 cacctGCGAA GCcatCGCCG ACAACAAGGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC ATACCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGTAT GGAAGCCGTT TACGAATTTG AAGTCAAAGA 

351 TATGCCCGTA ACCGTCGCCG TGGACAGCAA AGGCGAATCC ATCCACGCCA 

401 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAGTCT 

451 TGA 

This corresponds to the amino acid sequence <SEQ ID 200; ORF 048. ng>: 
g04 8.pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 KQTGLLGMIG KSERGAATCE AIADNK AVYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAV YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 201 >: 

m04 8 .seq 

1 ATG CTCAACA AAGG CGAAG A ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCGG 
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101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGTGGC 

2 01 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 
251 GCGGCGCGGC GTAICTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

3 01 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

3 51 CATGCCCGTA ACCGTCGCCG TAGATAGCAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAATCT 
4 51 TGA 

This corresponds to the amino acid sequence <SEQ ID 202; ORF 048>: 

m048 .pep 

1 MLNKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 EQTDLLGMIG KSERGVATCE AIADNKA VYL MAVGQAAyLV AKAI KSSKVL 

101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 203>: 

a048.seq 

1 ATGCTCGACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGACGAAATC GTCGGCCCAG 

101 CAGGTCCGAC CACCGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGCGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

351 CATGCCCGTA ACCGTCGCCG TAGACAGCAA AGGCGAATCC ATCCACGCCA 

401 CCGCCCCGCC CCAATGGCAG GCGAAAATCG GCATCATCCC CGTCAAATCT 

451 TGA 

This corresponds to the amino acid sequence <SEQ ID 204; ORF 048.a>: 

a04 8.pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEI VGPAGPTTAT RMDKFTRQML 
51 EQTDLLGMIG KSERGAATCE AIADNKA VYL MAVGGAAYLV AKAI KSSKVL 
101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPPQWQ AKIGIIPVKS 
151 * 

m048/a048 96.0% identity over a 150 aa overlap 

10 20 30 40 50 60 

m04 8 . pep MLNKGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

I I : I M I I I t I I I I II I I I I I II I I II I I : II I I I I I I I I I I I I I I I I I I I I I I I I I I | | 
a048 MLDKGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 
10 20 30 40 50 60 

70 80 90 100 110 120 

m04 8 . pep K SERG VAT CE A I ADN KAV Y LMA VG G AA Y LV AKA I KS SKVLA FP E LGMEAI Y E FE VKDM P V 

1 1 1 1 1 - 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 i 1 1 1 1 f 1 1 1 f 1 1 1 1 1 1 1 1 1 r i i i f i f 1 1 1 1 1 1 1 1 1 1 1 1 

a04 8 KSERGAATCEA I ADNKAV YLMAVGGAAYLVAKAI KSSKVLAFPELGMEAI YE FEVKDMPV 

70 80 90 100 110 120 

130 140 150 

m04 8 .pep T VAV D SKG E S I H ATA PRK WQAK I G 1 1 P VES X 

I I I I I I I I I I I I I I II : I I I I I I I ! II : I I 
a04 8 TVAVDSKGES I HATAPPQWQAKIG 1 1 PVKSX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 048 shows 96.4% identity over a 150 aa overlap with a predicted ORF (ORF 048.ng) 
from N. gonorrhoeae: 
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m048/g048 

10 20 30 40 50 60 

m04 8 . pep MLNKGEELPVDFTNRLI YYVGPVDPVGDEWGPAGPTTATRI4DKFTRQMLEQTDLLGMIG 

IIMIIIIMMMIIMIMIIIIIMIIIIIIIMIIIIIIMMMIMI llllll 

g04 8 MLDKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRI^DKFTRQMLKQTGLLGMIG 
10 20 30 40 50 60 



70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

III lllllllllllllllll III II IhllllMII i| 

g04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 

70 80 90 100 110 120 



130 140 150 

m048.pep TVAVDSKGESIHATAPRKWQAKIGIIPVESX 

III II Ml I II I I III Ml III III II II II 
g04 8 TVAVDS KGES I HATAPRKWQAKI GI I PVESX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 205>: 

g049 . seq 

1 ATGCGGGCGC AGG CGTTTGA TCAACCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG GATATTGATT 

101 TGGACGGGCA TCAACGCCTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCGG ATTCTGCCGC ATCGGCGTTT TCCCCGCCCT 

201 CAATCTGTGC GGTTTCAAAT TCGG CACTGT CTTTTTTGGC ATCGAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAAccggca tTTGCAGGGA 

301 AGCCTgcgcg TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCgc tGTTGTGTTC TTCCTGCCAT 

4 01 TTCTTCAGAT ACGCCTTTAA 



This corresponds to the amino acid sequence <SEQ ID 206; ORF 049.ng>: 

g04 9.pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRL FRTAFAVFRN 

51 PVCRRTGFCR IGVFPALNLC GFKFGTVFFG IEPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGAWF FLPFL QIRL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 207>: 



m049 .seq 


(partial) 










1 


ATGCGGGCGC 


AGGCGTTTGA 


TCAGCCGTTC 


GGTCAGCTCC 


TGTTCGGACA 


51 


GGCAGAACAC 


TTCGCGCCGG 


TTGACGGCTT 


TCGGGTTCAG 


GATATTGATT 


101 


TGGACGGGCA 


TCAACGTTTC 


TTCCGCATCG 


TTTTCCCCGT 


TTTCCGAAAC 


151 


CGCCGGCTCA 


TTCGTGCCGG 


ATTCTGCCTC 


GTCGGCGTTT 


TCCCCGCTTT 


201 


CAATCTGTCC 


GGTTTCAAAT 


TCGACACTGT 




ATCAAACCGG 


251 


ATTCTCCGCC 


GCGATTCGAT 


GTGTTTTTCC 


GAAACCGACA 


TTTGCAGGGA 


301 


AGCCTGCGCG 


TTGAGCCAGT 


TTTCCTGAAG 


GACGATCATC 


GGGTCGGTTT 


351 


CGACTTCCTC 


GCCGCAATCG 


GCAACGGCGG 


CATTGTGTTC 


CTCCTGCCAT 


401 


TTTTTCAGAT 


ACGCCTT . . . 









This corresponds to the amino acid sequence <SEQ ID 208; ORF 049>: 

m049.pep (partial) 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRF FRIVFPVFRN 

51 RRLIRAGFCL VGVFPAFNLS GFKFDTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGGIVF LLPFF QIRL. . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 209>: 

a049.seq 
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1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG AATATTGATT 

101 TGGACGGGCA TCAACGCTTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCCG ATTCTGCCGC ATCGGCGTTT TCCCCGCCTT 

201 CAATCTGTCC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

401 TTTTTCAGAT ACGCCTT 



This corresponds to the amino acid sequence <SEQ ID 210; ORF 049.a>: 

a OA 9. pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ NIDLDGHQRF FRTAFAVFRN 

51 PVCRRTRFCR IGVFPAFNL3 GFKFGTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGGIVF LLPFF QIRL 

m049/a049 90.6% identity over a 139 aa overlap 



10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

I I M I I I I I I I! I I I I I I 1 I I I I I I I I I I I : I I I I I I I I I I I : I Mil I : II 
a04 9 M RAQA FDQ P FGQL L FG QAE H FA PVDGFRVQN I DLDGHQRFFRTAFAVFRN PVCRRTRFCR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 9 . pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 
: I M I I I I I I M I 1 I I I I I I II I I I I I I 1 I I I I I I II I I I I I I I I I I I I I I I | | | | | | | 
a04 9 IGVFPAFNLSGFKFGTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

70 80 90 100 110 120 

130 139 
m04 9.pep AAIGNGGIVFLLPFFQIRL 
I I I I I I I I I I I I I I I I I I I 
a04 9 AAIGNGGIVFLLPFFQIRL 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 049 shows 86.3% identity over a 1 39 aa overlap with a predicted ORF (ORF 049.ng) 
from N. gonorrhoeae: 

m049/g049 



10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRI VFPVFRNRRLIRAGFCL 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 >\ 1 1 1 1 hill 

g 0 4 9 MRAQAFDQ P FGQLL FGQAEH FAPVDG FRVQD I DLDGHQRLFRTAFAVFRNPVCRRTCFCR 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 9 . pep VGVFPAFNLSGFKFDTVFFGI KPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

-11111 = 11 1 1 M lllllhllllllll II ! Ill II I ! II 1 1 II III I II II! MM 

g049 IGVFPALNLCGFKFGTVFFGIEPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 
70 80 90 100 110 120 



130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 
M II ll-lhllhl I II 
g 04 9 AAIGNGAWFFLPFLQIRLX 

130 140 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 21 1>: 

g050.seq 

1 atgggcgCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGg 

51 cacgcccGAA AAAGccgtgt TGATGGcaaA AGAATCCCTG ATGAGCCACA 

101 TCGAcatCca aGaATTGCAG GAAAAAGCCG CGTccggggc ggaattgtcc 

151 accaccgaAG cc CTGCGCCT cGAACTCTTT GAAAAGGTCA ACGCGCTGGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 T CCTCG ATT A CCCGACCCAT GCCGCCTCCA AACCGATTGC CATGATTCCC 

3 01 AACTGTGCcg ccacCCGcca cgtcgAATTT GAATTGgACG GCTCAGGtCC 

351 TGTCGAactC acgccGCcgc gtgtCGAAGA CTGA 



This corresponds to the amino acid sequence <SEQ ID 212; ORF 050.ng>: 

gOSO.pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAAS GAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVED* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 213>: 

m050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGTATCG GCATCGGCGG 

51 C.agCCgAA AAAGCCGTGC TGATGGCAAA AGAGTC CCTG ATGAGCCACA 

101 TCGACATTCA AGAATTGCAG GAAAAGGCCG CGTCCGGCGC GgAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTGGG 

201 CATCGGCGCA CAAGGCTTGG GCGGACTGAC CACCGTGTTG GACGTGAAAA 

2 51 TCCTCGATTA TCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA TGGCCCGATT TGA 



This corresponds to the amino acid sequence <SEQ ID 214; ORF 050>: 

mOSO.pep 

1 MGAGWCPPGI LGIGIGGXAE KAVLMAKESL MSHIDIQELQ SKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDGPI * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 215>: 

a050. seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGG 

51 TACGCCCGAA AAAGCCGTGT TGATGGCGAA AGAATCCCTG ATGAGCCACA 

101 TCGACATCCA AGAATTGCAG GAAAAAGCCG CGTCCGGCGC GGAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTAGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA CTGGCCC 

This corresponds to the amino acid sequence <SEQ ID 216; ORF 050.a>: 
aOSO.pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDWP 



m05o/a050 97.7% identity over a 129 aa overlap 

10 20 30 40 50 60 

mOSO.pep MGAGWCrPG I LGIGIGGXAE KAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

IIIIIMIIMIIIItl: ililllMIIIHII || I I I [ 1 M | | | | | | | | | I | 

a050 MGAGWC PPG I LG IG IGGTPEKAVLMAKE SLMSH I DI QELQEKAASGAELSTTEALRLELF 
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10 



2C 



30 



40 



50 



60 



70 80 90 100 110 120 

m050 . pep EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

I I I I I I I I I I I 1 1 I I I I I ! 1 1 I I I 1 1 1 I I I 11 1 I I I I i 1 1 I I I I I I I I I I | | | | | | | | | | 

a050 EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 
70 80 90 100 110 120 

130 

mOSO.pep TPPRVEDGPIX 

I I I I f I I I 
a050 TPPRVEDWP 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 050 shows 98.4% identity over a 127 aa overlap with a predicted ORF (ORF 050.ng) 
from N. gonorrhoeae: 

m050/g050 

10 20 30 40 50 60 

mO50 .pep MG AG W C PPG I LG I G I GGXAEKAVLMAKE S LMSH I D I QELQ E KAASGAE LS TTE ALRLE LF 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M IIIIIIMIIMIIIIIIIIIIIIIIMIIIIIIIIIIIII 

g050 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRIiELF 
10 20 30 40 50 60 

70 80 90 100 110 120 

m050.pep EKVNALGIGAQGLGGLTTVLDVKI LDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

II II IMMI Mllll III II II IMIII III II! Ill Ml 1 1 II I II i MM II ! II 1 1 

g050 EKVNALGI GAQGLGGLTTVLDVKI LDYPTHAASKPI AMI PNCAATRHVEFELDGSGPVEL 

70 80 90 100 110 120 



m050 .pep 
g050 



130 

TPPRVEDGPIX 

MIIIM 
TPPRVEDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 217>; 

g050-l.seq 

1 ATGACCGTTA TCAAGCAAGA AG ACT TT ATT CAAAGTATCT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCAAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCGGA CATGAGCGTG GAAAAGATGG TTAACGAAGG CGTACGCCGC 

301 GCCTACACTT GGGAAGGCAA CACCCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCACA 

4 01 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

4 51 GGCGGCTCTG AAAACAAATC CAAACTCGCT ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACGATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGCAcgcC CGAAAAAGCC 

601 GTGTTGATGG cgaAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCC7CGAACT CTTTGAAAAG GTCAACGCGC TGGGCATCGG CGCGCAAGGC 

7 51 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCATGCCGCC TCCAAACCGA TTGCCATGAT TCCCAACTGT GCCGCCACCC 

851 GCCACGTCGA ATTTGAATrG GACGGCTCAG GTCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGACC CGATCTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGATAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCGCACAAAC GCCTCGTCAA TATGCTCGAC AAAGGCGAGG AGTTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCC GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTTACCC GCCAAATGCT CAAACAAACC GGCCTCTTGG GCATGATCGG 
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1251 CAAATCCGAG CGCGGCGCGG 

1301 CCGTGTACCT CATGGCAGTC 

1351 ATCAAATCTT CCAAAGTCTT 

1401 TTACGAATTT GAAGTCAAAG 

1451 AAGGCGAATC CATCCACGCC 

1501 GGCATCATCC CCGTCGAGTC 



CCACCTGCGA AGCCATCGCC GACAACAAGG 
GGCGGCGCGG CATACCTCGT GGCAAAAGCC 
GGCGTTCCCC GAATTGGGTA TGGAAGCCGT 
ATATGCCCGT AACCGTCGCC GTGGACAGCA 
ACCGCCCCGC GCAAATGGCA GGCGAAAATC 
TTGA 



This corresponds to the amino acid sequence <SEQ ID 21 8; ORF 050-1 .ng>: 

g050-l.pop 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDANTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EKMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTI PTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVED*PDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKI LTGRDA 

351 AHKRLVNMLD KGEELPVDFT NRLIYYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLKQT GLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAVYEF EVKDMPVTVA VDSKGESIHA TAPRKWOAK1 

501 GIIPVES* 



g050-l/pl44O7 

sp | Pi 4 407 | FUMB_ECOLI FUMARATE HYDRATASE CLASS I, ANAEROBIC (FUMARASE) 
>gi 1280063 Ipir I IB44511 fumarate hydratase (EC 4.2.1.2) fumB, iron- dependent - Escherichia coli 
>gi 1146048 (N27059) anaerobic class I fumarase (EC 4.2.1.2) [Escherichia coli] Length = 548 

Score » 172 bits (432), Expect = 4e-42 

Identities - 138/488 (28%), Positives = 216/488 (43%), Gaps = 22/488 (4%) 



Query: 11 QSICDAFQFI SYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAENNRPICQDTG 70 

Q-t DA + H K L+ E + K Q L NS + A+ P CQDTG 
Sbjct: 53 QAFHDASFMLRPAHQKQVAAILHDPEASEND KYVALQFLRNSEIAAKGVLPTCQDTG 109 

Query: 71 IATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGKRQNTKDNTPA 130 

A + KG V W E+ YEN + A K NT N PA 

SbjCt: 110 TAIIVGKKGQRV-WTGGGD-EETLSK3VYNTYI-EDNLRYSQNAALDMYKEVNTGTNLPA 166 

Query: 131 VIHMSIVPGGKVEVTCAAKGGGSENKSKL AMLNPSDNIVDWVLKTIPTMGAGWCP 185 

1 + V G + + C AKGGGS NK+ L A+L P + +++++ + T+G CP 

Sbjct: 167 QIDLYAVDGDEYKFLCVAKGGGSAKKTYLYQETKALLTPG-KLKNFLVEKMRTLGTAACP 225 

Query: 186 PXXXXXXXXXTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEKVNXXX 245 

P T + L + +H EL + + L EL E+ 

Sbjct: 226 PYHIAFVIGGTSAETNLKTVKLASAHY-YDELPTEGNEHGQAFRDVQLEQELLEEAQKLG 284 

Query: 246 XXXXXXXXXTTVLDVKILDYPTHAASKPIAMI PNCAATRHVEFELDGSG PVELTPP 301 

D++++ P H AS P+ M +C+A R+++ +++ G +E P 

Sbjct: 285 LGAQFGGKYFAH-DIRVIRLPRHGASCPVGMGVSCSADRNIKAKINREGIWIEKLEHNPG 343 



uery: 302 RVEDXPDLTYSPDNGKRVDVDKLTKE EVASWKTGDVLLLNGKILTGRDAAHKRLVNM 358 

+ +VD+++ KE ++- + L L G 1+ GRD AH +L + 

Sbjct: 344 QYI PQELRQAGEGEAVKVDLNRPMKEILAQLSQYPVSTRLSLTGTI IVGRDI AHAKLKEL 403 

Query: 359 LDKGEELPVDFTNRLI YYXXXXXXXXXXXXXXXXXTTATRMDKFTRQMLKCTGLLGMIGK 418 

+D G+ELP 4- IYY TTA RMD + + G + M+ K 

Sbjct: 404 IDAGKELPQYIKDHPIYYAGPAKTPAGYPSGSLGPTTAGRMDSYVDLLQSHGGSMIMLAK 463 

Query: 419 SERGAATCEAIADNKAVYLMAVGG-AAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 477 

R +A + YL ++GG AA L *+IK + +A+PELCMEA+++ EV+D P 

Sbjct: 464 GNRSQQ^/TDACHKHGGFYLGSIGGPAAVLAQQSIKHLECVAYPELGMEAIWKIEVEDFPA 523 



Query: 478 
Sbjct: 524 



TVAVDSKG 485 

+ VD KG 
FILVDDKG 531 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 219>: 

mOSO-l.aeq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTATC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCTGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCGGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG AACGTCCAAT 

251 GGGATGCGGA CATGAGCGTG GAAGAGATGG TTAACGAAGG CGTACGCCGC 
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301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCC ATGCTCAATC CTTCCGACAA 

501 CA7CGTCGAT TGGGTATTGA AAACCATCCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGT ATCGGCATCG GCGGCACGCC CGAAAAAGCC 

601 GTGCTGATGG CAAAAGAGTC CCTGATGAGC CACATCGACA TTCAAGAATT 

651 CCAGGAAAAG GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TGGGCATCGG CGCACAAGGC 

751 TTGGGCGGAC TGACCACCGT GTTGGACGTG AAAAT CC TC G ATTATCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCAAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCG GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGTGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGATAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 220; ORF 050- 1>: 

m050-l .pep 

1 MTVIKQEBFI QSICDAFQFI SYYHPKDYID AL-YKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM NVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLK ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLNAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLN KGEELPVDFT NRLI YYVGPV DPVGDEVVGP AGPTTATFJMD 

401 KFTRQMLEQT DLLGMIGKSE RGVATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

m050-l/g050-l 98.2% identity in 507 aa overlap 

10 20 30 40 50 60 

mO 5 0-1. pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

I I I I I I i II I I I I I I I I II I I I I I I I I I I I I II I I I I I I I 1 I I I I I I I I I I I II I I I I I 
g050-1 NTVIKQEDFXQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050- 1 . pep NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLFASVLADPAGK 

II I I I I I I I I I I II I I I I I i: I I ! I I I I I I :| I I I ! I I I I I I I I I I II I I M I I I II I I 
g050-l NNRPICQDTGIATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 50-1. pep RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I II I I I I I I I I I I I t I I I I I I I I I I I I I i I i I I I i I I I I I I I I I I I I I I I I I I I I I I I I I 
g050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m050-l . pep AGWGPPGILGIGIGGTPEKAVLMAKESLHSHlOIQtLUEKAASGAELSTTEAlaRLELFEK 
I I I I I I I I I ! I I I I I I I I I I I I II I I I I I I ! I I I I I I II I I I I I I I I I II I I I I I I I I I I 
g050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

mO 50-1 .pep ' VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
g050-l VNALGIGAQGLGGLTTVLDVK I LDYPTHAASKPIAMI PNC AATRHVEFEL DGSGPVELTP 

250 260 270 280 290 300 
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310 320 330 340 350 360 

mOSO-1 . pep P R V E DW P DLT Y S PDNGK RV DV DK LT KE E V AS WKTG DVLLLNGKILTGR DAAHK RL VDML N 

Mil) i I M I I I I H I II t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I :| I : 

g050-l PRVEDXPDLTYSPDNGKRVDVDK LT KE EVAS WKTG DV LL LNG K I LTG RD AAH K RLVNML D 

310 320 330 340 350 360 

370 380 390 400 410 420 

mO 5 0-1 .pep KGEELPVDFTNRLI YYVGPVPPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
I I I I I I I I I I I I II I I I I I I I I I II I I I I I i I I I I I I I I I I I I I I I I : I I I I t I I I I I I 
g050-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLKQTGLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

m050-l . pep RGVATCEAIADNKAVYLMAVSGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 
I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II :| I I I I I I I I I I I I 
g 050-1 RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
mObO-1 . pep VDSKGESIHATAPRKWQAKIGIIPVESX 
I I II I I I I I I I I 1 I I I I I I I I II I I II 
g050-l VDSKGE3IHATAPRKW0AKIGIIPVESX 

490 500 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 22 1>: 

a050-l.seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AACCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGCAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGATACCGGT ATCGCGACCG TGTTTTTGAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCAGA CATGAGCGTC GAAGAGATGG TTAACGAAGG CGTGCGCC3C 

301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTTC TCGCCGACCC 

351 CGCCGGCAAA CGCCAAAATA CCAAAGACAA CACGCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGAC AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGTTCTG AAAACAAATC CAAACTCGCC ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATTCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGTACGCC CGAAAAAGCC 

601 GTGTTGATGG CGAAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TAGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCGAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGACGAAAT CGTCGGCCCA GCAGGTCCGA CCACCGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGGC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGACAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC CCCAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCAAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 222; ORF 050-1 .a>: 

a050-l.pep 

1 MTVIKOEDFI OSICDAFQFI SYYHPKDYID ALYKAWOKEE NPAAKDAMTO 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGD KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDNLD KGEELPVDFT NRLIYYVGPV DPVGDEIVGP AGPTTATRMD 

401 KFTRQMLEQT DLLCMICKSE RCAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAI YEF EVKDMPVTVA VDSKGESIHA TAPPQWQAKI 

501 GIIPVKS* 
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a050-l/m050-l 98.4% identity in 507 aa overlap 

10 20 30 40 50 60 

a050-l .pep MTVIKQEDFIQSICDAFOFISYYHPKDYI CALYKAWOKEENPAAKDAMTQILVNSRMCAE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I 1 I II I I ! I I I I I I I I 
m050-l MTVIKQEDFIQSICDAFQFISYYHPKDYICALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

a050-.l .pep NNRPICQDTGIATVFLKVGMDVOWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I I I I I t I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m050-l NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
70 80 90 100 110 120 

130 140 150 160 170 180 

a050-l.pep RQNTKDNTPAVIHM3 IVPGDKVEVTCAAKGGG5ENKSKLAMLNPSDNIVDWVLK7IPTMG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I II I I I I If I I I I I I II I M I I I I I I I 
m050-l RQN T K DNT P AV I HMS IV PGGKVE VTCAAK GGGS EN KS KL AMLN P S DN I V DWVL KT I PTMG 

130 140 150 160 170 . 180 

190 200 210 220 230 240 

a050-l .pep AGWC PPGILGIGI GGTP EK A VLMAK ES LMSI 1 1 D I QELQE KAA S GA EL S TT EAL RLEL FEK 

I I I I II II I I I I I I II I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
m050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRIELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

aG50-l.pep VN ALG I G AQGL GG LTTVL D VK I L DY PTHAAS K P I AM I PN C AAT RH VE FELDGSG P VE LT P 

II I I I i I i I I I I I I I I I I I I I I I I I I I I I I If I i I ! I I II I I I I I I I I I I I I I I i I I I I I 
m050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 

250 260 270 280 290 300 

310 320 330 340 350 360 

aO 50-1 . pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLD 
I I I I I : I I 1 1 I 1 1 I I I I I 1 1 1 1 1 1 1 1 1 1 I I I I I I 1 1 I I I I 1 1 I I I I I I I I I I I I 1 1 ! I I : 
m050-l P R V E DW P DLT Y S P DNGK RV DV DKLT KEEVAS WKTG DVLL LNGK I LTGRD AAHKRL VDMLN 

310 320 330 340 350 360 

370 380 390 400 410 420 

a050-l.pep KGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQKLEQTDLLGNIGKSE 
I I I I I i I I I I I I I I I I I I I 1 I I I I I I : I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
m050-l KGEELPVDFTNRLIYYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGNIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

a050-l.pep RGAATCEAIADNKAVYLMAVGGAAYLVAKAI KSSKVLAFPELGMEAI YEFEVKDNPVTVA 
I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I IN I I I 
mO 50 - 1 RGVATCEAI ADNKAVYLMAVGGAAYLVAKAI KS SKVLAFPELGMEAI YEFEVKDNPVTVA 

430 * 440 450 460 470 480 

490 500 
a050-l .pep VDSKGESIHATAPPQWQAKIGIIPVKSX 
I I I I I I I I I I I I I : I I I I I I I I I I : I I 
mG50-l VDSKGES IHATAPRKWQAKIGI I PVESX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 223>: 

g052 .seq 

1 ATGG CTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGG CGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TT ATT GAT AA AGCCGACGGT CGTGCCGAAC 

3 01 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

3 51 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 224; ORF 052.ng>: 
g052 .pep 
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1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 
51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 
101 RLRLiETTWSP ACRKVKNAA* 

The following partial DIM A sequence was identified in N. meningitidis <SEQ ID 225>: 

m052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCCG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

3 51 OGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 226; ORF 052>: 

m052 .pep 

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 227>: 

a052. seq 

1 ATGGCTTTGG TCGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGAGAGCCG ACAGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCTCCC 

151 AAGGGATTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCGTTTA TATCAGTCGG CGACACGTGA CTCACTTCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAATCAC ATGGTCGCCC GCCTGCAAAA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 228; ORF 052.a>: 

a052.pep 

1 MALVAEETEI SAPCFKG*EP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFI SVG DT* LTSMPNLVTM LLIKPTWPN 

101 RLRLEITWSP ACKKVKNAA* 

m052/a052 95.8% identity over a 1 1 9 aa overlap 

10 20 30 40 50 60 

m052 . pep MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 
I | I || I I I I I I I I I t I I i I I I I I II I I I I II I I I I II I I I I I I I i I II I I I I I I I I II I 
a052 MALVAEETEI SAPCFKGXEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 

70 80 90 100 110 120 

ra052 . pep S L V L ALT AAFH SFISVGDTRLTPMPN L VTM LL I K PT W PNR LRLE T TW S P ACRKVKNAAX 

I I I I I | I I II I I I I I I I I I II I I I I I I I I I I I ! I I I I I I I I I I I I I i I I : I I I I I I I 
3052 SLVLALTAAFHSFISVGDTXLTSMPNLVTMLLIKPTWPNRLRLEITWSPACKKVKNAAX 

70 00 90 100 110 120 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 052 shows 95.8% identity over a 1 19 aa overlap with a predicted ORF (ORF 052.ng) 
from N. gonorrhoeae: 
m052/g052 

10 20 30 40 50 60 

m052 .pep MAL V AE ETE I SAPCFKGCEPTGDSRLLSTT KSAPMPCANS AKASKSATSP KGLDGVSKNS 

M in iii mi mi iiimii iii nit ii in in ii ii i inn i in iiiiiiiiii 
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g0 52 MALVAEETEI SAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m052 .pep S LVLALTAAFHS F I SVGDTWLTSMPNLATMLLI KPTWPNRLRLE I TWS PACKKVKNAAX 

IIIIIIIIIIIIMIIIII II Nihil I II I II III II 111! 1 1 lllhll ill II 

g052 S LVLALTAAFHSF I S VGDTRLT PM PNLVTMLL I KPT W PNRL RLETTWS PAC RKVKN AAX 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 229>: 

g073 .seq 

1 ATGTGTATGC CATACGCAAT AAGGGTTTCA GACGGCATCT GCCGCATTTT 

51 TCCGCCGATG CCGTCTGAAA CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AGTCCGGGGC GGatacCGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT TGGTGATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAAAAGAGT TCCACGACTT TCCACGCGGC TGCTTGGTCG 

3 01 GCGACTTCAA AACCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

351 AAGCTCCGCC TGCGGATGGT CGGGCAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 230; ORF 073. ng>: 

go 73 .pep 

1 MCMPYAIRVS DGICRIFPPM PSETRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 SPGRIPAASF SSGCILPCW VHGLVMV ERT SPRLAVREKS STTFHAAAWS 

101 ATS KPMTMPP PFCCLRISSA CGWSGNPV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23 1>: 

m0 73 . seq 

1 ATGTGTATGC CATATAAGAT AAGGGTTTCA GACGGCATCT GCTGTCCAAT 

51 GCCGTCTGAA ACACGCAATC AGCGTGCGAG TGCCTGTTTC AAATCGTCAA 

101 TCAAATCGCC AACATATTCC AAACCGACCG ACAGGCGCAC CAATC CGGGG 

151 CGGATGTTGG CGGCGAGTTT TTCTTCGGGC TGCATCCTGC CGTGCGTGGT 

2 01 TGTCCACGGG TGGGTAATGG TCGAGCGCAC GTCACCGAGG TTGGCGGTGC 
251 GGGAAAAGAG TTCCACGCCG TCCACAACTT TCCACGCCGC TTCTTGATCG 
301 GCAACTTCAA AGCCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

3 51 AAGCGCCGCC TGAGGATGGT CGGACAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 232; ORF 073>: 
m073 .pep 

1 MCMPYKIRVS DGICCPMPSE TRNQRASACF KSSIKSPTYS KPTDRRTNPG 
51 RMLAAS FSSG CILPCWVHG WVMV ERTSPR LAVREKSSTP STTFHAASXS 
101 ATS KPMTMPP PFCCLRISAA XGWSDNPV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 233>: 

a073. seq 

1 ACGTGTATGT CATATAAGAT AAGGGTTTCA GACGGCATTT GCGGTGTTTT 

51 TCCGCCGATG CCGTCTGAA. CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AATCCGGGGC GGATGTTGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT GGGTAATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAGAAAAGT TCGACGCCGT CCACGACTTT CCACGCGGCT 

301 GCTTGGTCGG CGACTTCAAA GCCGATGACG ATGCCGCCGC C3TTTTGCTG 

351 TTTGCGGATA AGCTCCGCCT GAGGATGGTC GGGTAATCCG GTGTAA 

This corresponds to the amino acid sequence <SEQ ID 234; ORF 073.a>: 

a073.pep 

1 TCMSYKIRVS DGICGVFPPM P3EXRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 NPGRMLAASF SSGCILPCW VHGWVMV ERT SPRLAVREKS STPSTTFHAA 

101 AWSATSKPMT MPPPFCCLRI S3A*GWSGNP V* 

m073/a073 92.3% identity over a 1 30 aa overlap 
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10 20 30 40 50 

m073.pep MCMPYKIRVSDGICC PMPSETRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

II I I I I I I I I II f I I I I : I I I I II t M I I I I M I II I I I I I I II I I I I II I I I I 

a07 3 TCMSYKIRVSDGICGVFPPMPSEXRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

10 20 30 40 50 60 



60 70 80 90 100 110 

m073 . pep SSGCILPCVVVHGWVTWERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRl 
I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I 
307 3 SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAAAWSATSKPMTMPPPFCCLRI 
70 80 90 100 110 120 



120 129 
m07 3 . pep S AAXGWS DN PVX 

I : I I I I I I I I I 
a07 3 SSAXGWSGNPVX 
130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 073 shows 87.0% identity over a 131 aa overlap with a predicted ORF (ORF 073.ng) 
from N. gonorrhoeae: 

ra073/g073 

10 20 30 40 50 

m073 .pep MCMPYKIRVSDGICC PMPSETRNQRASACFKSS IKS PTY S KPTDRRTNPGRMLAASF 

Mill Illlllll IIIIIIHIIIMMIIIIIIMIIIIIMIhlll: llll 

g073 MCMPYAIRVSDGI CRIFPPMPSETRNQRASACFKSSIKSPTYSKPTDRRTSPGRIPAASF 

10 20 30 40 50 60 

60 70 80 90 100 110 

m073 . pep S S G C I LPCVWHGWVMVERTS PRLAVRE KS S TP STT FHAASX S ATS KPMTMP P P FCCLR I 

IIIIIMIIIMI IMIIIIMIMIMIil llllh IIMIIMMMIIMM 

g073 SSGCILPCVWHGLVMVERTSPRIiAVREKSST- - - TFHAAAWS ATS KPMTMP PPFCCLR I 

70 80 90 100 110 



120 129 
mO 7 3 . pep SAAXGWSDNPVX 

hi Ml Mil 

g073 SSACGWSGNPVX 
120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 235>: 

g075 . seq 

1 ATGCCGCCTT ACTTCATCAC CCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTTTCGG GCAACGCTGC GTTTGCCTGT 

151 GCCGCCAAAG CCAGCGGGGC GGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTT ACGAAATTTT 

251 TAAAAAAATG TGTTTGCGGG CTTTGTGAAG GTTTTAGAGA CCGCCTGCCG 

301 GGCCTCTTAA ACTTAATCTT CTTTTTCGTA GAATCCGAAA ATTACAAATT 

351 CCCCGCCTAT CTCTTCCAAT GCCGAGCTAA AAGCGTCTTC ATAGCTGTCA 

401 TATTTACCGG CTGA 

This corresponds to the amino acid sequence <SEQ ID 236; ORF 075 .ng>: 

g075 .pep 

1 MPPYFITLLT MENTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNAAFAC 

51 AAKASGAAVT TASFAPYLRQ VLINFMIFSF TKFLKKCVCG LCEGFRDRLP 

101 GLLNLIFFFV ESSNYKFPAY LFQCRAKSVF IAVIFTG* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 237>: 

xn075.seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAATG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

4 01 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 238; ORF 075>: 

m07S.pep 

1 MPSYFITLLT MENTKSAAKM PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVMDGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 075 shows 65.7% identity over a 137 aa overlap with a predicted ORF (ORF 075.ng) 
from N. gonorrhoeae: 
m075/g075 

10 20 30 40 50 60 

MPS Y F I T LLTMENT KS AAKM P TT I QPAS I PS AFAAS XAF FAV S GNV AFACAAKARGAAVT 

II IMIIIIIIIIIIIII IMIIIIMIMIIMIIMI Nihil MINI Mill 

M P P Y F I T LLTMENT KS AAKT PTT I QPAS I PSAFAAS KAFF AVS GNAAFACAAKASGAAVT 
10 20 30 40 50 60 

70 80 90 100 110 

TASFAPYLRQVLINFMIFSF KKC LAVMDG AF FRRP PNI R KSVFQKS EYD KFVLVAD 

llllllllllllllllllll III: : =1 I l== = H | : | 

TASFAP YLRQVLINFMI FSFTKFLKKCVCGLCEGFRDRLPGLLNLI FFFVESENYKFPAY 
70 80 90 100 110 120 

120 130 
FFQTCVNRFFEWEI IGIGDX 
:|| :: | t| . | 
LFQCRAKSVFI AVI FTGX 
130 



nt07 5 .pep 
g075 

tn075.pep 
g075 

m075 .pep 
g075 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 239>: 

a075 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAAGA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCTGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 240; ORF 075.a>: 

a075 .pep 

1 MPSYFITLLT MEKTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVM DGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 
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m075/a075 98.5% identity over a 136 aa overlap 

10 20 30 40 50 60 

m075 . pep MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 
I I I I I I I 1 II I I: I I I I I I I i I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I II I 
a075 MPSYFITLLTMEKTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 

10 20 30 40 50 60 

70 80 90 100 110 120 

m075 . pep TASFAPYLRQVLINFMIFS FKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I 
a075 TASFAPYLRQVLINFMIFS FKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKF\/LVADFFQT 

70 80 90 100 110 120 

130 

m075.pep CVNRFFEVVEIIGIGDX 
I I I I I I I I I I I I I I I I I 
a07 5 CVNRFFEWEI IGIGDX 

130 

The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 24 1>: 

g080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAAGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA TACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTTGCA CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAA TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAAcgaGA TGAAACGCCT CCgGCTTTTT 

601 ACcgAAGCGT GGCAGCATCT gttgcGTAAG AATAAAAATC GGTTATCCTA 

651 TGTGGATATG Aggtataagg acggatttTC agtcccccat gctCCCGACG 

701 GTTTACCCGA AAAAGAATcc gAAGAATatt gggaacaggt ttgggacata 

751 ttacggcctg gcgtcggaaa cggttcgacg caaatttcaa tcagttatAA 

801 GGGCAGacga acaatgqaac AGcagtaa 

This corresponds to the amino acid sequence <SEQ ID 242; ORF 080.ng>: 

g080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLAA SGLVW FYNSNHLPVK QVS LKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWNWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVPH APDGLPBKES EEYWEQVWDI 

251 LRPGVGNGST QISISYKGRR TMEQQ* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 243>: 

m0 80 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGACATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 
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601 ACCGAAGCGT GGCAGCATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 
6 51 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTTCCGACG 
701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 2441 ; ORF 080>: 

m080 -pep 

1 MWDNAEAMER LTRWLLVMMA KLLAA SGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKTLGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARS AW I WL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY ASDGLPEKES EE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 080 shows 97.9% identity over a 242 aa overlap with a predicted ORF (ORF 080.ng) 
from N. gonorrhoeae: 

mC80/g080 

10 20 30 40 50 60 

MWDNAEAMERLTRWLLVMMAMLIA^ 

1 1 II I II I IIMI 1 1 1 1 llllll! II! Ill 111 1 1 1 1 1 II III III II I Ml III 

MWDNAEAl^RLTRWLLVMMAMLI^ 

10 20 30 40 50 60 

70 80 90 100 110 120 

KEY I HGN I LRTDI NGAQEAYRRYP W I AS VMVRRRFPDTVE WLTERKPVARWGDHALVDG 

I III 1 II I Mi III 1 1 1 MM IN III MM II 1 1 1 1 III III 1 1 1 1 II III I II I ! II I 

KEY I HGN I LRTDI NGAQEAYRRYPW I AS VMVRRRFPDTVE WLTERKPVARWGDHALVDG 
70 80 90 100 110 120 

130 140 150 160 170 180 

EGNVFEARIjDRPGMPVFRGAEGTSAEMLRRYDEFSTVIJIKQGLGIKE 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 i 1 1 1 1 1 1 i 1 1 1 II I III 

EGNV F E ARLDRPGMP V FRGAEGTS AEMLRR Y DE FS T VLAKQG LG I KEMT YT ARS AWNVVL 
130 140 150 160 170 180 

190 200 210 220 230 240 

DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYA3DGLPEKES 

I Ml II II MMM II MM I II M II III III III 1 1 1 1 1 M III II M llllllll 

DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKWRLSYVDMRYKDGFSVPHAPDGLPEKES 
190 200 210 220 230 240 



EEX 
II 

EEYWEQVWDI LRPGVGNGSTQI S I S YKGRRTMEQOX 
250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 245>: 
a080. seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTAGTTTAT 

151 TCCGATAAGA AAGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGTTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

4 51 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 



m080 .pep 
080 

m080.pep 
080 

m080 .pep 
080 

m080 .pep 
080 

m080 .pep 
080 
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551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 

601 ACCGAAGCGT GGCAACATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 

651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTCCCGACG 

701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 246; ORF 080.a>: 

a080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFFDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA E3TSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIVVL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY APDGLPEKES EE* 

moso/a080 99.2% identity over a 242 aa overlap 

10 20 30 40 50 60 

mOBO . pep MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKTLGSLA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I i I I I I I I I I I I I I II I I I I : I I I I I 
aOBO MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKALGSLA 
10 20 30 40 50 60 

70 80 90 100 110 120 

inOSO . pep KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 
I I I I I I I M I 1 1 I 1 1 I I I I I I I II I I I I I I I I I 1 1 1 1 I M I I II I I I I I I I I 1 I I I I I I I 

a090 KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 
70 80 90 100 110 .120 

130 140 150 160 170 180 

m080 . pep E GN V FE ARL DR PGM P V FRG AE GT S AEMLRR Y DE FS T VLAKQG LG I KEMT YT ARS AW I W L 

I I I I I I I I I I I I I I I I t I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a090 EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIWL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m080 . pep DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I \ I I I I I I I I I I I I I I I I I I I I I I I 
a080 DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYAPDGLPSKES 

190 200 210 220 230 240 



m080.pep EEX 
I I ! 

a080 EEX 



The following partial DNA sequence 



g08l .seq 






1 


ATGAAACCAC 


TGGACCTAAA 


51 


GCCGTCTGAA 


AACAAACCCG 


101 


TTCGGGAAGG 


CGATGTGTTT 


151 


CATGATTTTG 


TTGGAGGCGT 


201 


TTCGCGCGAA 


GATTGCGCGG 


251 


CGCTTGCCGC 


GTTGCAAACG 


301 


CCGTTTGTGT 


TCGGCATTAC 


351 


GATGCTGGCT 


GCGGTATTGC 


401 


CGACGGCAGG 


CAACTTCAAC 


451 


AAATtaaAcg 


aAAAACACCG 


501 


TTTTGGcgaa 


ctggcggtTt 


551 


TGGtcaACAA 


CGCCCTGCGC 


601 


GGCGATATTG 


CCAAAGcgaa 


651 


CGGCATGGCA 


CTGATTCCTC 


701 


CGGCAACGTT 


TAATTTGAAT 


751 


GTCCGCGCGG 


AAAATATCGT 



identified in N. gonorrhoeae <SEQ ID 247>: 

TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 
TGTCGCGCAT CGTAACCGAC AGCCG CGATA 
TTCGCATTGG CGGGCGGGCG GTTTGACGCG 
ATTGTCTGCG GGCGCGGCGG CGGTTGTGGT 
CTTTGGGCGG CGCGTTGAAA GTCGATGACA 
TTGGCGAAGG CGTGGCGCGA TAATGTGAAC 
CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 
GCCGCCGTTT CGGCGATGAT GCCGTTTCGG 
AACCACAtcg gaTTGCCGCT GACTTTATTG 
CTATGCCGTG ATTGAAATGG GCATGAACCA 
taacgcaaaT CGCCA7VACCC GATGCCGCTT 
GCCCATGTCG GATGCGGTTt cgacggagtg 
aagcGAGATT TatgcagGct tATGTTCAGA 
AAGAAGATGC CAATATGGCT GTCTTCAAAA 
ACGTGCACTT TCGGCGTCGA TAGCGGCGAT 
GCTGAAACCT TTGTCGTGCG AATTTGATTT 
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801 GGTGTGCGGC GACGAGCGCA CTGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCCGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG CAAGGCTTCA GCAACATCAA 

951 AGGCCGTCTG AACGTCAAAG CCGGCATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAATCCC GACAGTATGA AAGCCGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG CATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAGGAc gaAGCCGCCG CCATGCACGC CGAagtcgGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAG CGTCGAA 

1201 GCGGcggaAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

1251 GTTGATTCAA GTGTTGAGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

1301 TGAAAGGTTC GCGCTTTATG CAGAtggAAG AAGTGGTCGA GGCATTGGAG 

1351 GATAAGTga 

This corresponds to the amino acid sequence <SEQ ID 248; ORF 08l.ng>: 

g081.pep 

1 MKPLDLNFIC QALKLPKPSE NKPVSRIVTD SRDIREGDVF FALAGGR FDA 

51 H DFVGGVLSA GAAAVWSR E DCAALGGALK VDDTLAALQT LAKAWRDNVK 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVSATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNALR AHVGCG FDGV 

201 GDIAKAKSEI YAGLCSDGMA . LIPQEDANMA VFKTATFNLN TCTFGVDSGD 

251 VRAENIVLKP LSCEFDLVCG DERTAWLPV PGRHNVHNAA AAAALALAAG 

301 LSLNDVAEGL QGFSNIKGRL NVKAGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDOGIEA AYFVGDNSVE 

401 AAEKFGADGL WFAAKDPLIQ VLSHDLPERA TVLVKGSRFM QMEEWEALE 

451 DK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 249>: 

m081 .oeq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

• 51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGAGCG GTTTGACGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCT GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGTGCTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTGCCGC ATTGCAAACG CTGGCAAAGG CGTGGCGTGA AAATGTGAAT 

301 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

3 51 AATGCTGGCT GCGGTATTGC GCCgCCGTTT CGG CGATGAT GCCGTGTTGG 

401 CGACGGCAGG CAACTTCAAC AACCATATCG GATTGCCGCT GACTTTGTTG 

451 AAGTTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 

501 TTTCGGCGAA CTGGCGGTTT TAACGCAmAT CGCCAAACCA AATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TACCAAGGTT TATGTTCAGA 

651 CGGCATTGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

751 GTTCACGCGG AAAATATTGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC GATGAGCGCG CCGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCGGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGAT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAACTG GGCGAGGACG AAGCCGCCGC TATGCACGCC GAAGTCGGCG 

1151 CGTATGCCCG CGACCAAGGC ATCGAAGCGG CTTATTTTGT CGGCGACAAC 

1201 AGCGTCGAAG CGGCGGAAAA ATTTGGCGCG GACGGTTTGT GGTTCGCCGC 

1251 CAAAGACCCG TTGATTCAAG TGTTGCGCCA CGATTTGCCC GAACGCGCCA 

1301 CCGTGTTGGT GAAAGGTTCG CGCTTTATGC AGATGGAAGA AGTGGTCGAG 

1351 GCATTGGAGG ATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 250; ORF 08 1>: 

m081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGERFDA 

51 H PFVEDVLAA GAAAVWS RE DCAAMDGALK VDDTLAALQT LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTXIAKP NAALVNNAMR AHVGCG FDGV 
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201 GDIAKAKSEI YQGLCSDGIA LIPQEDANMA VFKTATLNLN TRTFGIDSGE 

2 51 VHAENIVLKP LSCEFDLVCG DERAAWLPV PGRHNVHNAA AAAALALAAG 

3 01 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAIDVL 

3 51 ARMPAPRIFV MGDMGELGEL GEDEAAAMHA EVGAYARDQG IEAAYFVGDN 
401 SVEAAEKFGA DGLWFAAKDP LIQVLRHDLP ERATVLVKGS RFMQMSEWE 

4 51 ALEDK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 081 shows 94.1% identity over a 455 aa overlap with a predicted ORF (ORF 081.ng) 
from N. gonorrhoeae: 

m081/g081 



10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 

II ,11 ill I ill Ml II I! hill, II Ml Mill II Nil II I I III I II I MM 

g08 1 MKPLDLNFICOALKLPMPSENKPVSRIVTDSRDIREGDVFFALAGGRFDAHDFVGGVLSA 
10 20 30 40 50 60 



70 00 90 100 110 120 

mO 8 1 . pep GAAAVWSREDCAAMDGALKVDDTLAAIiQTIA^ 

IIMIIIIIIIIIh MMM III II MMIIIMM MMM III MM II MM! II 

gO 8 1 GAAAVWSREDCAALGGALKVDDTIAALQTLAKAWRDNVNPFVrc 

70 80 90 100 110 120 



130 140 150 160 170 180 

m081 .pep AVLRRR FGDDAVLATAGNFNNH I GL PLTL L KLNE KHR YAVI EMGMNH FGELAVLTX I AKP 

llllllllllll MMIIIIIIIMIIIIIIIIIMIIIIilllllllllllll I I I I 

gO 8 1 AVLRRRFGDDAVS ATAGNFNNH I GLPLTLLKLNEKHRYAVIEMGMNHFGE LAVLTQ I AKP 

130 140 150 160 170 180 



190 200 210 220 230 240 

mO 8 1 . pep NAAL VNN AMRAIIVG CG FDGVGD I AKAKS E I YQG LC SDG 2 AL I PQEDANMAVF KTATLNLN 

:|||||lhlllllllllllllllMIIIII IIIMMIIIII IIIIIIIIIIIIMM 
gO 8 1 DAALVNNALRAHVGCX3 FDGVGD I AKAKS E I YAGLCSDGMALI PQEDANMAVFKTAT FNLN 

190 200 210 220 230 240 



250 260 270 280 290 300 

mO 8 1 . pep TRTFGI DSGDVHAENI VLKPLS CEFDLVCGDERAAWLPVPGRHNVHNAAAAAALALAAG 

I IMMI I II Ml I MM III IM II Ml I IIMI 1 1 MIMI III III M MMM II 

gO 8 1 TCTFGVDSGDVRAENI VLKPLS CEFDLVCGDERTAWLPVPGRHNVHNAAAAAALALAAG 

250 260 270 280 290 300 



310 320 330 340 350 360 

mO 8 1 . pep LSLNDVAEGLKGFSNI KGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 

II IMMI MMIIIMMMMM II MIMIIMM IMMI IMIMIIIMIM II 

gO 8 1 LSLNDVAEGLQGFSNI KGRLNVKAGI KGATLI DDT YNANPDS MKAAVDVLARM PAP R I F V 

310 320 330 340 350 360 



370 380 390 400 410 420 

m081 .pep MGDMGELGELGEDEAAAMHAEVGAYARDQG I EAAYFVGDNSVEAAEKFGADGLWFAAKDP 

lllllllll IMIIIMIIIII Mill IMMI M MMM Ml I II IMMMIII 

g081 MGDMGELGE- - -DEAAAMHAEVGAYARDQG I EAAYFVGDNSVEAAEKFGADGLWFAAKDP 

370 380 390 40Q 410 



430 440 450 

mO 8 1 . pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

IIMI llllllllllllllllllllllllllllll 

g 0 8 1 LI Q VLS HD L P E RATVLV KGS R FMQMEE WE AL EDKX 

420 430 440 450 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 25 1>: 

a081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGATGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCG GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGTTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTACCGC GTTGCAAATG TTGGCGAAGG CGTGGCGCGA GAATGTGAAC 

301 CCGTTTGTGT TCGGTATTAC CGGCTCGGGC GGCAAGACGA CGGTGAAGGA 

351 AATGTTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATAAT GCCGTTTTGG 

401 CGACGGCAGG CAACTTCAAC AACCACATCG GATTGCCGTT GACTTTGTTG 

451 AAATTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GTATGAACCA 

501 TTTTGGCGAA CTGGCGGTTT TGACACAAAT CGCCAAACCC GATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TATCAAGGCT TATGTTCAGA 

651 CGGCATGGCG CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

751 GTCCACGCGG AAAATATCGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC AACGAGTGCG CAGCCGTGGT TCTGCCCGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCCGCCGCCG CGCTGTCTTT GGCTGCAGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 AT ACT TAT AA TGCGAACCCT GACAGCATGA AAGCTGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGTGAGGAC GAAGCCGCCG CCATGCACGC CGAAGTCGGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

1201 GCGGCGGAAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

1251 GTTGATTCAA GTGTTGCGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

1301 TGAAAGGTTC GCGCTTTATG CAGATGGAAG AAGTGGTCGA GG C ATT GG AG 

1351 GATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 252; ORF 081.a>: 

a081.pap 

1 MKPLDLNFIC QALKLPMPSE SKFVSRIVTD SRDIRAGDVF FALAGGRFDA 

51 HDFVEDVLAA GAAAVWSRE DCVAMDGALK VDDTLTALQM LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDN AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNAMR AHVGCGFDGV 

201 GDIAKAKSEI YQGLCSDGMA LIPQEDANMA VFKTATLNLN TRTFGIDSGD 

251 VHAENIVLKP LSCEFDLVCG NF.CAAWLPV PGRHNVHNAA AAAALSI..AAG 

301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

401 AAEKFGADGL WFAAKDPLIQ VLRHDLPERA TVLVKGSRFM QMEEWEALE 

451 DK* 

m08i/a08i 96.7% identity over a 455 aa overlap 

10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 
II I I I I i I I I I It M I I I I I I I I I I I I I I I I I \ I I I I I I I I 1 I I I I I I I I I I I I I I I I 
a081 MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGGRFDAHDFVEDVLAA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GAAAVWSREDCAAWDGALKVDDTLAALO^LAKAWRENVNPFVFGITGSGGKTTVKEMLA 
I I I I I I I I I I I I : I II I I I 11111:111 I I I I I I I I I I I I I I I I I I I I I I I I I I I I [ I 
G AAAV V VS RED C V AM DG ALK V D DT LT ALQMLAKAW RENVNPFVFCITGS G GKTT VKEM LA 
70 80 90 100 110 120 

130 140 150 160 170 180 

m081 .pep AVLRRRFGDDAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 
I I I I I I I I I : I I I I I I I I I I I I II I I I I I ! I II I I I I I I I I I I I I I II I I I I I I I I I I I 
a081 AVLRRRFGDNAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP 



ra081 . pep 
a081 
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130 14C 150 160 170 180 

190 20C 210 220 230 240 

mC81 .pep N AA L VN N AMRA H VG CG FDG VG D I AKAK S E I YQG LC S DG I AL I PQE DAN MA V FKTAT LN LN 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I M I I I I ! I II 1 I I II I 
3081 DAALVNNAMRAHVGCG FDG VGDI AKAK S E I YQG LC S DGMALI PQE DANMAV FKTAT LNLN 

190 20C 210 220 230 240 



250 26C 270 280 290 300 

m081 . pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAWLPVPGRHNVHNAAAAAALALAAG 
I I I I I I I I I I I I I I I I I I i II I I I 11111:1 i I I I I I I I I I I I I I I I II I I I I : I I I I 
a081 TRTFG1DSGDVHAENIVLKPLSCEFDLVCGNECAAWLPVPGRHNVHNAAAAAALSLAAG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m081 .pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 
I I I I I I I I I I II I I II I I I I I I I I ! I I I II II I I I I I I I I I I I I I I : I I I I I I I I I I I I I 
aU81 LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAVDVIARMPAPRIFV 
310 320 330 340 350 360 



370 380 390 400 410 420 

m081 .pep MG DMGE LGE LGEDEAAAMH AEVG AY AR DQG I E AAY FVG DN S VE AAEK FGADG LW FAAKD P 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I 

a08 1 MG DMGE LGE DE AAAMHAE VGA YARDQG I E AAY FVG DN S VEAAEK FGADG LW FAAKD P 

370 380 390 400 410 



430 440 450 

mO 8 1 . pep LI QVLRHDLPERATVLVKGSRFMQMEE WEALE DKX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a081 LI QVLRHDLPERATVLVKG SRFMQMEE WEALE DKX 

420 430 440 450 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 253>: 

g082 . seq 

1 aTGTGGTTGT TGAAGTTGCC TGCCGTCGCC GAAACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATCTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATG CCGAAC ACAAACGGGT TCACATTATC GCGCCACGCC 

) 151 TTCGCCAACG TTTGCAACGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 

201 GCCCAAAGCC GCGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCGCCCG 

251 CAGACAATAC GCCTCCAACA AAATCATGCG CGTCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC TTCCCGAATA TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGTTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTGCTTTCGT TAATATTCGG 

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGCCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGACT GTCTGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCGGCAC ATCGGGGACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 254; ORF 082.ng>: 

g082 -pep 

1 MWLLKLPAVA ETASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTLSRHA 

51 FANVCNAASV SSTFNAPPKA AQSSRETTTA AAPADNTPPT KSCASNRPPA 

101 NAKNTSPSRI SRLSVTMRDT GLFSDGIGSL RAWQMKFRSS GFI FAFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRPAFSRIP 

201 RRGWGLSVD KGKVIAFARH IGDIPPKIIA VIGQLVGFDT RPTAESA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 255>: 

m082 . seq 

1 ATGnnGTTGT TGAAGTTGCC TGCCGTCGCC AACACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATTTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGAT TCACATTTTC ACGCCACGCC 

151 TTTGCCAGCG TTTGCAATGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 
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201 ATCCATTGCA GCACAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCAG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CGTCAAACCG CTCGCCCGCC 

3 01 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

3 51 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

4 01 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG 
4 51 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 
501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGsATTT TTTCTGTACG 
551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 
601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 
651 CGCCCTGCAC AT CGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 
701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 256; ORF 082>: 

m082.pep 

1 MXLLKLPAVA NTASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTFSRHA 

51 FASVCNAASV SSTFNAPSIA AQSSRETTTA AAPAANTSST KSCASNRSPA 

101 NAKNTS PARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSXF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 082 shows 92.7% identity over a 247 aa overlap with a predicted ORF (ORF 082.ng) 
from N. gonorrhoeae: 

m082/g082 

10 20 30 40 50 60 

m082 .pep MXLLKL PAVANTAS SP KRRRNTAAS I S FTVVLP PEPVMPNTNGFT FSRHAFASVCNAAS V 

i i ii i ii i hi ii iii 1 1 ill ii i ii i it i ii ii 1 1 ii inn hi i nihil iii ii 

g082 MWLLKLPAVA2TAS SP KRRRNTAAS 1 S FTWLP PE PVMPNTNG FTLS RHAFANVCNAAS V 

10 20 30 40 50 60 

70 80 90 100 110 120 

m082 . pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 

n i ii 1 1 inn ii iii iii ii ii i in ii 1 1 ii mi i ihhiii inn ii 

g082 SSTFNAPPKAAQSSRETTTAAAPADNTPPTKSCASNRPPANAKNTSPSRI SRLSVTMRDT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m082 . pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSXF 

I h I I I I I I I I II I II II I I I I I I = I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I 
g082 GLFSDGIGSLRAWQMKFRSSGFIFAFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 
130 140 150 160 170 180 



190 200 210 220 230 240 

m082 . pep FLYVSFFRRI FSRFAFSRIPRRGWGQSVDKGKVIAFALHIGNIPPKI I AVIGQLVGFDT 

llllhlllllllllllllillim I II II III I II llhlllllllhlllllhl 
g082 FLYVSFFRRIFSRFAFSRIPRRGWGLSVDKGKVIAFARHIGDIPPKIIAVIGQLVGFDT 
190 200 210 220 230 240 



m082 .pep 



g082 



RPTAESAX 

III III II 
RPTAESAX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 257>: 

a082 . seq 

1 ATGTGGTTGT TGAAGTTGCC TGCCGTCGCC AAAACGGCAT TATCGCCGAA 
51 ACGGCGGCGC AATACCGCAG CCAACATTTC CTTCACCGTC GTCTTGCCGC 
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101 CCGAGCCGGT AATACCGAAC ACAAACGGGT TCACATTCTC GCGCCACGCC 

151 TTCGCCAACA TTTGCAACGC GGTAAGCGTG TCATCGACTT TCAACGCGCC 

201 ATCCATTGCA ACGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCCG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CATCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG 

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 258; ORF 082.a>: 

a082 . pep 

1 MWLLKLPAVA KTALSPKRRR NTAANISFTV VLPPEPVIPN TNGFTFSRHA 

51 FANICNAVSV SSTFNAPSIA TQSSRETTTA AAPAANTSST KSCASNRPPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFI FTFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAE3A* 

m082/a082 95.5% identity over a 247 aa overlap 

10 20 30 40 50 60 

n\082 . pep MXLLKLPAVANTASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTFSRHAFASVCNAASV 
1 11111111:11 I I I I ! I I I I 1:1 I I I I I I I I I M: I I I I ! II I I I I I I I : :l I I : I I 
a082 MWLLKLPAVAKTALS PKRRRNTAAKI S FTWLPPEPV I PNTNGFTFSRHAFANI CNAVSV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m082 . pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 
I | | | | I I I I I : I I I II I I I I I I I II I I I I I I I I I I I I I II I I I I N I I I I I I I II I I I I 
a082 SSTFNAPSIATQSSRETTTAAAPAANTSSTKSCASNRPPANAKNTSPARMSRLSVTMRDT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m082 . pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSXF 

I I I I I I I I I I I I I I I I N ! t I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 1 

a082 GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m082 . pep FLYVS FFRRI FSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNI PPKI I AV IGQLVGFDT 

I I I I I || I I I I I I I I I II I II I I I I I II II II I I I I I I I I I I I I I I M I I I I I I I I II I I 
a 08 2 FLYVSFFRRI FSRFAFSR I PRRGVVGQSVDKGKVIAFALHI GNI PPKI I AV I GQLVG FDT 

190 200 210 220 230 240 



m082.pep RPTAESAX 
I I I I I I I I 
a082 RPTAESAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 259>: 

g084 . seq 

1 ATGAAacaAT CCGcccgaat aAAAAATATG GATCAGACAT TAAAAAATAc 

51 attgggcatt tGCGCGcttt tagcctTTTG TTTTggcgcG gccaTCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGC 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GCTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCTT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 
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401 TTTGGAAATA TTGTGTATCT GTGGGGGTAT TTGCTGACGT AAAAAACTAT 

451 AAACGTCGCA GCAAAA7ATG GCTGACCATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGAGAAAA TCGccggcga taaAGATTGG CGAGaacctg 

551 atgccggcct gttgttgaat ATTTTcgacc tgtattaCga cttggctttc 

601 cgcgccggca cAATATGCCG CCAAGCGCGC CCAcattttg gaagCagcaa 

651 aaaaagcgtC AACATGGCAt atccgccaac ttgcgcccaa gTAtaa 
This corresponds to the amino acid sequence <SEQ ID 260; ORF 084.ng>: 

g084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA A1ASG YHLEY EYGYR YSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY 3APSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WKYCVS VGVFADVKNY 

151 KRRS KIWLTI LLTL1LSCAV ME KIAGDKDW REPDAGLLLN IFDLYYDLAF 

201 RAGTICRQAR PHFGSSKKSV NMAYPPTCAQ V* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 26 1>: 

m084 . seq 

1 ATGAAACAAT CCGCCcGAAT AAAa . ATATG AATCAGACAT TACTTTATAC 

51 ATTGGGCATT TGCGCGCTTT TAACCTTTnn nnnnnnnnnn nnnnnnnnnn 

101 nnnnnTATCA CCCnGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

401 TTTGGAAATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

451 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCT.TC 

601 CGCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 262; ORF 084>: 

m084.pep 

1 MKQSARIKXM NQTLLYTLGI CALLTFXXXX XXXXX YHPEY EYGY RYSAVG 

51 ALASWFLLL LA RGFP RVSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WKYCVS GGVFADVKNY 

151 KRRS KIWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN I FDLYYDLAX 

201 RAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 084 shows 90.5% identity over a 231 aa overlap with a predicted ORF (ORF 084.ng) 
from N. gonorrhoeae: 

m084/g084 



10 20 30 40 50 

m084 . pep MKQSARI KXMNQTLLYTLG I CALLT F YHPEYEYGYRYSAVGALASWFLLL 

llllllll hill IMIIIIhl I IIIIIIIIIIIIIIIIIMIII 

g084 MKQSAR I KNMDQTLKNTLG I CALLAFCFGAAI ASGYHLE YE YGYRYS AVGALAS WFLLL 

10 20 30 40 50 60 



60 70 80 90 100 110 

m084 . pep LARGFPRVSSWLLI YVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

1 Mill IMIMII III llllll II III II MMII lllllllllll llllll III I II I 

g 0 84 LARGFPRVSSWLLI YVGTTALYLPVGWLYGAPSYQ I VGS I LESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 



120 130 140 150 160 170 

m084.pep YFVQALFFIFGLTWKYCVSGGVFADVKNYKRRSKIWLTILLTLIIjSCAVMDKIASDKDL 

lllllllllll Mtllllll IIIMIIIIIIIIII II llllll I M MM Ih III 

g0 84 Y FVQ AL F F I FGLTWI KYCV S VGVFADVKNY KRRS K I W LT I LLTL I LS CAVME KI AGD KDW 

130 140 150 160 170 180 
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180 190 200 210 220 

m084 .pep REPDAGLLLNIFDLYYDLAXRAGTICRQARPHFGSSKKSVNmYPSCCAQVX 

III l.lllll! II MINI Ml MM III III Mill Mill III Mill 

g 0 8 4 RE PDAGLLLNI FDLY YDLAFRAGT I CRQARPHFGS SKKSVNMAYP PTCAQVX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 263>: 

a084 .seq 

1 ATGAAACAAT CCGCCCGAAT AAAAAATATG GATCAGACAT TAAAAAATAC 

51 ATTGGGCATT TGCGCGCTTT TAGCCTTTTG TTTTGGCGCG GCCATCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

401 TTTGGAGATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

451 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCTTCC 

601 . GCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 264; ORF 084.a>: 

a084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 A LASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WRYCVS GGVFADVKNY 

151 KRRSK IWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN IFDLYYDLAS 

201 XAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

m084/aOB4 92.2% identity over a 23 1 aa overlap 

10 20 30 40 50 60 

MKQS AR I KXMN QTLL YT LG Z CALLT FXXXXXXXXX YH PE YE YG YR YS AVGALASVVFLLL 
I I i I } I I I 1:111 III I I I I: I II I I I I I I I I I 1 I I I I I I I I I I II 

MKQSARIKNMDQTLKNTLG I CALLAFCFGAA I ASGYHLEYEYGYRYS AVGALASVVFLLL 
10 20 30 40 50 60 

70 80 90 100 110 120 

LARGFPRVSSVVLLIYVGT7ALYLPVGWLYGAPSYQIVGSI1ESNPAEAREFVGNLPGSL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I II I II I I I I I I 
LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSI1ESNPAEAREFVGNLPGSL 
70 80 90 100 110 120 

130 140 150 160 170 180 

YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTIL1TLILSCAVMDKIASDKDL 
I I If I M It I II I I I : I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I } I 
YFVQALFFIFGLTVWRYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 
130 140 150 160 170 180 

190 200 210 220 230 

RE PDAGLLLNI FDLY YDLAXRAGTI CRQARPHFGSSKKSVNMAYPSCCAQVX 
I I I I 1 i I I I I I I I I I I I I I I I I I I I I ( I I I M 1 I I I I I I I I I I I I I I I I I 
RE PDAGLLLNI FDLY YDLASXAGTI CRQARPHFGSSKKSVNMAYPSCCAQVX 
190 200 210 220 230 



ra084 .pep 
a084 

ra084 .pep 
a084 

m084 .pep 
a084 

m084 .pep 
a084 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 265>: 

g085.seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGAC3CGT TGAAAGATAA 
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51 GGCAAAAGGC GTGTTCCTGA TCGGCGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AACCTGACCG ACTGCGTCAC TTTGGAAGAG 

151 GCGGTTCAGA CGGCATACGC CCAAGCCGAA GCGGGCGATA TTGTCTTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGG CT AC GCGCAC CGTT 

251 CGGAAGTGTT tatCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 266; ORF 085. ng>: 

g085.pep 

1 MGKGODPTPL RDALKDKAKG VFLIGVDAPQ IRRDLDGCGL NLTDCVTLEE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 267>: 
m085.seq 

1 ATGGGTAAAG GGCAGGACTT CACGCCCCTG CGCGATGCAC TGGTAGGCAA 

51 GGCAAAAGGC GTGTTCTTGA TTGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AATATGACCG ACTGCGCCAC TTTGGGAGAA 

151 GCCGTTCAGA CGGCATATGC CCAAGCCGAA GCAGGCGATA TTGTGTTGCT 

201 CAGCCCCGCC TGCGCGAGCT TTGATATGTT CAAAGG CTAC GCGCACCGTT 

2 51 CGGAAGTGTT TATCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 268; ORF 08 5>: 

mO 8 5 . pep 

1 MGKGQDFTPL RDALVGKAKG VFLIGVDAPQ IRRDLDGCGL NMTDCATLGE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 085 shows 94.7% identity over a 94 aa overlap with a predicted ORF (ORF 085.ng) 
from N. gonorrhoeae: 

m085/g085 

10 2C 30 40 50 60 

m0 85 . pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE 

I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I : I I I : I I I || | | | I | | | I 
g085 MGKGQDFTPLRDALKDKAKGVFI.IGVDAPQIRRDLnGCGLNLTDCVTLEEAVQTAYAQAE 
10 20 30 40 50 60 

70 80 90 

mO 85 . pep AGD I VLLS P ACAS FDM FKG YAHRSE VF I E AF KALX 

I I III I II II! II III I II II I II III I II III I I 
g085 AGDI VLLSPACASFDMFKGYAHRSEVFI EAFKALX 

70 80 90 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 269>: 

a085.seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGC TTGCCGGCAA 

51 GGCAAAAGGC GTGTTCCTGA TCGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGATCTG AATATGACCG ACTGCGCCAC TTTGGAAGAA 

151 GCGGTTCAGA AGGCATATGC CCAAGCCGAA GCGGGCGATA TCGTGCTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGGGGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 270; ORF 085.a>: 

a085.pep 

1 MGKGQDFTPL RDALAGKAKG VFLIGVDAPQ IRRDLDGCDL NMTDCATLEE 
51 AVQKAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIGA FKAL* 

m085/a085 94.7% identity over a 94 aa overlap 



10 20 30 40 50 60 

IT1085 . pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE 
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|| I I I I I I j I I I I I : 1 I I i 1 I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I 1 
a 085 MGKGQDFTPLRDALAGKAKGVFLIGVDAPQIRRDLDGCDLNMTDCATLEEAVQKAYAQAE 
10 20 30 40 50 60 

70 80 90 

m085 . pep AGDIVLLSPACAS FDMFKGYAHRSEVFIEAFKALX 

II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
a085 AGDIVLLSPACAS FDMFKGYAHRSEVFIGAFKALX 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 27 1>: 

g086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCGGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCGAAGG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GGCAGGCGGG GTTCGTCGTT GCCGGCCTTA TAGCGAGCGG TTTTTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CTTATCCGGC CTGTTGCTGG TAGCCGTATT GATTGCCGGG CGCGAAATCA 

2 51 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

3 01 GAGCTGTTCA AGCTGGCAGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

3 51 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAACCTG ATTATGTCCG CCACCAATCC GCAGGCACGT 
4 51 CGTGAAACAT TAGAAATGTA CGgcCGTTTC CGGGCGATCA TCCTGCCGAT 
501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
551 GTTCGTTTGT CGTCATTACC GTCATTACCG TTGGAATGCT GTTTCTGGCA 
6 01 GGATTGCCGT GGAAATATTT TTTTGTCCTG GTAGGCAGCG TCTTGGGTGG 
6 51 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 
701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 
751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 
8 01 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 
8 51 TTTTTGCCAT CATCGCTGAA GAATTCGGCT TCTTCGGGAT GTGCGTGCTG 
901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 
951 GTCGCGCGAT TTGGGtttgA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGctgCcg tTGATGTCCT ATGGcggTTC 

1101 GTCAGTCTTT TTCATG CTGA TCAGCATGAT GCTGCTGTTG CGTATCGATT 

1151 ATGAAAACCG CCAGAAAATG CGCGGTTACC GGGTGGAGTA AA 

This corresponds to the amino acid sequence <SEQ ID 272; ORF 086.ng>: 

g086 .pep 

1 MWLMTAFGL LMIYSASV YL ASKEGGDQFF YLTRQA GFW AGLIASGFLW 

51 FLCRMRTWRR L VPWIFALSG LLLVAVL I A G REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VITVGMLFLA 

201 GLPWK YFFVL VGSVLGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

3 01 IFCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SF FNIGVNIG 

3 51 ALPTKGLTL P LMSYG GSSVF FMLISMMLLL RI DYENRQKM RGYRVE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 273>: 
m086.seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

2 51 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACc 

301 GAGCTGTTCA AGCtGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GG CCAATCTG ATCATGTCCG CCACCAATCC GCAGrCACGT 
4 51 CGTGAaACAT TAGAAATGTA CGGCCGTwTC CGGGCGATCA TCCTGCCGAT 
501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
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551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGG CAAGC A 

951 GTCGCGCX3AT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGkrTCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA mCAAAgGyCT GACGCyGCCG Tg.AtGTCCw ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTkG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTATC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 274; ORF 086>: 

m086 .pep 

MWLMTAFSL LMI YSASVYL ASKEGGDQFF YLTRQAGFW AGLIASGLLW 



1 
51 
101 
151 
201 
251 
301 
351 



FLCRMRTWRR LVPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 
ELFKLAVILY LASLFTRREE VLRSMESLGW QS I WRGTANL IMSATNPQXR 
RETLEMYGRX RAIILPIMLV AFGLVLIMVQ PDFGSFWIT VIAVGMLFLA 



GLPW KYFFVL VGS VLGGMVL MIT AAPYRVQ RWAFLDPWK DPQGAGYQLT 
HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 
IFCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGXQ SF FNTGVNIG 
ALPXKGLTXP XMSXGGSSVF FMLISMMLLX RIDYENRRKM RGYRVE* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 086 shows 96.7% identity over a 396 aa overlap with a predicted ORF (ORF 086.ng) 
from N. gonorrhoeae: 

m086/g086 

10 20 30 40 50 60 

m086 . pep MWLMTAFSLLMI YSASVYLASKEGGDQFFYLTRQAGFWAGLIASGLLWFLCRMRTWRR 

Ml I II I Ml MINI I III III MM I III II I M I II MM II I hi llllllllll 
g086 MVVLMTAFGLLMIYSASVYLASKEGGDQFFYLTROAGEWAGLIASGFLWFLO^ 

10 20 30 40 50 60 

70 80 90 100 110 120 

m086.pep LVPWIFALSGLLLVWLI AGREINGATRWI PLGPLNFQPTELFKLAVI LYLASLFTRREE 

1 1 1 1 1 1 1 1 1 1 1 1 I h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIIIIIIIIMII! 

g0 8 6 LVPWIFALSGLLLVAVLI AGREINGATRWI PLGPLNFQPTELFKLAVI LYLASLFTRREE 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 86 . pep VLR SME SLGWQS I WRGTANL I MS ATN PQX RRETLEM YGRXRAI I L P I MLVAFGLVL I MVQ 

I II I II II II 1 1 II III III Mi II M I Mill Nil! llllllllll III Mlllll 

g0 86 VLRSME SLGWQS I WRGTANL IMS ATN PQARRETLEM YGRFRAI I LPIMLVAFGLVLIMVQ 

130 140 150 160 170 130 

190 200 210 220 230 240 

mO 86 . pep PDFGSFWITVI AVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

lllllllllllhl IIIIIIMIIIIIIMI MUM lllllllllll II III Mlllll 

gO 8 6 PDFGSFWITVI TVGMLFLAGLPWKYFFVLVGSVLGGMVLMI TAAP YRVQR WAFLDPW K 

190 200 210 220 230 240 

250 260 270 280 290 300 

m086 . pep DPQGAGYOLTHSLMAIGRGEWFGMGLGASLSKRGFLPFAHTDFIFAIIAEEFGFFGMO/L 

I II Ml II MIMI Ml II MIIIMI Ml I III III IMIM MM I II II I MMMI 

g086 DPOGAGYQLTHSI>iAiGRGEWFGMGU^SLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
250 260 270 280 290 300 



310 



320 



330 



340 



350 



360 
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m086.pep I FCYGWLWRAFS I GKQSRDLGLTFNAY I ASGI G I WI GXQS FFN IGVNI GALPXKGLTXP 

III 111 II llllll III II III Ml Mill II Nil II I I I I I I I I I I I I I [ : I [ | | | 
g086 IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 
310 320 330 340 350 360 



370 380 390 

m086 . pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX 

1 1 ii iii urn ii iii it i nihil iiiiii i 

g 0 8 6 LMSYGGSSVFFMLI SMMLLLR ID Y ENRQ KMRGY RVEX 

370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 275>: 

a08 6 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

20 1 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGGCACGT 

4 51 CGTGAAACAT TAGAAATGTA CGGCCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGCTGCCG TTGATGTCCT ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG C3TATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTACC GGG TGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 276; ORF 086.a>: 

a086 .pep 

1 MWLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGLLW 

51 FLCRMRTWRR L VPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF RA IILPIMLV AFGLVLIMV Q PDFGS FWIT VIAVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYOLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

301 IFCYGWLVV R AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG 

351 ALPTKGLTLP LMSYGGSSVF FMLISMMLLL RIDYENRRKM R3YRVE* 



m086/a086 98.0% identity over a 396 aa overlap 

10 20 30 40 50 60 

m08 6 . pep MWLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRNRTWRR 
I I I I I I I I I I I II I I i I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I 1 I I I I I I I | | | | | | | 
a086 MVVLMTAFSLTiM I YS AS VYLASKEGGDQFFYLTRQAGFW AG F..T.ASGLLW FLCRMRTWRR 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 6.pep LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE 
H I I I I I I I I I I I I I I I i I I I I I I I I I I ! I I I I I I I I ! I M I I I i I i | I I | M I I I II I I 
a086 LVPWIFALSGLLLWVLIAGREINGATRWIPLGPLN FQPTELFKLAVILYLASLFTRREE 



WO 99/57280 



PCT/US99/09346 



277 



70 



80 



90 



100 



110 



120 



130 140 150 160 170 180 

m086.pep VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFX3LVLIMVQ 
I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I 
a086 VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m066 . pep PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK 
I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
a086 PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m086 . pep DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
I I i I I II II I II I I I I I I I II I I I I I I I i It I II I I I I I I I I I I I I I I I I I I I I I II I I I 
aO 8 6 DPQGAG YQLTHSLMAI GRGEW FGMGLGASLSKRGFLPEAHTDFI FAI I AEEFGFFGMCVL 

250 260 270 280 290 300 

310 320 330 340 350 360 

mOB6.pep IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 
I II I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I II I I I I I I I I : I I I I I 
a086 IFCYGWLVVRAFSIGKOSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 

310 320 330 340 350 360 

370 380 390 

mO 8 6 . pep XMSXGGS SVFFML I SMMLLXR I D YENRRKMRGYRVEX 

II 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

a086 LMSYGGSSVFFMLISMNLLLRIDYENRRKMRGYRVEX 
370 380 390 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 277>: 

g087 . seq 



l 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 



ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACACAT 
TTTCCCAGCT CTGGCTGTGG CGGATTCATT GCGCGTGCGC GGTCATCATG 
TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 
TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGAATAC GCGGCAACGG 
CATCAAACGC AAGCTGATGC TTCCGTTTAC TCTGTACAAA ACCGTCCGCG 
AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 
GGCGGTTTTG TTACCTTTCC CGGCGGTCTG GCGGCGAAAC TCTTGGGCGT 
GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGCTTG TCCAACCGCC 
ctGGGCGAAA CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 
GCGGTTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 
CCTGCCGAAC GCTTCCAAGG GCGCGAAGGC CGTCTGAAAA 
TTTTGGTGGT CGGCGGCAGT TTGGGTGCGG ACGTTTTGAA CAAAACCGTA 
CCG CAGGCGT TGGCACTGCT GCCTGAAGAG GTGCGCCCGC AGATGTACCA 
CCAGTCGGGG CGTAACAAGC TGGGCAATCT T CAGGCGG AT TATGACGCGT 
TGGGCGTGAA AGCGGAATGC GTGGAATTTA TTACCGACAT GGTGTCCGCC 
TACCGTGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 
CGAGTTGACG GCGGCGGGGC TGGGCGCGTT GTTAGTGCCG TATCCTCACG 
CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTCATGGT GCAGGCAGAA 
GCGGGGCTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 
AATCCTCGGC AGCCTCAACC GCGAAAAATG CCTCAAATGG GCGGAAAACG 
CCCGTACGTT GGCATTGCCG CACAGCGCGG ATGACGTTGC CGAAGCCGCG 



AccTGTCGCg 
AGCCACGAAC 
CCTGCCCGTG 



1001 

1051 ATTGCGTGTG CGGCGTAAA 

This corresponds to the amino acid sequence <SEQ ID 278; ORF 087.ng>: 

g087 -pep 

1 MGGKTFMIJVIA GGTGGHIFPA LAVADSLRVR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGIRGNGIKR KLMLPFTLYK TVREAQRIIR KHRVB CVIGF 

101 GGFVTFPGGL AAKLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 



WO 99/57280 



PCT/US99/09346 



278 



201 PQALALLPEE VRPQMYHQSG RNKLGNLQAD YDALGVKAEC VEFITDMVSA 
251 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 
301 AGLLLPQTQL TAEKLAEILG SLNREKCLKW AENARTLALP HSADDVAEAA 
351 IACAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 279>: 



m087 . seq 








1 


ATG GGCGGT A 


AAAC CTTT AT 


GCTGAwkkCG 


51 


TTTCCCCGCG 


CTGGCGGTGG 


CGGATTCATT 


101 


TGATTTGGCT 


GGGCAGCAAG 


GATTCGATGG 


151 


TACGG CATAC 


GCTTGGAAAC 


GCTGGCGATT 


201 


CATCAAACGC 


AAACTGATGC 


TGCCGGTTAC 


251 


AAGCGCAGCG 


GATTATCCGC 


AAACACCGTG 


301 


GGCGGCTTCG 


TTACCTTCCC 


CGGCGGTTTG 


351 


GCCGATTGTG 


ATTCACGAGC 


AAAACGCCGT 


401 


ACCTGTCGCG 


CTGGGCGAAG 


CGGGTGTTGT 


451 


AGCCACGAAG 


GCGGCTTGGT 


CGGCAACCCC 


501 


CCTGCCCGTG 


CCTGCCGAAC 


GCTTCCAAGG 


551 


TTTTGGTGGT 


CGGCGGCAGT 


TTGGGCGCGG 


601 


CCGCATGCAT 


TGGCTTTGCT 


GCCCGACAAT 


651 


CCAATCGGGA 


CGGGGCAAGC 


TGGGCATCTT 


701 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


751 


nnnGCGGGAT 


TGGGTGCGTT 


GTTAGTGCCG 


801 


TCACCAAACC 


GCCAACGCGC 


GTTTTATGGT 


851 


TGTTGCCGCA 


AACCCAGTTG 


ACGGCGGAAA 


901 


GGCTTAAACC 


GCGAAAAATG 


CCTCAAATGG 


951 


GGCACTGCCG 


CACAGTGCGG 


ACGACGTGGC 


1001 


CGGCGTAA 







This corresponds to the amino acid sequence <SEQ ID 280; ORF 087>: 

m087 .pep 

1 MGGKTFMLXX GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGVRGNGIKR KLMLPVTLYQ TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLXVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PHALALLPDN ARPHMYHQSG RGKLGILQAX XXXXXXXXXX XXXXXXXXXX 

2 51 XAGLGALLVP YPHAVDDHQT ANARFMVQAE AGLLLPQTQL TAEKLAEILG 

301 GLNREKCLKW AENARTLALP HSADDVAEAA IACAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 087 shows 83.9% identity over a 355 aa overlap with a predicted ORF (ORF 087.ng) 
from N. gonorrhoeae: 

m087/g087 

10 20 30 40 50 60 

m087 .pep MGGKTFMIJCXGGTGGHIFPAIAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 

MINIM lllllllllllllllllhlllllllllllllllllllllllllllllll 

g087 MGG KT FMLMAGGTGGH I FP ALAVAD S LR VRGHHVI W LG S KD SME ERI VPQYG I RLETLA I 

10 20 30 40 50 60 

70 80 90 100 110 120 

m0 87 . pep KGVRGNGI KRKLMLPVTLYQTVREAQRI I RKHRVECVI G FGG FVTFPGGLAAKLLXVP IV 

Ihllllllll llll llhlllllll lllllllllllllllllll IIIMIIII 1 1 1 1 

g087 KG I RGNG I KRKLML PFTLY KTV RE AQR 1 1 RKHRVE CVI G FGG FV T F PGGLAAKLLGVP IV 

70 80 90 100 110 120 



130 140 150 160 170 180 

m087 .pep I H EQNAVAGLS NRHLS RWAKRVLY A F PKA FSH EGGLVGN P VRAD I SNL P VP AERFQGREG 

I III till IIIIIIIIMIIMM MINI IIMII II M llll II ; IMIII II Mill 

g087 I HE0NAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADI SNLPVPAERFQGREG 

130 140 150 160 170 180 
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190 200 210 220 229 
m087 . pep RLKILWGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQA 

MHIMI I! I Mill Ml I Mil ! I h - I hi I III IMM III 

g087 RLKILWGGSLGADVl^KTVPQAl^ijPEEVKPQMyHOSGRiaKLGNLQADyDALGVKAEC 
190 200 210 220 230 240 



230 240 250 

mO 8 7 . pep AGLGALLVPYPHAVDDHQTANARFMVQAE 

IIIIIIIMIIIIMIIIIIIIIMim 

g08 7 VE F I TDMVSAYRDADLVI CRAGALTI7AELTAAGLGALLVP Y PHAVDDHOTANARFMVQAE 

250 260 270 280 290 300 



260 270 260 290 300 310 

m087 .pep AGLLLPQTQLTAEK1AEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

III lllll II llll llll Ihl III I I I lllllll I III III IN IIIIMIIII I 
g087 AGLLLPQTQLTAEKLAEILGSLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 
310 320 330 340 350 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ED 28 1>: 

a087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG C5GATTCATT GCGCGCGCGC GGCCATCATG 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 

151 TACGACATCC TGCTCGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

201 CATCAAACGC AAGCTGATGC TGCCGTTTAC TTTGTATCAA ACTGTCCGCG 

251 AAGCGCAGCA GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGCTTCG TTACCTTTCC CGGCGGTTTG GCGGCGAAGT TATTAGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

451 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC AGATGTACCA 

651 CCAATCGGGA CGGGGCAAGC TGGGCAGCTT GCAGGCGGAT TACGACGCGC 

701 TGGGCGTGCA AGCGGAATGC GTGGAATTTA TTACCGATAT GGTGTCCGCC 

751 TACCGCGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 

801 CGAGTTGACG GCGGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG 

851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG 

901 GCGGGATTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 

951 GATTCTCGGC GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG 

1001 CCCGTACGTT GGCACTGCCG CACAGTGCGG ACGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 282; ORF 087.a>: 

a 08 7 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YDILLETLAI KGVRGNGIKR KLMLPFTLYQ TVREAQQIIR KHRVECVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PQALALLPDN ARPQMYHQSG R3KLGSLQAD YDALGVQAEC VEFITDMVSA 

251 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 

301 AGLLLPQTQL TAEKLAEILG GLNREKCLKW AENARTLALP HSADDVAEAA 

351 IACAA* 

m087/aoa7 85.4% identity over a 355 aa overlap 

10 20 30 40 50 60 

m087 . pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHIIVIWLGSKDS^3EERIVPQYGIRLETIAT 
i I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I II I I I II | | | | | | | | | I I I I I | 
a087 MGGKTFMLMAGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYDILLETLAI 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m087 . pep KGVRGNGI KRKLMLPVTLYQTVREAQRI IRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV 

I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I t I I I I 11 I I | I I | | | | | | | | | M Mil 
a087 * K G VRGNG I KRK LML P FT L YQT VRE AQQ 1 1 RKHR VE C V I G FG G FVT FPG G LAAKLLGV P I V 

70 80 90 100 110 120 

130 140 150 160 170 180 

m087.pep IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 
I I I I I I I I I I I I I I I I I I t I I ! I I I I I I I I I i II I I I I I I I I I I I I I I I I I i I I I I I I I I 
a087 IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADI3NLPVPAERFQGREG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m087 . pep RLKILWGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQAXXXXXXXXXXX 

I i II I II I I I I I I I I II I I I I : I I I I I II ! II I : II I I I I I I I II III 
a087 R LK I LWG G S LG AD V LNKT VP QAL A LL P DN AR PQM YHQ SG R GK LG S LQAD Y DA LGVQ AEC 

190 200 210 220 230 240 

250 260 270 280 

m08 7. pep XX XXXXXXXXXAGLGALLVPYPHAVDDHQTANARFMVQAE 

: : I I I I I I I I I I I I [ II I I I I I II I I I I I I I 

a087 VE FI T DM V S A Y R DA DL VI CRAG AL T I AE LT AAG LG ALL V P Y PH AV DDH QT AN AR FMV QAE 

250 260 270 280 290 300 

290 300 310 320 330 

m087 . pep AG LL L PQT QLT AEK LAE I LG G LNR E K C LKW AEN ART L A LP H S AD D V AE AA I AC AAX 

I I II I I I I I M 1 1 I 1 1 I I I I I I I I 1 1 I 1 1 I I II 1 1 I 1 1 1 1 I I I M I I I I I I I I I I I 

a087 AGLLLPQTQLTAEKLAEILGGLNREKCLISWAENARTLALPHSADDVAEAAIACAAX 
310 320 330 340 350 



The following partial DNA sequence was identified in Kgonorrhoeae <SEQ ID 283>: 

g088 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTTAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGC CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCGA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAC TGCTTGCCAC CGGTGCGCTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTT TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAArGG 

401 TGTGGCAGTC AAGCGTTGCC GTTatcgcCG GTttggcaTT GTTTTACctt 

451 gCcgcCAATT CCGCCAACAA TATTTTGATT GTCCCGtttT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTttt gGtgttgTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAACGCC GTCAACCTCA CcgaCGGCTT GGACGGCCTT 

601 GCCGCcttcc cgttcgtcct cgttgccgcC GGGCTCGCCA ttttcgccTA 

651 CGTCAGCGGA CACTACCAAT TTTCCCAATA CCTCCAGCTT CCCTATGTCG 

701 CCGGCGCGAA CGAAGTCGCT ATATT CTGCA CCGCCATGTG CGGCGCGTGC 

751 CTCGGATTTT TGTGGTTCAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGCGCG CTGGCATTGG GTGCCGCGCT CGGTaccGtt gCCGTcaTcg 

851 tCCGCCAAGA ATTTGTcctc gtcattaTGG GCGGTCTGTT cgtcgtagaa 

901 gccgtgTCCG TTATGCTTCa tgtcggCTGG TACAAGAAAA Ccaaaaaacg 

951 CATCTTcCTg acgGcaccga ttcatcacca ttaCCaactt cgatgCTGGa 

1001 aagaaacgca agtcgtcgtc CGTTtCTGGA TTAtTAccac cgtcgtggtc 

1051 tTgataggtt tGagtacccT caAAattcgc ggaaactatg ccgTCCGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 284; ORF 088.ng>: 

g088.pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G FWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT PTMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSANNILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 
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2 01 AAFP FVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGAN EVA I FCTAMCGAC 
251 LGFLWFNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 
301 AVSVM LHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIVW 

3 51 LIGL STLKIR GNYAVRTPFR RHLNAQ* 

The following partial DNA sequence was identified in N .meningitidis <SEQ ED 285>: 

ra038 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAnC TGGTTAACCG GTCTGAATmi 

51 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

101 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

151 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

251 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

3 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

3 51 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
451 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
501 nnnnnnnnnn nnnGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 
551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 
601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 
651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 
701 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 
751 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 
801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTTATCG 
851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 
901 GCCGTATCCG TTATGCTTCA GGTTGGCTGG TATAAGAAAA CCAAAAAACG 
951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA TTATTACCAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ATCTTTCAGA CGGCATTTGA ACGCGCAATA A 



1 MFLWLAHFSN WLTGLNIFOY T TFRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSANNILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIVR QE FVL VIMGGLFWE 

301 AVSVM LHVGW YKKTKKRIFL TAPIHHHYQL RCWKET QVW RFWIITIVW 

3 51 LIGLSTLKIR GNYAVRTPFR RHLNAQ* 



This corresponds to the amino acid sequence <SEQ ID 286; ORF 088>: 

m088 .pep 

1 MFLWLAHFSX WLTGLNXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

101 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XGWGFLVLS YLTIVG TSNA VNLTDGLDGL 

201 ATFPWLVAA GLAIFA YASG HSQFAQYLQL PYVAGANE W I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIVR Q EFVL VIMGGLFWE 

301 AVSVM LQVGW YKKTKKRIFL MAPIHHHYEQ KGWKET QVW RFWIITIVLV 

351 LIGL STLKIR XTYAVXTSFR RHLNAQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 088 shows 91.7% identity over a 205 aa overlap with a predicted ORF (ORF 088.ng) 
from N. gonorrhoeae: 

m088/g088 



10 20 30 

mO 8 8 . pep GWGFLVLSYLTIVGTSNAVNLTDGLDGLA 

II II IIMil III 1 1 II II! 1 1 1 1 II I II! 

g088 IAGLALFYLAANSANNILIVPFFKQIALPLGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 
150 160 170 180 190 200 
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40 50 60 70 80 90 

m088 . pep TFPWLVAAGLAI FAYASGHSQFAGYLQLPYVAGANEWI FCTAMCGACLGFLWFNAYPA 

:|| II I llllll III 0 I I I I ^ I I 1 I I i I I I I I I I I = I I t I 1 I I I I I I I I J I E I J I J t 
g088 AFPFVLVAAGLAIFAYVSGHYQFSCYLQLPYVAGANEVAI FCTAMCGACLGFLWFNAYPA 

210 220 230 240 250 260 

100 110 120 130 140 150 

m088 . pep QVFMGDVGALALGAALGTVAV I VRQEFVLVI MGGLFWEAVSVMLQVGW YKKTKKR I FLM 

I I I II I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I 1 I I I I : I I I I M I I I I I I I 
g088 QVFMGDVGALALGAALGTVAV I VRQ EFVLV I MGGLFWEAVS VMLHVGW YKKTKKR I FI/T 

270 280 290 300 310 320 

160 170 180 190 200 

m088 . pep APIHHHYEQKGWKETQVWRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 

llllllh : IIIIMIIIIIIIIIhlllllilllll :||| I MINIMI 
g088 APIHHHYQLRCWKETQVVVRFWIITIVWLIGLSTLKIRGNYAVRTFFRRHLNAQX 
330 340 350 360 370 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 287>: 

a0B8 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTCAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGT CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCAA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAT TGCTCGCCAC GGGCGCACTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTC TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 

4 01 TGTGGCAGTC AAGCGTTGCC ATTATCGCCG GTTTGGCATT GTTTTACCTT 

4 51 GCCGCCAATT CCGCCAACAA TATTTTGATT GTCCCGTTCT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 

601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 

651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 

701 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 

751 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTCATCG 

851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 

901 GCCGTATCCG TTATGCTTCA GGTCGGCTGG TATAAGAAAA CCAAAAAACG 

951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAAC CCA AGTCGTCGTC CGCTTTTGGA TT ATT AC CAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 288; ORF 088.a>: 
aOBS.pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLLW G NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YXDPNGVSAK FKMVWQSS VA IIAGLALFYL 

151 AANSANNILI VPFFKQIA LP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPWLVAA G LAI FA Y AS G H5QFAQYLQL PYVAGANE W IFCTAMCGAC 

251 LGFLW FNAYP AQVFMG UVGA LALGAALGTV AVIV RQE FVL VIMGGLFVVE 

301 AVSV MLQVGW YKKTKKR I FL MAPIHHHYEQ KGWKETQ VW RFWIITIVLV 

351 LTGLSTLKIR *TYAV*TPFR RHLNAQ* 

mOB8/a088 99.5% identity over a 205 aa overlap 

150 160 170 180 190 200 

m088 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA 

t i I I I I I I I I I II I I I I I I I It I I I I I I I I 
a088 IAGLALFYLAANSANNILIVPFFKQIALPLGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 
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150 160 170 180 190 200 

210 220 230 240 250 260 

m088 . pep T FPWLVAAGLAI FAYASGHSQFAQYLQLP YVAGANEWI FCTAMCGACLGFLWFNAYPA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I II 1 I I I I I I I I I I I I I II I 
a 08 8 T FPWLVAAGLAI FAYASGHSQFAQYLQLPYVAGANEWI FCT AMCGACLG FLW FNAY PA 

210 220 .230 240 250 260 

210 2B0 290 300 310 320 

m088 . pep QVFMGDVGALALGAALGTVAVIVRCEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM 
I I I M I I I I I I I I I I 1 I I I I I I I M I I I I I I I I I I I I I I I I II II I I I I I I I I I i I I I I I 
a088 QVFMGDVGAl^LGAALGTVAVIVRCEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM 

270 280 290 300 310 320 

330 340 • 350 360 370 

m088 . pep APIHHHYEQKGWKETQWVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 
I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I ! I I I I 
a088 APIHHHYEQKGWKETQVWRFWIITIVLVLIGLSTLKIRXTYAVXTPFRRHLNAQX 

330 340 350 360 370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 289>: 

g089 .seq 

1 ATGCCGCCCA AAATCACGAA GAGCGGGTTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATG AATACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACACGGT GCGGCAAGCC TTGGAAGGTT 

151 TTGACGTGTT CCAGCAATGC TTCGCGCGGC AAACCGACGG CCTCGCACAA 

201 AGCCACGGCA GCCATAACGT TGGCGGCGTT GTGCAAACCT TGCAGCGGGA 

251 TGTCTTGCGT AGAAATCAAA TCTTCATTGC CTTGTTTTAA ACAGCCCGTC 

301 CCGCGTTCCA ACCAAAAATC GGCTTCGTGT TCCAAGGAAA ACOGTTTCAC 

351 TTCACGCCCT GCCCGTTTCA TGGCGCGGCA GAACACGTCG TCCGCATTCA 

401 AAACCTGCAC TCCATCGCCA CGGAAAATCT CGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 290; ORF 089.ng>: 

g089 .pep 

1 KPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV 

51 LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV 

101 PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 29 1>: 

m089 .seq 

1 ATGCCGCCCA AAATCACkAw GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA AACACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGGAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG CCTCACACAA 

201 AGCCACkGCA GCCATGACGT TAGCGGCGTT GTGCAkACCT TGCAACGGwA 

251 TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG GCGGCCTGTC 

301 TCGCGTTCCA ACCAGAAATC AGCTTCGTGT T CCAACGAAA ACCATTTTAC 

351 CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG TCCGCATTCA 

401 AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCATAG 

This corresponds to the amino acid sequence <SEQ ID 292; ORF 089>: 

m089 .pep 

1 MPPKITXSGF CKP AIAAAVA PTFVPLL5SI NTTPFFSPIF STRCGKPWKV 

51 LTCSSNASRD KPMASHKATA AMTLAALCXP CNGMSCVTIK SSLPCFRRPV 

101 SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 089 shows 88.6% identity over a 149 aa overlap with a predicted ORF (ORF 089.ng) 
from M. gonorrhoeae: 

m089/g089 
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10 20 30 40 50 60 

m089 pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

MINI I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 I 1 1 1 1 1 1 1 1 1 1 1 h I 1 1 1 i 1 1 1 I 1 1 II 

gO 8 9 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGKPWKVLTCSSNASRG 
10 20 30 40 50 60 

70 80 90 100 110 120 

m08 9 pep KPMAJSHKATAA>H'LAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 

II II I II |||: llll II I hill II II III I I l-l I MINI III hlhlllll 
gO 8 9 KPTASHKATAAITLAALCKPCSGMSCVEI KSSLPCFKQPVPRSNQKSASCSKENRFTSRP 

70 80 90 100 110 120 



130 140 150 

m089 .pep ARFIARQNASSAFKTCTPSPRKILALVCAX 

I I hi llh I I I II II II I I II I I III II 
g089 ARFMARQNTSSAFKTCTPSPRKISALVCAX 
130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 293>: 

a089. seq 

1 ATGCCGCCTA AAATCACGAA GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCA CCGACGTTCG TGCCTTTGCT GTCGTCGATG AACACCACGC 

101 CATTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGAAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGGC AAACCGACGG CTTCGCACAA 

201 GGCAACGGCA GCCATCACGT TAGTGGCGTT GTGCAAGCCT T3CAGCGGAA 

251 TATCTTGCGT GGCAATCAAA TCTTCATTGC CTTGTTTCAG GCGACCTGTC 

301 TCACGTTCCA AC C AAAAAT C GGCTTCGTAT TCCAACGAAA ACCATTTCAC 

351 CTCGCGCCCG GCGCGCTTCA TCGCACGACA GAACGCATCG TCCGCATTCA 

4 CI AAACCTGCAC ACCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 294; ORF 089.a>: 

a089 .pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGRP*KV 

51 LTCSSNASRG KPTASHKATA AITLVALCKP CSGISCVAIK SSLPCFRRPV 

101 SRSNQKSASY SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 



m0B9/a089 91 .9% identity over a 149 aa overlap 

10 20 30 40 50 60 

ro08 9.pep MPPKITXSGFCKPAIAAAVAPTFVFLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I : I I I I I i I I I I i ! I I I I I I I I I I I I I I I I 
a 08 9 MPPKITKSGFCKPAIAAAVAPTFVFLLSSMNTT PFFSPIFSTRCGRPXKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m089 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 
II I I I I t I I S : I I : I I I I I : I : 1 I I : I I I I I I I I I I I I I I I I I I I I I I U ! I I I I I I 
a 089 KPTA5HKATAAITLVALCKPCSGISCVAIKSSLPCFRRPVSRSNQKSASYSNENHFTSRP 

70 80 90 100 110 120 



130 140 150 

m089.pep ARFIARQNASSAFKTCTPSPRKILALVCAX 
I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I 
a089 ARFIARQNASSAFKTCTPSPRKILALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 295>: 

g090 . seq 

1 ATGCGCGTAG TCGAGCAAAT CGTCGTAGCG GTCGAGATGG 7CTTCGGAAA 
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51 TGTTCATCAC CGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCcca CACCCACACG TCCGCCTTTT TGCCTTCgcg 

151 ctgCAATtct gcctccaaga cgggcgtacc gatATTGCCC GCAATGAcgg 

201 tatccagccc gcacttgatg CAGAGatagc ggaccaggct ggttaccgTG 

251 GTTttgccgt tgctgCcggt aatcgCaacc accttgtcgC CGCGGCGGtt 

301 cAcaaTGTCc gccaGCAATt ggATGTCGCC TAgCACGCGC . ccgccgTTT 

351 TGCttga 

This corresponds to the amino acid sequence <SEQ ED 296; ORF 090.ng>: 

g090.pep 

1 MRWEQIWA VEMVFGNVHH RRRSRAQAFG VFQLEAGKLP HPHVRLFAFA 

51 LQFCLQDGRT DIARNDGIQP ALDAEIADQA GYRG FAYAAG NRNHLVAAAV 

101 HNVRQQLDVA XHAXRRFA* 

The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 297>: 

m0 90.seq 

1 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT .CCGCCGTTT 

. 351 TGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 298; ORF 090>: 

m090 -pep 

1 MRIVEQWVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

51 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLWPAV 

101 HNVRQQFDVA QHAXRRFA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 090 shows 83.9% identity over a 1 1 8 aa overlap with a predicted ORF (ORF 090.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

MRIVEQWVAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 

I Hlhlili 11)111 hMM hll Mill ill ill IIMIIIIM! I Ih h 

MRWEQIWAVEMVFGNVHHRRRSRAQAFGVFQLEAGKLPHPHVRLFAFALQFCLQDGRT 
10 20 30 40 50 60 

70 80 90 100 110 119 

DI ARDNGI QPALDTE I ADQ AR YRGF AVAAGNRN YL W PAVHNVRQ QFD VAQHAXRRF AX 

lll|::||||llhllllll lllllllllllhlh lllllllhlll IMIMM 

DI ARNDGI QPALDAE I ADQAGY RGFAVAAGNRNHLVAAAVHNVRQQLDVAXHAXRRFAX 
70 80 90 100 110 



The following partial DNA sequence was identified mN. meningitidis <SEQ ID 299>: 

a090 .seq 

1 ATGCGCGTAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC TGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAACTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCAATTCC GCCTCCAAAA CCGGCGCGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC ACACTTGATG CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATC ACCTTGTCGC CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTT 

351 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 300; ORF 090.a>: 



m090/g090 

m090 .pep 
g090 

m0 90 .pep 
g090 
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MRWEQWVA VEMVFGNVQH CRRSRAQAFG VFQLETGKLQ HPHVRLFAFA 
LQFRLQWRRA DIARDKGIQP TLDAEIADQA RYRGFAVAAG NRNHLVAAAV 
HNVRQQFDVA QHAXRRFA* 

91.5% identity over a 1 17 aa overlap 

10 20 30 40 50 60 

MRIVEQVWAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 
(I : | I | I I I I I I I I I I I II I I I I I: I I I I I I I I I : I I I I I I I I I 1 I I I II 1 I I I I I I I 
MR VVEQV WAVEMV FGNVQHCRR SRAQA FGV FQLETGK LQH PHVRL FA FALQ FRLQNRRA 

10 20 30 40 50 60 

70 80 90 100 110 119 

D I ARDNG IQPALDTE I ADQARYRGFAVAAGNRNYLVVPAVHNVRQQFDVAQKAXRRFAX 
i I I I I I I 1 II : I I : I I I I I I I I I I I I i I I I I I : I I : I I I I I I I I I II I I I II I I I I I 
DIARDNGIQPTLDAEIADQARYRGFAVAAGNRNHLVAAAVHNVRQQFDVAQKAXRRFAX 

70 80 90 100 110 



The following partial DNA sequence was identified in TV. gonorrhoeae 

g090-l.seq This sequence contains multiple stop codons (not shown) 

This corresponds to the amino acid sequence < ORF 090-1. ng>: 

g090-l.pep (not shown) 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 2>: 

m090-l.seq 

1 ATGACGGCGT TTGCATTTCA GACGGCATCA CAAAGCCTTA AACGCTTCGA 

51 TAAACACTTC CGAACGGTGC GCGTAGCCTT TGAACATATC AAAGCTCGCG 

101 CAGGCGGGGC TGAGCAACAC AATATCGCCT GCTTCGGCTT GGGCATATGC 

151 CGTCTGAACG GCTTCTCCCA AAGTGGCGCA GTCGGTCATA TTCAAGCCGC 

201 AGCCGTCCAA ATCGCGGCGG ATTTGCGGCG CATCGACACC AATCAAGAAC 

251 ACGCCTTTTG CCTTGCCTAC CAGTGCATCG CGCAGGGGCG TGAAGTCCTG 

301 CCCTTTACCC ATGCCGCCCA AAATCACGAA GAGCGGATTT TGCAAACCGG 

351 CAATCGCGGC GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA 

401 AACACCACGC CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC 

451 TTGGAAGGTT TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG 

501 CCTCACACAA AGCCACGGCA GCCATGACGT TAGCGGCGTT GTGCAGACCT 

551 TGCAACGGAA TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG 

601 GCGGCCTGTC TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA 

651 ACCATTTTAC CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG 

701 TCCGCATTCA AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT 

751 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

801 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

851 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

901 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

951 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

1001 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

1051 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTTT 

1101 GCTTGAACGC CTCAATATCC CGCTGCCCCT CGCTGATGCC CGGACTGAGA 

1151 GCCAGAATAT CGAAACCGTT GTCCAGCGCA TCTTTCAGAC GGCCCGTGTA 

1201 AAACACCAAC CCGTCAAACA TCTTACCGAT TTGCGACACG CGTTCCGGCT 

1251 TCAGCTCCGC ATCATACGCA GCAACCTCCG CGCCGTTTTT GCGCAGGTAG 

1301 GCAATCATGG AAAT ACCCGT ACCGCCGAGT CCGGCGACGA GGATTTTTTT 

1351 GTTTTGAAAA GTCATTTTGG TTTGTCCTAA 

This corresponds to the amino acid sequence <SEQ ID 3; ORF 090- 1>: 

m090-l.pep 

1 MTAFAFQTAS QSLKRFDKHF RTVRVAFEHI KARAGGAEQH NIACFGLGIC 

51 RLNGFSQSGA VGH I QAAAVQ IAADLRRIDT NQEHAFCLAY QCIAQGREVL 

101 PFTjiAAQNHE ERILQTGNRG GS RAD IRA FA VVDKHHAVFL ADFFHAVRQA 

151 LEG FDVFE QC FARQTDGLTQ SHGSHDVSGV VQTLQRNVLR PN QIFIALFQ 

201 AACLAFQPEI SFVFQRKPFY LAPGTLHRAA ERIVRIQNLH AVATENLGFG 

251 MRIVEQWVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLO HPHVRLFAFA 

301 LP FRLQNRRA OIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLWPAV 

351 HNV3QQFDVA QHASAVLLER LNIRLPLADA GTESQNIETV VQRI FQTARV 

401 KHQPVKHLTD LRHAFRLQLR I IRSNLRAVF AQVGNHGNTR TAESGDEDFF 

4 51 VLKSHFGLS * 



a090.pep 

1 
51 
101 

m09/a090 

m090.pep 
a090 

m090.pep 
a090 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 303>: 

g091. seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCC3GCG ACGAGGATTT TTTTGTTTGA 

51 AAGTCATTTT GGTTTTGTCC TAAAACAAAT CATATTGGGC AGGAGACGTC 

101 CGCCCTTGCC CAAGCCGCTT TCAGACGGCA TCG CGAGCCG ATTAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCT GTTTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTGT TGAGCGAGCG CAAGGTCTTG GCGCACGCCG 

251 CGTCCATCGT AATACATCAA GCCCAAATTG TATTGGGCTT GGGCATCCCC 

301 TTGTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 304; ORF 091.ng>: 

g091 .pep 

1 MEIPVPPSPA TRIFLFESHF GFVLKQIILG RRRPPLPKPL SDGIASRLIT 

51 RLQ ALVIVAA VLVSVLTSLA KPLLSERKVL AHAAS IVIHQ AQIVLGLGIP 

101 LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 305>: 
m091. seq 

1 ATGGAAATAC CCGTACCGCC GAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAAGTCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGAGATGTC 

101 CGCCCCTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCC GTCTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAACCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGCTT GGGCTACCCC 

301 CTGCGC... 

This corresponds to the amino acid sequence <SEQ ID 306; ORF 09 1>: 

m091 -pep 

1 MEIPVPPSPA TRIFLFEKSF WFVLKQIILS RRCPPLPKPL SDGIASCSIT 
51 RL QALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFG IHH AQIVLGLGYP 
101 LR. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 091 shows 84.2% identity over a 101 aa overlap with a predicted ORF (ORF 091 .ng) 

from K gonorrhoeae: 

m091/g091 

10 20 30 40 50 60 

m091.pep MEIPVPPSPATRIFLFEKSFVTFVLKQIILSRRCPPLPKPLSDGIASC3ITRLQALVIVAA 

iimmimiiiii: i iiiiiiihii iiiiiiiiiini mimnm 

g091 MEIPVPPSPATRIFLFESHFGFVLK0IILGRRRPPLPKPLSDGIASRLITRLOALVIVAA 
10 20 30 40 50 60 

70 BO 90 100 

mO 9 1 . pep VLVSVLTSLAKPFLCKGAVLAHAAS FGI HHAQ I VLGLG YPLR 

I II ill I II II hi -• II Mill: II :||| I II II II 
g 0 9 1 VLVSVLTSLAKPLLS ERKVLAHAAS I VI HQAQI VLGLG I PLFX 

70 80 90 100 



The following partial DNA sequence was identified in N. meningiditis <SEQ ID 307>: 

a091 . seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTTG 

51 GAAATCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGGGATGTC 

101 TGATCCTGCT CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCTTTCAGG CGTTGGTCAT TGTCGCAGCT GTCTTGGTAT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGC 
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This corresponds to the amino acid sequence <SEQ ID 308; ORF 091 .a>: 

a091 .pep 

1 MEIPVPPSPA TRIFLFWKSF WFVLKQIILS RGCLILLKPL SDGIASCSIT 
51 RFO ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAAS FGIHH AQIVLG 

m09i/aO9i 93.8% identity over a 96 aa overlap 

10 20 30 40 50 60 

m091.pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I i II I M I I I 
a091 MEIPVPPSPATRIFLFWKSFWFVLKQIILSRGCLILLKPLSDGIASCSITRFQALVIVAA 
10 20 30 40 50 60 



70 80 90 100 

m091 . pep VLVSVLTSLAKPFLCKGAVLAHAAS FGIHHAQIVLGLGYPLR 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I 
a091 VLVSVLTSLAKPFLCKGAVLAHAAS FGIHHAQIVLG 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 309>: 

g092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGTGCGC 

51 AAACGGTCAG ACCTTTAAAA 7AACGCCTTT ACGCACTAAA AACCAACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTAA GCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGAGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 

301 AACGGTgcgg ATGTCGTCGT TGCCTCTACC GCCGTCAAGA AAGAAaatcC 

351 CGAAGTtgtc gcTGCGTTGG AGCGGCAAAT TCCCGTTATT CCGCGCGCCT 

401 TGATGCTGGC AGAGCTGATG CGCTTCCGTG ACGgcatcgc cattgccggT 

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

501 GGCAGGACTC GACCCCACTT TCGTTATCGG CGGCAAACTC AACGCCG CAG 

551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

601 GAATCCGATG CCTCTTTCCT ACATCTGACC CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 

701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 

751 GCCTTTTTGT GTGTTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGc gtggcGCTgg 

1001 aagtcGgCGC ATcggttgAA GCGAtcCAAA AaggCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATAcggc gacatCAagt tgccaaacgg 

1101 cggGaccgCT TTgctGGTGG ACGATTAcgg ACACCACCCC GTCGAAATGG 

1151 CGGcaaccct tgccgcTGCA CGCGGCGCGT ATCCGGAAAA acgtTTGGTG 

1201 CtcgCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTACTCAATA CCGTTGatgC GCTGGTACTG ACCGAAGTTT 

1301 AtgccgccgG CGAAGAGCCG GTTGCCGCCG CCGactCCCG CGCCTTGGCG 

1351 CGTGCTATCC GCGTATTGGG CAAACTTGAG CCGATTTACT GCGAAAatgt 

1401 cgccgACCTG CCGCAAATGC TGATGAATGT TTTACAGGAT Ggcgatgttg 

1451 tgttgAATAT GggTgcggga agcatcaacc gcgttccttc cgcgctgttg 

1501 gaattgtcga AACAGAtttg A 

This corresponds to the amino acid sequence <SEQ ID 310; ORF 092.ng>: 

g092 .pep 

1 MFFISIRYIF VRKLWCANGQ TFKITPLRTK NQPERNIMMK NRVSNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLSSLGIQ VYPGHTAEHV 

101 NGADVWAST AVKKENPEW AALEROIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 
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251 AFLCVDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLNALAAIG VALEVGASVE AI QKGLLGFE 

3 51 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYPEKRLV 

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP VAAADSRALA 

451 RAIRVLGKLE PI YCENVADL PQMLMNVLQD GDWLNMGAG SINRVPSALL 

501 ELSKQI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ TD 31 1 >: 

m092 .seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTTA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGCATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAgGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC CGAACACGTT 

301 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC 

351 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

401 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC 

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

551 GCACTAACGC CCGCTTGGGC AAAGG CGAAT ACATCGTTGC CGAAGCCGAC 

601 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 

701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 

751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATT CAC CGTCCATGTT 

901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG 

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGG CTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG 

1101 CGGGACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TGCCGCCGCA CGCGGCGCGT ATCTGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

1301 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CCGATTCCCG C3CTCTTGCC 

1351 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 

1401 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG 

1451 TGTTGAATAT GGGCGCGGGA A3CATCAACC GCGTCCCCGC CGCGCTGCTG 

1501 GCATTGTCGA AACAGATTTG A 

This corresponds to the amino acid sequence <SEQ ID 3 12; ORF 092>: 

m092.pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVWTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLN ALAAIG VALEVGASVE AIQ KGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYLE KRLV 

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 

451 RAIRVLGKLE PI YCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 

501 ALSKQI+ 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 092 shows 96.6% identity over a 506 aa overlap with a predicted ORF (ORF 092.ng) 
from K gonorrhoeae: 

ra092/g092 

10 20 30 40 50 60 

m092.pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 
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g092 



iiiiiiiiimiii 1 1 1 1 Mini! i miiiiiiinniiiiiiiiiiiiii 

MFF I S I RYI FVRKLWCANGQTFKI TPLRTKNQPERNIMMKNRVSNI HFVG IGGVGMSGI A 
10 2C 30 40 50 60 



70 80 90 100 110 120 

mO 92 . pep EVLHNLGFKVSGSDQAKNAATEHLGSLGIQVYPGHTAEHVNGADVVVTSTAVKKENPEW 

I I II II II III III II II IIIMhlll II INI II I III I II! Ill : II I II II II III 
gO 92 EVLHNLG FKVSGSDQARNAATEHLS SLG I QV Y PGHTAEHVNGADVWAS TAVKKENPEW 

70 80 90 100 110 120 

130 140 150 160 170 180 

m092 .pep AALEQQ I PV I PRALMLAELMRFRDG I AI AGTHG KTTTT S LTAS I LGAAGLDPT FV I GGKL 

I I ih iii iii ii i ii ii iii mi ii iii , m in i iii i nun iiiiiiiiiiiii 

gO 92 AALERQI PV I PRALMLAELMRFRDGIAIAGTHGKTTTTSLTAS I LGAAGLDPTFVIGGKL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m092 .pep NAAGTN ARLGKGE Y I VAE AD ES DAS FLHLT P I MSWTNI D EDHMDT YGH S VE KLHQ AF I D 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I II I I I I I I I II I I II I II I I I II 
g092 NAAGTN ARLGKGE Y I V AE ADES DAS FLHLT P I MSWTNI DEDHMDT Y GH S VE KLHQ AF I D 

190 200 210 220 230 240 



250 260 270 280 290 300 

mO 92 .pep FIHRMPFYGKAFLC I DSEHVRAI LPKVS KP YATYGLDDTADI YATD I ENVGAQMKFTVHV 

i iinniinnhi ii iiiiinniii iniiininiii iii iiiiiiiiiiiii 

g092 FI HRMPF YGKAFLCVDSEHVRAI LPKVS KP YATYGLDDTAD I YATD I ENVGAQMKFTVHV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m092 .pep QMKGHEQGSFEWLNMPGRHl^LNAI^IGVALEVGASVEAIOKGLLGFEGVGRRFQKYG 

1 1 inin linn ii M ill ill ill ii 1 1 ii i ii 1 1 ii i in in iii niiinni 

g092 QMKGHEQGSFEWLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
310 320 330 340 350 360 

370 380 390 400 410 420 

mO 92 . pep DI KLPNGGT ALLVDD YGHH PVEMAATLAAARGAYLE KRLVLAFQPHRYTRTRDLFED FTK 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
gO 9 2 DI KLPNGGTALLVDD YGHHPVEMAAT LAAARGAY P E KRLVLAFQPHRYTRTRDLFED FTK 

370 380 390 400 410 420 

430 440 450 460 470 480 

mO 92 . pep VLNTVDALVLTEVYAAGEE P I AAADS RALARAI RVLGKLEP I YCENVADLPEMLLNVLQD 

I I I I I I I I I I II I I I I I II 1 = I I I I I I I I I I I I I I I I I I I I I I I I I I I I I h I h I I I I I 
g092 VLNTVDALVLTEVYAAGEE PVAAADS RALARAI RVLGKLE P I YCENVADLPQMLMNVLQD 

430 440 450 460 470 480 



490 500 
IU092 .pep GD I VLNMGAGS I NRVPAALLALS KQ I X 

ihimiimiiihiii llllll 

g092 GDWLNMGAGS INRVPSALLELSKQIX 

490 500 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 313>: 

a092. seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTGA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 TTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 
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301 


AACGGTGCGG 


ATGTCGTCGT 


351 


CGAAGTTGTC 


GCTGCGTTGG 


401 


TGATGTTGGC 


GGAGTTGATG 


451 


ACGCACGGCA 


AAACCACGAC 


501 


GGCAGGACTT 


GACCCGACTT 


551 


GCACCAACGC 


CCGCTTGGGC 


601 


GAGTCGGATG 


CATCCTTTCT 


651 


CAATATCGAC 


GAAGACCATA 


701 


TGCATCAGGC 


GTTTATCGAT 


751 


GCCTTTTTGT 


GTATTGACAG 


801 


GAGCAAACCT 


TATGCTACTT 


851 


CCACCGACAT 


CGAAAACGTC 


901 


CAAATGAAAG 


GACATGAGCA 


951 


CGGCAGACAC 


AACGTGCTGA 


1001 


AAGTCGGCGC 


ATCGGTTGAA 


1051 


GGTGTCGGCC 


GCCGCTTCCA 


1101 


TGGAACCGCG 


CTCTTGGTGG 


1151 


CGGCGACCCT 


TTCCGCCGCA 


1201 


CTCGCCTTCC 


AGCCGCACCG 


1251 


CTTTACCAAA 


GTCCTCAATA 


1301 


ATGCCGCCGG 


TGAAGAGCCG 


1351 


CGCGCCATCC 


GCGTGTTGGG 


1401 


TGCCGATCTG 


CCCGAAATGC 


1451 


TGTTGAATAT 


GGGTGCGGGA 


1501 


GAATTGTCGA 


AACAGATTTG 



TACCTCTACC GCCGTCAAAA AAGAAAATCC 
AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 
CGCTTCCGTG ACGGCATCGC CATTGCCGGC 
CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
TCGTTATCGG CGGCAAACTC AACGCCGCAG 
AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 
GCACCTGACA CCGATTATGT CCGTCGTTAC 
TGGATACCTA CGGGCACAGT GTTGAGAAGC 
TTCATCCACC GTATGCCCTT CTACGGCAAA 
CGAACACGTC CGCGCGATTT TGCCCAAAGT 
ACGGTTTGGA CGATACCGCC GACATCTACG 
GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
ACGCATTGGC AGCCATCGGC GTGGCGCTGG 
G CG ATCCAAA AAGGCTTGCT CGGCTTTGAA 
AAAATACGGC GACATCAAGT TGCCAAACGG 
ACGACTACGG ACACCACCCC GTCGAAATGG 
CGCGGCGCGT ATCCGGAAAA ACGTTTGGTA 
CTATACCCGC ACGCGCGATT TGTTTGAAGA 
CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 
ATTGC"GCCG CTGATTCCCG CGCTCTTGCC 
CAAACTCGAG CCGATTTACT GCGAAAACGT 
TGTTGAACGT TTTGCAGGAC GGCGACATCG 
AGCATCAACC GCGTCCCCGC CGCGCTGCTG 
A 



This corresponds to the amino acid sequence <SEQ ID 314; ORF 092.a>: 

a092 .pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVWTST AVKKENPEW AAXEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLSAA RGAYPZKRLV 

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 

451 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 

501 ELSKQI* 



m092/a092 99.4% identity over a 506 aa overlap 

10 20 30 40 50 60 

m092.pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 

• I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I | | | | | I | | | || 

a092 MFFI S IRYI FVRKLWRANGQPFKT T PLRI EN PPERN IMMKNRVTN IH FVG IGGVGM SG I A 

10 20 30 40 50 60 

70 80 90 100 110 120 

m092.pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVWTSTAVKKENPEW 
I > I I I I I t I 1 1 I I I f I I t I I I I ! 1 1 I 1 1 I 1 1 I | f | | f | f 1 1 | 1 1 1 1 I | I I | t I f I E I I I ] 
a092 EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTSTAVKKENPEW 
70 80 90 100 110 120 



m092.pep 
a092 

m092.pep 
a092 



130 140 150 160 170 180 

AALEQQI PVI PRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 
I f I I M I I M I I II I I II I I I I i I | I | | | | M | | | | | | | | | | | | | | | | | | | | | , | | , , | , 
AALEOOIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASTLGAAGLDPTFVIGGKL 
130 140 150 160 170 180 

190 200 210 220 230 240 

NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTN I DEDHMDTYGHS VEKLHQAFID 
I' I II I II M I I HI I I I MM I I I I I! I | M I I I I II I II I || Ml I H I I I I I IN I I 
NAAGTNAR LGKGEY I VAEADES DAS FLHLTP IMS WTNIDEDHMDTYGHS VEKLHQAFID 



WO 99/57280 



PCT/US99/09346 



292 



190 200 210 220 230 240 

250 260 270 280 290 300 

m092 . pep FIHRMPFYGKAFLCIDGEHVRAILPKVSKPYATYGLDDTADiyATDIENVGAQMKFTVHV 
I I I M I I I I I I I I I I I I I I I II I i I I I I I I I I I I I II II I I I I I I I I I I I i I I i I I I I I I 
a092 FI HRMP FYGKA FLC I DS EHVRA I L PK VS K P Y AT YG L D DT ADI Y AT D I EN VG AQMKFTVH V 

250 260 270 280 290 300 

310 320 330 340 350 360 

mO 92 . pep QMKGHEQGSFEVVLNMPGRHNVLNALAA3GVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I II I I I i I I I I I I I I I 
a092 QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 

370 380 390 400 410 420 

mO 92 . pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I I I I I I I I I I I I I M I I III I I I 
a092 DIKLPNGGTALLVUDYGHHPVEMAATLSAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 

430 440 450 460 470 480 

m092 . pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 
I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
a092 VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 

430 440 450 460 470 480 

490 500 
mO 92 . pep GDI VLNMGAGSINRVPAALLALSKQIX 

i I I I I I I I I I I I i I II I I I I I I I I I I 
a092 GDIVLNMGAGSINRVPAALLELSKQIX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 315>: 
g093 . seq 

1 aTGCAGAATt ttgGCAAAGT ggccgtATTG ATGGGtggtT TTTCCAGCGA 

51 ACGAGAaatC tcgcTGGACA GCgGTACCGC CAT TTTG AAC GCCTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGACC CTAAGGAAAC GCCGTTATCC 

151 GAACTGAAGG AGCGGGGCTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCCTCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTACCCGTTC CCGAGTTCGC 

351 CGTACTGTAC GATGATACCG ATTTCGATGC CGTCGAAGAA AAATTGGGTC 

401 TGCCGATGTT TGTGAAGCCG GCGG CCGAAG GCAGCAGCgt cggcgtggta 

451 aAAGTCAAAG AAAaaggccg TCTGAAAAGC GTTtacgaag aatTGAaaCA 

501 CCTTcagggg cgaAAtcatt gccgAacgTT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATCCC 

601 CGCAACCGAG TTTTACGAct acgaagccaa GtacaaCCGA GACGAcacca 

651 tttaTCAATG TCCTTCGGAA GATTTGACCG AAG CCGAAG A AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCACAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 316; ORF 093.ng>: 

g093 -pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKERGFQTA FNILHGTYGE DGAVQGALEL LG1PYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLY DDTDFDAVEE KLGLPKFVKP AAEGSSVGW 

151 KVKEKGRLKS VYEELKHLQG RNHCRTFYRR RRIFLPRPER QRAARHTHHP 

201 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RTGNRCGRLR 

251 ARRFPORYRR QTLSVGNQHP ARYDRP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 7>: 

n093 . seq 
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ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 
ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 
GCAAAGGCAT AGACGCATAC GCCTTCGATC CTAAAGAAAC CCCATTGTCT 
GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 
TTACGGCrAA GACGGGGCGG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 
CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 
CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 
CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 
TGCCGATGTT TGTGAAACCG SCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 
AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 
CCTTCAGGG. CGAAATCATT GCCGAACGTT TTATCGGCGG CGGCGAATAT 
TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATTCC 
CGCAACCGAG TTTTACGACT ACGAAGCCAA GTACAACCGC GACGACACCA 
TTTATCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG 
CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 
GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
TCAACACCCT GCCCGGTATG ACGAGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 318; ORF 093>: 

m093 . pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGX DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHLQX RNHCRTFYRR RRIFLPRPER QRAARHTHHS 

201 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDEP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 093 shows 96.7% identity over a 276 aa overlap with a predicted ORF (ORF 093. ng) 
from N. gonorrhoeae: 

m093/o093 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 



10 20 30 40 50 50 

m093 . pep MQNFGKVAVLMGGFSSEREISLDSGTAI LN ALKS KG I DA YAFDPKETPLS ELKAQGFQTA 

I II II II :il M M III III I li !| II IN I | I : II I III II III I I | M Ml :||||| 
g093 MQNFGKVAVLMGGFSSERB ISLDSGTAI LNALKSKGIDAYAFDPKETPLSELKERGFQTA 

10 20 30 40 50 50 

70 80 90 100 110 120 

m093 . pep FNII^GTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 

HI! i I III III Mill; II M III I HUM II III Mill I II llllllllh 

g 0 93 FNI LHGTYGEDGAVQGALELLGI P YTGS GVAASAI GMDKYRCKL I WQ ALGL P VP E FAVL Y 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 93 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 

i ii inn ii i illinium ii mm ii milium mi mm mi 

g0 93 DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKEKGRLKSVYEELKHLQG RNHCRTFYRR 

130 140 150 160 170 180 



m093 -pep 
g093 



190 200 210 220 230 240 

RRTFI.PRPERQRAARHTHHSPNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 

1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M I M 1 1 1! 1 1 M I M I ( I II 1 1 II M I i M 1 1 1 1 

RR.I FXiPRPERQRAARHTHHPRNRVLRLRSQVQPRRHHliSMS FGRFDRSRRKPDARTGGSR 
190 200 210 220 230 240 



250 260 270 

mO 93 . pep RAGNRCGRLRARRFPQRYRRQTLS VGNQH P AR YDE PX 

h III I I I I III I I I I I I | | I | I | III I I Ml | | : | | 
g093 RTGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 

250 260 270 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 319>: 

a093.seq 

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CCAAGGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

401 T3CCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 

451 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 

501 CTTTCAGGG. CGAAATCATT GCCGAACGGT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCTG TGTTGAACGG CAAAGGCCTG CCCGGCATAC ACATCATCCC 

601 CGCGACCGAG TTTTATGACT ACGAAGCCAA GTACAACCGC AACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATCTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 



This corresponds to the amino acid sequence <SEQ ID 320; ORF 093.a>: 

a093.pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHFQX RNHCRTVYRR RRIFLPCVER CRPARHTHHP 

201 RDRVL*LRSQ VQPQRHHLSM SFGRSDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

m093/a093 95.7% identity over a 276 aa overlap 



10 20 30 40 50 60 

m093 . pep MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 
I I t t I I t I J I I 1 I I I 1 I I 1 I I I 1 I I f t I 1 t 1 1 1 I I I I t I I I I I 1 I I I f I I 1 I t I I I 1 t I I 
a093 MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m093 . pep FN I LHGTYGXDGAVQGALELLG IP YTGSGVAASAIGMDKYRCKL I WQALG LPVPEFAVLH 

I I t I I i I I I I 1 t I I I I I II I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I 
a093 FN I LHGTYGEDGAVQGALELLG I PYTGSGVAASAIGMDKYRCKL I WQALG LPVPEFAVLH 

70 BO 90 100 110 120 



130 140 150 160 170 180 

Itl093 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGVVKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 
! I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I: I I I I II I I III 
a093 DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKS VYEELKHFQX RNHCRTVYRR 

130 140 150 160 170 180 



190 200 210 220 230 240 

m093 . pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 
I I I I II till I I I I I I I : I I I I I I I II I : I I I I I I I I I I I I I I I I I I I I I I I I I 
a093 RRIFLPCVERQRPARHTHHPRDRVLXLRSQVQPQRHHLSMSFGRSDRSRRKPDARTGGSR 

190 200 210 220 230 240 



250 260 270 

m093 . pep RAGNRCGRLRARRFPQRYRRQT LS VGNQH PARYDE PX 

I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I : I I 
a 0 9 3 RAGNRCGRLRARR FPQRYRRQT LS VGNQH PAR Y DR PX 

250 260 270 



WO 99/57280 



PCIYUS99/09346 



295 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 32 1>: 

g094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCGAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTAccggc aatggcgatg 

151 cCGTCACGGA AGCGCATCAG CTCTGCCAGC ATCAAGGCGC GCGGAATAAC 

201 GGGAATTTGC CGCTCCAACG CAgcgacaAC TTCGGgatCT TCTTTCTTGA 

251 CGGCGGTAGA GGCAACGACG ACATccgcAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 322; ORF 094.ng>: 

g094.pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRISSAS IKARGITGIC RSNAATTSGF SPLTAVEATT TSAPLTCSAV 
101 WPG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 323>: 

ra094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTA GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCAT C AA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCGGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 324; ORF 094>: 

IB094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 094 shows 95.1% identity over a 103 aa overlap with a predicted ORF (ORF 094.ng) 
from N. gonorrhoeae: 

ra094/g094 

10 20 30 40 50 60 

m094 .pep MYSPLPKRALVPAALSLPFITKVGSSPAAPRMEAVRLVVWLPCVPAMAMPSRKRINSAN 

1 1 1 1 II III: III II I II I IN IIIIIIM IIIIIIIIIIIIMII lllllllhll: 

g094 MYSPLPKRALVPAALSLPFITKVGSSPAAPRMEAVRIiVVWLPCVPAMAMPSRKRISSAS 
10 20 30 40 50 60 

70 80 90 100 

m094 .pep I RARGITGI CCSNAATTSGFS FLTAVEVTTTSAPLTCSAVWPGX 

MINIMI UNI llllllilll hllM lllllllllli I 
Q094 I KARG I TGI CRSNAATTSGFS FLTAVEATTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 325>: 

a094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 



This corresponds to the amino acid sequence <SEQ ID 326; ORF 094.a>: 
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a094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLWV VLPCVPAMA M 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

m094/a094 1 00.0% identity over a 1 03 aa overlap 

10 20 30 AO 50 60 

ItiG94 .pep MYS PL PKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAM PSRKRINSAN 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I II I 1 I I I I I I I II I I I I 1 I I || I I I I I I 
a0 94 MYS PLPKRALVPAALSLPPI TKVGSSPAAP RMEAVRLWVVLPCVPAMAMPSRKRINSAN 

10 20 30 40 50 60 



70 80 90 100 

m0 94 .pep I RARG I TG I C C SN AATT SG F S FLT AVE VT T T S A PLT C S AVW PGX 

I I II I I I I! I II I I II I I I I I I I I I I I I I I I II I I I I I I I f I I I 
a0 94 IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 327>: 

'CATTTGC 3CGCCGACGT 
"CGTCCAA ACCGTAAGTA 
JCGGACGT GTTCGCTGTC 



g095.seq 








1 


ATGTCCTTTC 


ATTTGAACAT 


GGACGGTGAA 


51 


TTTCGATGTC 


GGTGGCGTAG 


ATGTCGGCGG 


101 


GCATAAGGTT 


TGCTCACTTT 


GGGCAAAATC 


151 


AACACACAAA 


AAGGCTTTGC 


CGTAGAAGGG 


201 


TAAACGCCTG 


ATGCAGTTTT 


TCGACGCTGT 


251 


TCTTCGTCGA 


TATTGGTAAC 


GACGGACATA 


301 


GAGGCATCGG 


ATCGTCGGCT 


TCGGCAACGA 


351 


CGGGCGTTGG 


TGCCTGCGGC 


GTTGA 



This corresponds to the amino acid sequence <SEQ ID 328; ORF 095 .ng>: 

g095.pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDK RL MQFFDAVPVG IHMVFVDIGN DGHNRGQCRK 

101 EASDRRLRQR CIRLCPSGRW CLRR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 329>: 



m095. seq 








1 


ATGTCCTTTC 


ATTTGAACAT 


GGACGGTGAA 


51 


TTTCGATGTC 


GGTGGCGTAG 


ATGTCGGCGG 


101 


GCATAAGGTT 


TGCTCACTTT 


GGGCAAAATC 


151 


AATACACAAA 


AAGGCTTTGC 


CGTAGAAGGG 


201 


TAAACGCCTG 


ATGCAGTTTT 


TCGACGCTGT 


251 


TCTTCGTCGA 


TATTGGTAAC 


GACGGACATA 


301 


GATGCATCCG 


ACCGTCGGCT 


TCGGCAACGA 


351 


CGGGCGTTAG 


TGCCTGCGGC 


GTTGA 



This corresponds to the amino acid sequence <SEQ ID 330; ORF 095>: 

m095.pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKR L MQFFDAVPVG IHMVFVD IGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRX CLRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 095 shows 97.6% identity over a 124 aa overlap with a predicted ORF (ORF 095. ng) 
from N. gonorrhoeae: 

m095/g095 

10 20 30 40 50 60 

m095 . pep MSFHLNMDGE FH LRADVFDVGGVDVGGI VQTV SS I RFAHFGQNRADVFAVNTQKG FAVEG 

I lllll III II Ml III III III 1 1 Mill Mlllh II Mil lllll llllllil III I 
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g095 MS FHLNMDG E FH LRAD V FD VGGVD VGG I VQT V SS I R FAH FGQNRADV F AVNTQKG FAV E G 

10 20 30 40 50 60 

70 80 90 100 110 120 

m095 . pep HTVDEI DKRIiMQFFDAVPVGI HMVFVDI GNDGHNRCQCRKDASDRRLRQRCI RLCPSGRX 

II I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I | | 
g095 HTVDEIDKRLMOFFDAVPVGIHMVFVDIGNDGHNRGQCRKEASDRRLRORCIRLCPSGRW 
70 80 90 100 110 120 



m095.pep CLRRX 

inn 

g095 CLRRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33 1>: 

a095.seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA- ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGCTTC TCAACACTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTGG TGCCTGCGGC GTTGA 



This corresponds to the amino acid sequence <SEQ ID 332; ORF 095.a>: 

a095.pep 

1 MS FHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKRL MQLLNTVPVG IHMVFVDIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRW CLRR* 



m095/a095 



96.0% identity in 124 aa overlap 



10 20 30 40 50 60 

ra095 . pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i I I II 1 1 1 H 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

e095 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 
10 20 30 40 50 60 



70 80 90 100 110 120 

m095.pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

I 1 1 I I 1 1 I I I 1 1 :::: 1 1 1 1 M I f I I I 1 1 I 1 1 I I I I I I I I I I I I I I I I I I I I I II I I 1 1 
a095 HTVDEIDKRLMQLLNTVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 



m095.pep CLRRX 
Mill 

a095 CLRRX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 

333>: 

g096. seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGTGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGAtaaat ggtgTCGTCT CGGttgtaCt 
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201 tggcttcgta gTCGTAAAAC TCGGTTGCGG GGATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 AcgtTcggca atgaTT-cgc ccctgAAGGT GttTCAattc ttcgtaAACG 

351 CTTTTCAGAc ggccttTTTC TTTGA 



This corresponds to the amino acid sequence <SEQ ID 334; ORF 096.ng>: 

g096.pop 

1 MAGHTGQGVD FQQIEFAVGI FEE I DAHAAF RTDCLCAANR QFAHQAFFGF 

51 GQIFRRT LIN GVVSVV'^GFV VVKL GCGDDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAPEG VSILRKRFSD GLFL* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 335>: 



m096.seq 

1 ATGGCTCGTC ATACCGGGCA 

51 CGTCGGTATC TTTGAGGAAA 

101 GCCTGCGCGC CGCGAACCGC 

151 GGTCAAATCT TCCGAAGGAC 

201 TGGCTTCGTA GTCGTAAAAC 

251 AGCCCTTTGC CGTTCAGGAC 

301 ACGTTCGGCA ATGATTTCGC 

351 CTTTTCAGAC GGCCTTTTCC 



GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 
TCGACGCGCA CGCAGCCTTC CGCACCGATT 
CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 
ATTGATAAAT GGTGTCGTCG CGGTTGTACT 
TCGGTTGCGG GAATGATGTG TATGCCGGGC 
GGGGCAGGAA TATTCGCCGC CGCCGATAAA 
CC.TGAAGGT GTTTCAATTC TTCGTAAACG 
TTTGA 



This corresponds to the amino acid sequence <SEQ ID 336; ORF 096>: 

m096.pop 

1 MARHTGQGVD FQQIEFAVGI FEE I DAHAA F RTDCLRAANR QFAHQAFFGF 

51 GQIFRRT LIN GWAWLGFV WKL GCGNDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAXEG VSILRKRFSD GLFL * 

m096/g096 96.0% identity in 124 aa overlap 



10 20 30 40 50 60 

m096.pep MARHTGQGVDFQQIEFAVGIFEEIDAI1AAFRTDCLRAANRQFAIIQAFFGFGQIFRRTLIN 
II I I I 1 I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I II II I I I I I I I I I 
g096 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLCAANRQFAHQAFFGFGQIFRRTLIN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m096.pep G WAWLG FV WKLGCGN DVYAGQ P FAVQDGAG I FAAADKT FGN D FAXEGVS I LRKR FS D 

I I I : I I I I I I I I I I I I : I I I I I I I II I I I I I I I I 1 I II I I I I I I I I I I I I II II i I I I 
g096 GVVSWLGFVWKLGCGDDVYAGQPFAVQDGAGIFAAADKTFGNDFAPEGVSILRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 

inn 

gO 9 6 GLFLX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 337>: 

a096. seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAGATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTTG CGGTTGTACT 

201 TGGCTTCGTA GTCATAAAAC TCGGTCGCGG GGATGATGTG TATGCCGGGC 

251 AGGCCTTTGC CGTTCAACAC AGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 CCGTTCGGCA ATGATTTCGC CCT.GAAAGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 

This corresponds to the amino acid sequence <SEQ ID 338; ORF 096.ng>: 
a096.pep 

1 MAGHTGQGVD FQQIEFAVGI FEE I DAHAAF RTDCLRAANR QFAHQAFFGF 
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51 GQIFRRT LIN GWAWLGFV VIKL GR6DDV YAGQAFAVQH RAGIFAAADK 
101 PFGND FAXES VSILRKRFSD GLFL* 



m096/a096 92.7% identity in 124 aa overlap 

10 20 30 40 50 60 

m096.pep MARHTGQG VD FQQI EFAVGI FEE I DAHAAFRT DCLRAANRQFAHQA FFG FGQI FRRTLI N 

II I I I I I I I 1 I I I I I I II I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I M 
a096 MAGHTGQGVDFQQIEFAVGI FEE I DAHAAFRT DCLRAANRQFAHQAFFG FGQI FRRTLIN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m096.pep GWAWLGFVWKLGCGNDVYAGQPFAVQDGAG I FAAADKTFGNDFAXEG VSILRKRFSD 

11111111111:111 I : I I I I I I I I I I I I I I I II II I I I I I I II : I I I I I I I I I I 
a096 GWAVVLGFWIKLGRGDDVYAGQAFAVQHRAG I FAAADKPFGNDFAXES VSILRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 
II I I I 

a096 GLFLX 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 339>: 

g097 . seq 

1 ATGGATATTT CAAAACAAAC ATTGCTGGAT AGGGTTTTTA ACCTGAAGGC 

51 AAACGGTACG ACGGTACGTA CCGAGTTGAT GGCGGGTTTG ACGACCTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAATC CCCTGATTTT GGGCGAGACC 

151 GGAATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CATCCGCCAT 

201 CGGCTGTTTT GTCATGGGTT TTATCGGCAA CTATCCGATT GCGCTTGCCC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTGGC GTTGGGTGCG GTGTTCATTT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCCG CCGGTATCGG TTTGTTTTTG 

451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC CAGCGCACTG TTGGCATTGT 

551 TCGGTTTTGT CATGGTGGTC GTATTGGGGT ATTTC CGCGT TCAAGGCGCA 

601 ATCATCATCA CCATTCTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAACGAGTTT CACGGCGTGG TCGGCGAAGT ACCGGGCATT GCGCCGACCT 

701 TTATGCAGAT GGATTTTAAA GGTCTGTTTA CCGTCAGCAT GGTCAGCGTG 

751 ATTTTCGTCT TCTTCTTGGT CGATTTGTTC GACAGTACCG GAACGCTGGT 

801 CGGCGTATCC CACCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 

951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TGGCGTGTCT GATGTTCTCC CCATTGGCGA AAAGTGTTCC GGTATTTGCC 

1051 ACCGOGCCCG CACTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATTGAT TGGGACGATA TGACTGAAGC CGCGCCCGCG TTCCTGACCA 

1151 TTGT CTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 

12 01 TTCATCAGCT ATGCCGTGGT CAAACTTTTG TGTCGCCGGA CTGGGGACGT 
1251 GCCGCCTATG GTATGGGTTG TTGCCGTATT GTGGGCATTG AAATTCTGGT 

13 01 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 340; ORF 097.ng>: 

g097 .pep 

1 MDISKQTLLD RVFNLKANGT TVRTE LMAGL TTFLTMCYIV IVN PLILGET 

51 GMDMGAVFVA TCIASAIGCF VM GFIGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IVA NPATLVG LGDIHQPS AL LALFGFVMW VLGYF RVQGA 

201 IIITILTrTV IASLMGL NEF HGWGEVPGI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLVD LF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

3 01 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPVFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 



WO 99/57280 



PCTAJS99/09346 



300 



401 FISYAWKLL CRRTGDVPPM VWWAVLWAL KFWYLG * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 341>: 

m097 . seq 

1 ATGGACACTT CAAAACAAAC 

51 AAACGGTACk ACGGTGCGTA 

101 TGACGATGTG CTACATCGTT 

151 GGCATGGATA TGGGGGCGGT 

201 CGGCTGTTTT GTTATGGGTT 

251 CGGGGATGGG GCTGAATGCC 

301. GGCGTGCCTT GGCAGGTTGC 

351 TTTTATCCTG TTCAGCTTTT 

401 TGCCTATGGG TTTGAAAATG 

451 GCACTGATTT CCCTGAAAGG 

501 CTTGGTCGGT TTGGGCGATA 

551 TCGGTTTTGC TATGGTGGTC 

601 ATCATCATCA CCATCTTGAC 

651 GAATGAATTT CACGGCATCA 

701 TTATGCAGAT GGATTTTGAA 

751 ATTTTCGTCT TCTTCTTGGT 

801 CGG CATATCC CACCGTGCCG 

851 TGAAACGCGC ACTGCTTGCA 

901 TTGGGTACTT CTTCCACCAC 

951 GGCAGGCGGA CGGACCGGCC 

1001 TCGCCTGCCT GATGTTTTCA 

1051 ACCGCGCCCG CCCTGCTTTA 

1101 GGATATTGAT TGGGACGATA 

1151 TTGTTTTCAT GCCGTTTACT 

1201 TTCATCAGTT ATGCCGTGGT 

1251 TCCGCCTATG GTATGGATTG 

1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 342; ORF 097>: 

in097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTELMAGL TTFLTMCYIV IVN PXILGBT 

51 GMDMGAVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPW QVALGA VFISGLIFIIi FS FFKVREML WALPMGLKM SIAA GIGLFL 

151 AIiISLKGAGI IVA NPATLVG LQDIHQPS AL IALFGFAMW VLGHF RVQGA 

201 IIITILTITV IASLMGL NEF HGIIGEVPSI APTFMQMDFE GLFTVSMVSV 

251 I FVFFLVD LF DSTGTLVGIS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

4 01 FISYAWKLL CRRTKDVPPM VWIVAVLWAL KFWYLG * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 097 shows 96.3% identity over a 436 aa overlap with a predicted ORF (ORF 097.ng) 
from A r . gonorrhoeae: 

m097/g097 

10 20 30 40 50 60 

m0 9 7 . pep MDT S KQTLLDG I F KL KANGTTVRTELMAGLTTFLTMC Y I V I VN PX I LGE TGMDMGAVFVA 

I II I I I I I I = I : I I I I I I I I I I t I I I I I II I I II I I I I I I II I I I I I I t I I I I I I I I 
gO 9 7 MDI S KQTLLDRVFNLK7VNGTTVRTELMAGLTTFLTMCYI VIVNPLI LGE TGMDMGAVFVA 

10 20 30 40 50 60 



ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 
CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 
ATCGTCAACC CTCyGATTTT GGGCGAGACC 
ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 
TTGTCGGCAA CTATCCGATT GCACTCGCAC 
TATTTCACCT TTGCCGTCGT TAAGGGTATG 
GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 
TTAAAGTCAG GGAAATGCTG GTCAACGCAC 
TCGATTGCTG CCGGTATCGG TTTGTTTTTG 
CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 
TTCATCAGCC GTCCGCGTTG TTGGCATTGT 
GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 
CATTACCGTC ATTGCCAGCC TGATGGGTTT 
TCGGCGAAGT ACCGAGCATT GCGCCGACTT 
GGCCTGTTTA CCGTCAGCAT GGTCAGTGTG 
CGATCTATTT GACAGTACCG GAACGCTGGT 
GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 
GACTCTACCG CCATTGTGGC AGGTGCGGCT 
GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
TGACGGCGGT TACCGTCGGC GTATTGATGC 
CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 
TGTCGGCACG CAGATGCTCC GCAGTGCGAG 
TGACGGAAGC CGCACCTGCG TTCCTGACCA 
TATTCGATTG CAGACGGCAT CGCTTTCGGC 
TAAACTTTTA TGCCGCCGCA CCAAAGACGT 
TTGCCGTATT GTGGGCACTG AAATTCTGGT 



70 80 90 100 110 120 

m097 . pep TCIASAIGCFVWFVGNYPIAIAPGMGLNAYFTFAVVKGMGVPWQVALGAVFISGLIFIL 

I I I II I! II Mil h Ml llllllllltl II III II I I III I I It II I llllllllllll 
g097 TCI ASAI GCFVMGF IGN Y PI ALAPGMGLNAY FTFAWKGMGVPWQVALGAVFI SGL I FI L 

70 80 90 100 110 120 

130 140 150 160 170 180 
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FS F FKVREMLVNALPMGLKMS I AAG I GLFLALISLKGAGI I VAN PATL VGLGD I HQPS AL 

llllllllllllllllllll IIMIIIIIIIII III llllll llllllll III MINI I 

FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 
130 140 150 160 170 180 

190 200 210 220 230 240 

LAL FG FAMWVLGHFRVQGA 1 1 1 T I LT I TV I AS LMGLNE FHG 1 1 G EV P S I APT FMQMD F E 

I II llhlillihlll IMMIMIII IMIIMI IMIIhMllhMMMIMh 

LAL FG FVMWVLG YFRVQGA 1 1 1 T I LT I TV I AS LMGLNE FHG WG EV PGI A PT FMQMD FK 
190 200 210 220 230 240 

250 260 270 280 290 300 

GLFTVSMVSVI FVFFLVDLFDSTGTLVGI SHRAGLLVDGKLPRLKRALLADSTAI VAGAA 

I Mill I II II M Mil III II llll IM I II INI III II MM II 1 1 1 1 MM III I 

GLFWSMVSVIFVFFL\mLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 
250 260 270 280 290 300 

310 320 330 340 350 360 

LGT S S TT P YVE S AAGVS AGGRTGLTAVTVGVLMLACLMFS PLAKS VP AF ATAP ALLYVGT 

i 1 1 1 1 1 1 1 1 1 1 El 1 1 1 1 1 1 1 1 1 i 1 1 ) 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 

LGT S S TT P YVE S AAGVS AGGRTGLTAVTVGVLMLACLMFS PLAKS VP V F ATAP ALLYVGT 
310 320 330 340 350 360 

370 380 390 400 410 420 

QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 

I II MMM M III Ml Ml II MM MM Ml III I Mill MIMI 1 1 II 1 1 MM I 

QMLRS ARD I D WDDMTE AAP AFLT I V FMP FT YS I ADG I AFGF I S YAW KLLCRRTGD V P PM 
370 380 390 400 410 420 

430 

VWIVAVLWALKFWYLGX 
I I: I II I II I I I I I I I I 
VWWAVLWALKFWYLGX 
430 

The following partial DNA sequence was identified in N, meningitidis <SEQ TD 343> 

a0 97 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACG ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCTGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

401 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 

451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCACTGT 

551 TCGGTTTTGC CATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 

601 ATCATCATCA CCATTTTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAACGAATTT CACGGCATCA TCGGCGAAGT GCCGAGCATT GCGCCGACTT 

701 TTATGCAGAT GGATTTTAAA GGGTTGTTTA CCGTCAGCAT GGTCAGCGTG 

751 ATTTTCGTCT TTTTCCTAGT CGATCTGTTC GACAGTACCG GAACACTGGT 

801 CGGTGTATCG CATCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CTATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGTGCGG CGGGCGTATC 

951 GGCAGGCGGG CGGACAGGTC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1C01 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1C51 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATCGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 



m097 .pep 
9097 

m097 .pep 
g097 

m097 .pep 
g097 

m097 .pep 
g097 

m097.pep 
g097 

m097.pep 
g097 
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1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 344; ORF 097.a>: 
a097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IV NPLILGET 

51 GMDMG AVFVA TCIASA3GCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAGIGLFL 

151 ALISLKGAGI IVA NPATLVG LGDIHQPS AL LALFGFAMW YLGHFR VQGA 

201 IIITILTITV IASLMGL NEF HGIIGEVPSI APTFMQNDFK GLFTVSMVSV 

251 IFVFFLVDLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

351 TAPALLYVGT QMLRSAF.DID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

4 01 FISYAWKLL CRRTKDVPP M VWIVAVLWAL KFWYLG * 

m097/a097 99.3% identity in 4 36 aa overlap 

10 20 30 40 50 60 

Itt097 . pep MDTSKQTLLDGIFKLKANG7TVRTELMAGLTTFLTMCYIVIVNPXILGETGMDMGAVFVA 
I I I I I I I II I I I I I I I I I I ! I II I I I I I I I I I i I I I I I I I I I I I I I I I II I I I I I I I I I 
a097 MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m097 . pep TCIASAIGC FVMG FV GN Y P I ALAPGMG LNA Y FT FA WKGM G V PW QVALG AV FISGLIFIL 

I 1 1 1 1 I I I I I I 1 1 I I I I I I : I I I ! I I I I I I M I I 1 1 1 1 I I I I I I I I I I I I I I I I I 1 I I i I 

a097 TCI AS AIGCFVMGFVGNYP I ALAPGMG LNAYETFAWKGMGVPWQVALGAVFISGLIFIL 

70 80 90 100 110 . 120 

130 140 150 160 170 180 

m097 .pep F S FFK VREM LV N AL PMG LKMS I AAG I G L FLAL I S LKG AG 1 1 VAN PAT LVG LG D I HQ P SAL 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
a097 FSFFKVREMLVNALFMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m097 .pep LALFGFAMVVVLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFE 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I : 
a097 LAL FG FAM VW LGH FR VQG A 1 1 1 T I LT I TV I AS LMG LN EFHG 1 1 GE V P S I APT FMQMD FK 

190 200 210 220 230 240 

250 260 270 280 290 300 

mO 97 . pep GLFTVSMVSVI FVFFLVDLFD5TGTLVGI5HRAGLLVDGKLPRLKRALLADSTAIVAGAA 

I M I I I I I I I I I I I M I 1 I I I I I I M I I : 1 I I I I I I I M I I I I I I I I I I I II I I I I I ! I 
a097 GLFTVSMVSVI FVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

250 260 270 280 290 300 

310 320 330 340 350 360 

mO 97 . pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I ! I I j I 
a097 LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m097 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 
I I I I I I I I I I I II I I I I I I ! I I I I I I I I I I I I I I I M I I I I I I I I I I I II II I I I I I j I 
a097 QMLRSARD I DW DDMTE AAPAFLT I V FM P FT Y S I ADG I AFG FI S YAWKLLCRRTKDV P PM 

370 380 390 400 410 420 

4 30 

m097 . pep V W I V A V LW ALK FW Y LGX 

I I I I I II I I I I I I I I I I 
a097 VWIVAVXWALKFWYLGX 
430 
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The following partial DNA sequence was identified in tf. gonorrhoeae <SEQ ID 345>: 
g098 .seq 

1 ATGACCGCCG ACGGTCTCTT CGTCGCTTTC AACTTCAATA CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAGCAGGA TGCTGCCCAG GCTGGCGATC 

101 AGTTTG TCGG CGATGTCGCG CGCTTCGCTG TCGGGATGGC TTTCGCGTTC 

151 GGGATGAACG CAGCCGAGCA TGGACACGCC GGTACGCATC ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAAT CAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

3 01 GTTGGGCAGA TGGCCGTGAA TCAGCAAGTG TGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATTAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 346; ORF 098.ng>: 

g098.pep 

1 MTADGLFVAF NFNTFAWRI LIPVQQDAAQ AGDQ FVGDVA RFAVGMAFAF 

51 GMNAAEHGHA GTHHVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV CDFFKLAFLC QIRMS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 347>: 

m098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 
301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

3 51 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 348; ORF 098>: 

m098 .pep 

1 MTADGLFVAF NLNAFAWRI LIPVQEDAAE AGDQ FVGDVA RFTFRMAFTF 

51 RMNAAOHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 098 shows 89.6% identity over a 125 aa overlap with a predicted ORF (ORF 098.ng) 

from N. gonorrhoeae: 
m098/g098 

10 20 30 40 50 60 

MTADGLFVAFNLNAFAWRI L I P VQEDAAEAGDQ FVGDVARFT FRMAFT FRMNAAQHGY A 

llMllllllhhllllllHIIhllhllllllllMlh I I hi I I! hi hi 

MTADGLFVAFNFNTFAWRI LI PVQQDAAQAGDQFVGDVARFAVGMAFAFGMNAAEHGHA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQl^FVGQmVNQQVGDFFKLAFLC 

llhlllllllllllllMIIIIIIIIIIIIIIIIIIIIIIMIMIIII I I I I I I I I I 
GTHHVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEF^GQMAVNQQVCDFFKLAFLC 
70 80 90 100 110 120 



m098 .pep 
g098 

m098 .pep 
g098 



m098.pep QIRMSX 
llllll 

g098 QIRMSX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 349>: 
a098. seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 
51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 
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101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 350; ORF 098.a>: 

a098.pep 

MTADGLFVAF NLNAFAWRI LIPVQEDAAE AGDQFVGDVA 



1 
51 
101 



RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE 
VGQMAVNQQV GDFFKLAFLC QIRMS* 



RFTFRMAFTF 
LGFISGQLEF 



n098/a098 



100.0% identity in 125 aa overlap 



10 20 30 40 50 60 

m098 . pep M T A DG L FVAFN LN AF A WR I L I P VQE DAAE AG DQ FVG DVAR FT FRMA FT FRMN AAQH G Y A 

I I I I I I 1 I I I I i 1 I I II I I I I I I I I I I I I I I I I I I II I I M I I II I I I I I I I I I I I I I I I 
a098 MTADGLFVAFNLNAFAWRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 

10 20 30 40 50 60 

70 80 90 100 110 . 120 

m098 . pep GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFI SGQLEFVGQMAVNQQV GDFFKLAFLC 

I I I I I I I I I I I t I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a098 GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

70 80 90 100 110 120 



m098.pep 
a098 



QIRMSX 
I II I I I 
QIRMSX 



The following partial DNA sequence 

g099 . seq 



1 


ATGCTGGGAC 


GCGCGTCCAT 


51 


GCTGACGGGC 


AAACGGCAGG 


101 


CACTGACCGA 


ATTCTTGCGT 


151 


TTTTTCGGCG 


AGGGCGCGAG 


201 


TTCCAACATG 


ACGCCGGAGT 


251 


ACGCGCAAAC 


TATTGATTAT 


301 


GTGAAATTGG 


TGGAAAC CT A 


351 


CTTGAAAACC 


GCCGTTTATC 


401 


TAACGCGCAA 


TATGGCAGGC 


451 


GCCGATTTGG 


CGGCGAAAGG 


501 


CGGCCAAATG 


CCTGACGGTG 


551 


CCAATACTTC 


CAACCCGCGC 


601 


AATGCCAACC 


GCCTCGGCTT 


651 


TGCCCCGGGT 


TCAAAAGTAG 


701 


TGCCCGAAAT 


GGAAAAACTC 


751 


ACCTGTAACG 


GCATGAgcgG 


801 


CATCGACCGC 


GAtttgtacg 


851 


TCGACGGCCG 


TATCCATCCG 


901 


CCTTTGGTCG 


TTGC CTACGC 


951 


AAACGACGTA 


CTCGGCGTTG 


1001 


TCTGGCCGAC 


AGACGAAGAA 


1051 


CCGCAACAAT 


TCCGCGACAT 


1101 


GCAAAAAGCA 


CCAAGCCCGC 


1151 


TCCGCCGTCC 


GCCCTATTGG 


1201 


AGAGGTATGC 


GTCCGCCGGC 


1251 


CATCTCgcca 


tCCAATGCGA 



identified in N. gonorrhoeae <SEQ ID 35 1>: 

GATGCGCCTG CCCGATATTG TCGGCGTGGA 
CGGGCATTAC TGCCACAGAC ATCGTGTTGG 
AAAGAGCGCG TGGTCGGGGC GTTTGTCGAA 
AAGCCTGTCT ATCGGCGACC GCGCGACCAT 
TCGGCGCGAC TGCCGCCATG TTCGCCATCG 
TTGAAACTGA CCGGACGTGA CGACGCGCAG 
CGCCAAAACC GCAGGCTTAT GGGCAGGTGG 
CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 
CCGAGCAACC CGCACGCGCG TTTTGCCACC 
GCTGGCGAAG CCTTACGAAG AG CCTTCAGA 
CAGTGATTAT TGCCGCGATT ACTTCGTGTA 
AACGTTGTCG CCGCCGCACT GTTGGCACGC 
GAAACGCAAA CCTTGGGTGA AATCTTCGTT 
CCGGAATCTA TTTGAAAGAA GCAGGCTTGT 
GGCTTCGGTA TCGTCGCCTT CGCATGTACC 
CGCGCTcgaC CCGAAAATCC AACAAGAAAT 
CCACCGCCGT ATTGTCAGGC AACCGCAACT 
TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
ATTGGCAGGT AGCATCCGTT TCGATATTGA 
CAGACGGCCG CGAAATCCGC CTGAAAGATA 
ATCGATGCCA TCGTTGCCGA ATATGTGAAA 
TTATATCCCG ATGTCCGACA CCGGCACAGC 
TGTACGACTG GCGACCGATG TCCACCTACA 
GAAGGCGCAC TGGCAGGGGA ACGTACATTA 
GATTTTGCCC GACAACATCA CCACCGACCA 
TTTTGGCCGG cagTGCcgca ggtgaATATT 
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1301 TGGCGAAAAT GGGTTTGCCT GAAGAagaCT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTtcgt 
14 51 tggcacgcgT tgaacCAGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTT ATCATCATTG CCGGTGCGGA 
1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC GGCGAAGGGC GTGCGGCTGG 
1601 CGGGTGTGGA AGCCATCGCC GCCGAAGGTT TCGAGCGCAT CCACCGCACC 
1651 AACCT CATCG GCATGGGCGT CTTGCCGCTG CAATTCAAAC CCGGCACCAA 
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGG CG 
1751 AACGCACACC GCGCTGCGGC CTGACCCTCG TGATTCACCG TAAAAACGGA 
1801 GAAACCGTCG AAGTTCCGGT TACCTGCCGC CCCGATACCG CAGAAGAAGC 
1851 ATTGGTATAT GAAGCCGGCG GCGTATTGCA ACGGTTTGCA CAGGACTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 352; ORF 099.ng>: 

g099 -pep 

1 MLGRASMMRL PDIVGVELTG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDAQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWAGGLKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAGIYLKE AGLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQQEIIDR DLYATAVLSG NRNFDGRIHP YAKQA FLASP 

301 PLWAYALAG SI RFDIENDV LGVADGREIR LKDIWPTDEE IDAIVAEYVK 

3 51 PQQFRDIYIP MSDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 
401 RGMRPPAILP DNITTDHISP SNAILAGSAA GEYLAKMGLP EEDFHSYATH 
451 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 
501 ETYMNRKQPL I I I AGADYGQ GSSRDWAAKG VRLAGVEAIA AEGFERIHRT 
551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCG LTLVIHRKNG 
601 ETVEVPVTCR PDTAEEALVY EAGGVLQRFA QDFLEGNAA* 

Tlie following partial DNA sequence was identified in AT. meningitidis <SEQ ID 353>: 

ra099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGG CGTTGA 

51 GCTGAACGGC AAACGGCAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACX3CGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTATC CTCGCGTTTT GAAATTTGAT TTGAGCAGCG 

4 01 TAACGCGCAA TATGGCAGGC CCAAGTAACC CGCATGCCCG TTTTGCGACC 
451 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCGGA 
501 CGGCCAAATG CCCGACGGCT CGGTCATCAT CGCCGCGATT ACCAGTTGCA 
551 CCAACACTTC CAACCCGCGC AACGTTGTTG CCGCCGCGCT CTTGGCACGC 
601 AATGCCAACC GTCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 
651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCGGGCCTGT 
701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCCTGCACC 
751 ACCTGCAACG GCATGAGTGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 
801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTATCAGGC AACCGCAACT 
851 TCGACGGCCG TATCCACCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGTATCCGTT TCGATATTGA 
951 AAACGACGTA CTCGGCGTTG CAGACGGCAA GGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTGC CGATGAAGAA ATCGATGCCG TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAGT TCCGCGATGT GTATGTACCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCCAGTCCGC TGTACGATTG GCGTCCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCTTACTGG GAAGGCGCGC TGGCAGGGGA ACGCACATTA 

1201 AGAGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 

1251 CCTCTCGCCG TCCAATGCGA TTTTGGCCGT CAGTGCCGCA GGCGAGTATT 

1301 TGGCGAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCT ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAAAA ACGAAGACGG CAGCGTGCGC CAAGGCTCGT 
14 51 TCGCCCGCGT CGAACCCGAA GGCGAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC ATCATCATTG CCGGTGCGGA 
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1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 

1601 CCGGCGTAGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

1651 AACCTTATCG GCATGGGCGT GTTGCCGCTG CAGTTCAAAC CCGACACCAA 

1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTGGTCGGCG 

1751 AACGCACACC GCGCTGCGAC CTGACCCTCG TGATTCACCG TAAAAACGGC 

1801 GAAACCGTTG AAGTTCCCGT TACCTGCTGC CTCGATACTG CAGAAGAAGT 

1851 ATTGGTATAT GAAGCCGGCG GCGTGTTGCA ACGGTTTGCA CAGGATTTTT 

1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 354; ORF 099>: 

m099.pep 

MLGRASMMRL PDIVGVELNG KRQAGITATD IVLALTEFLR KERWGAFVE 
FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 
VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 
ADLAAKGLAK PYEEPSDGQM PDGSVIIAAI TSCTNTSNPR NWAAALLAR 
NANRLGLKRK PWVKSSFAPG SKVAEIYLKE AGLLPEMEKL GFGIVAFACT 
TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQ AFLASP 
PLWAYALAG SIRFDIEMDV LGVADGKEIR LKDIWPADEE IDAWAEYVK 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 



PQQFRDVYVP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 
RGMRPLAILP DNITTDHLSP SNAILAVSAA GEYLAKMGLP EEDFNSYATH 
RGDHLTAQRA TFANPKLFNE MVKNEDGSVR QGSFARVEPE GETMRMWEAI 
ETYMNRKQPL I I I AGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 
NLIGMGVLPL QFKPDTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 
ETVEVPVTCC LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 099 shows 96.2% identity over a 639 aa overlap with a predicted ORF (ORF 099.ng) 
from TV. gonorrhoeae: 

m099/g099 



10 20 30 40 50 60 

m099 . pep MLGRASMMRL PD I VGVELNGKRQAGI TATDI VLALTEFLRKERWGAFVEFFGEGARS LS 

I II IIIIIIIM II llhll II I I III III III III I I! II I II II III! Ill M II II 
g0 99 MLGRASMMRLPDIVGVELTGKRQAGI TATDI VLALTEFLRKERWGAFVEFFGEGARS LS 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 9 9 . pep I GDRAT I SNMT P E FG ATAAM FAI DEQT I D YLKLTG RDDAQVKLVE T Y AKTAGLWADALKT 

IIIMIIIIIIIIIMIIIMIll IIIIIIIIIIIIMIIIIIIIIIIIMIII HI! 

gO 9 9 I GDRAT I SNMTPEFGATAAMFAI DAQTID YLKLTGRDDAQVKLVET YAKTAGLWAGGLKT 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 9 9 . pep AWPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVI IAAI 

I II llllll II III llll IIIIIIIIIMII III MM MM MM I MM Ihll II II 

gO 9 9 AWPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGAVI IAAI 

130 140 150 160 170 180 

190 200 210 220 230 240 

mO 9 9 . pep TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSS FAPGSKVAE I YLKEAGLLPEME KL 

! 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 ! 1 1 ! I llllll MINIMI 

g0 9 9 TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSS FAPGS KVAG I YLKEAGLLPEME KL 

190 200 210 220 230 240 

. 250 260 270 2B0 290 300 

mO 99 . pep GFG IVAFACTTCNGMSGALDPKI QKEI I DRDL YATAVLSGNRNFDGRI HPYAKQAFLAS P 

i 1 1 M II 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gO 9 9 GFG IVAFACTTCNGMSGALDPKI QQE 1 1 DRDL YATAVLS GNRNFDGRI HPYAKQAFLAS P 

250 260 270 280 290 300 



310 



323 



330 



340 



350 



360 
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m099 . pep P L WAYALAGS I RFD I ENEVLGVADGKE I RLKDI W PADEE I DAWAEYVKPQQFRDVYVP 

I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I II I : I I I I | | : I i | | | | | | | | | | : | = | 
g 0 9 9 PL WAYALAGS I RFD I ENEVLGVADGRE I RLKDI WPTDEE IDAI VAE YVKPQQFRDI YIP 

310 320 330 340 350 360 



m099 -pep 
g099 



370 380 390 400 410 420 

MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGr4RPLAIIiPDNITTDHLSP 

I MIMM Mill I II Mil II I III MM III MINI I II 1 1 1 1 1 1 1 1 1 11 0 I 

MSDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPPAILPDNITTDHISP 
370 380 390 400 410 420 



430 440 450 460 470 480 

mO 9 9 . pep SNAI LAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 

I I I I I I I I i I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I II = I I I I I I I 
g 0 9 9 SNAI LAGS AAGE YLAKMGLPEEDFNS Y ATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 

490 500 510 520 530 540 

mO 9 9 . pep QGSFARVEPEGETMRMWEAI ETYMNRKQPLI 1 1 AGAD YGQGSS RDWAAKGVRLAGVEAI V 

IMMII II I IMM I M I MMM MM II I Mill Mill M I MM 1 1 II MM II - 

g099 QGSLARVEPEGQTMRMWEAI ETYMNRKQPLI 1 1 AG AD YGQGSS RDWAAKGVRLAGVEAI A 

490 500 510 520 530 540 

550 560 570 580 590 600 

m09 9 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 

II Ml! Ml I II MM II I Mill i II II II Mill 1 1 III Ml Ml I I Ml III II I 

g 0 9 9 AEGFERI HRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDWGERTPRCGLTLVIHRKNG 

550 560 570 580 590 . 600 



610 620 630 640 

mO 9 9 . pep BTVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

I I I I I I I I I I I I I I - I I M I I I I I I I I I I I I I I I I II I 
gO 9 9 ETVEVPVTCRPDTAEEALVYEAGGVLQRFAQDFLEGNAAX 
610 620 630 640 



identified in N. meningitidis <SEQ ID 355>: 



The following partial DNA sequence was 

a099.seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGAAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CAT TG ATT AT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

401 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCGACC 

451 GCCGATTTGG CCGGCAAAGG CTTGGCTAAA CCTTACGAAG AGCCTTCAGA 

501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCCTGTA 

551 CCAATACTTC CAATCCGCGC AACGTTGTCG CCGCCGCGCT GTTGGCACGC 

601 AATGCCAACC GCCTCGGCTT GCAACGCAAA CCTTGGGTGA AATCTTCGTT 

651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCAGATCTGC 

701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTTGCCTT CGCATGTACC 

7 51 ACCTGTAACG GCATGAGCGG CGCGCTGGAT CCG AAAATCC AGAAAGAAAT 

801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTGTCAGGC AACCGCAACT 

851 TTGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 

901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGCATCCGTT TCGATATTGA 

951 AAACGACGTA CTCGGCGTTG CAGACGGCAA AGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTAC CGATGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAAT TTCGCGACGT TTATATCCCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCAAGCCCGC TGTACGACTG GCGTCCAATG TCTACCTATA 

1151 TCCGCCGCCC ACCTTACTGG GAAGGCGCAC TGGCAGGGGA ACGCACATTA 

1201 AGCGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 



WO 99/57280 



PCT/US99/09346 



308 



1251 TCTCTCGCCA TCCAATGCGA TTTTGGCAAG CAGTGCCGCA GGCGAATATT 

1301 TGGCAAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

1351 CGTGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAACGTTCGC 

14 51 TGGCACGCGT TGAACCCGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 

1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC AT CAT CAT TG CCGGCGCGGA 

1551 CTACGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 

1601 CCGGCGTGGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

1651 AACTTGATCG GTATGGGCGT GTTGCCGCTG CAGTTCAAAC CGGGTACCAA 

17 01 CCGCCACACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 

17 51 AACGCACACC GCGCTGCGAC CTGACCCTTG TGATTCACCG TAAAAACGGC 

1801 GAGACCGTCG AAGTCCCCAT TACCTGCCGC CTCGATACCG CAGAAGAAGT 

1851 GTTGGTATAT GAAGCCGGTG GCGTATTGCA ACGGTTTGCA CAGGATTTTT 

1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 356; ORF 099.a>: 
a099.pep 

1 MLGRASMMRL PDIVGVELNG KRKAGITATD IVLALTEFLR KERVVGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAGKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NVVAAALLAR 

201 NANRLGLQRK PWVKSSFAPG SKVAEIYLKE ADLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQ AFLASP 

301 PLWAYALAG SI RFDIENDV LGVADGKEIR LKDIWPTDEE 1DAIVAEYVK 

351 PQQFRDVYIP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW F.GALAGF.RTL 

4 01 SGMRPLAILP DNITTDHLSP SNAILASSAA GEYLAKMGLP EEDFNSYATH 

4 51 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPITCR LDTAEEVLVY EAGGVLQRFA QDFLEGNAA+ 



m099/a099 97.5% identity in 639 aa overlap 

10 20 30 40 50 60 

mO 99 .pep MLGRASMMRLPDIVGVELNGKRQAGITATDIVLALTEFLRKERWGAFVEFFGEGARSLS 
I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I i 1 II I t I I I I I 
a099 MLGRASMMRL PDIVGVELNGKRKAGITATDIVLALTEFLRKERWGAFVE FFGEGARSLS 

10 2C 30 40 50 60 



70 80 90 100 110 120 

m099.pep IGDRATISNMTPEFGATAAMFAIDEQTIDYLKLTGRDDAQVKLVETYAKTAGLWADALKT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
a 0 9 9 I G DRAT I SNMT PE FGATAAMFAI DEQT I D YLKLTGRDDAQVKLVE T Y AKTAGLWADALKT 

70 80 90 100 110 120 



130 140 150 160 170 1B0 

m099.pep AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 
I I I I I I I I I I II I i I I I I I I I I! I I I I I I II I I I : I I I I I I I I I I I I I I I I I I : I I I I I I 
a099 AVYPRVLKFULSSVTRNMAGPSNPHARFATAULAGKGLAKPyEEPSDGQMPDGAVIIAAI 

130 140 150 160 170 180 



190 200 210 220 230 240 

m099.pep TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAEIYLKEAGLLPEMEKL 
I I I I I I I I I I I I M I I II I I I I I I I I I : I I I I I I I I I i I I I I I I I I II I I I 11111)11 
a 0 9 9 TSCTNT SN PRHWAAALLARNANRLGLQRKPWVKS S FAPGS KVAE I YLKEADLLPEME KL 

190 20C 210 220 230 240 



250 26C 270 280 290 300 

m099.pep GFCIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNF.NFDGRIHPYAKQAFLAGP 
I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
a099 GFG I VAFACTTCNGMSGALDPK IQKE 1 1 DRDL Y AT AVLSGN RN FDGR I H P YAKQAFLAS P 

250 260 270 280 290 300 



m099.pep 



310 320 330 340 350 360 

P L V V A Y ALAG S I R F DI EN D V LG VADGKE I R LK D I W P ADE E I D A V V AE YVK PQQ FR D VY V P 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I : I I I I M : I I I I I II I | | | | | |: | 
a099 PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPTDEEIDAIVAEYVKPQQFRDVYIP 
310 320 330 340 350 360 

370 380 390 400 410 420 

m099 . pep MFDTGTAQKAPS PLYDWRPMSTYIRRPPYVJEGALAGERTLRGMRPLAILPDNITTDHLSP 

I I I I I I I I I I 1 1 I I I I I I I I I II I I I I I II I I i I i ! I I I I I I I I ! I I I 1 1 | | | | | | | | | 
a099 MFDTGTAQKAPS PLYDWRPMSTYIRRPPYWEGALAGERTLSGMRPLAILPDNITTDHLSP 

370 380 390 400 410 420 

430 440 450 460 470 480 

JT:099 . pep SNAILAVSAAGEYL.°iKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 
I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I 1 I I I I I : I | | | | 1 | 
a099 SNAILASSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 

490 500 510 520 530 540 

Itl099.pep QGSFARVEPEGETMRMWEAIETYMNRKQPLIIIAGADYGQC-SSRDWAAKGVRLAGVEAIV 
I I I : I I I I I ! I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I 
a099 QGSLARVEPEGQTMRMWEAIETYMNRKQPLIIIAGADYGQGSSRDWAAKGVRLAGVEAIV 

490 500 510 520 530 54 0 

550 560 570 580 590 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 
I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 
a099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 

550 560 570 580 590 600 

610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 
111111:11 I I I I I I I I I I I I I I I II I I II I I II I I II I 
aO 9 9 ETVEVPITCRLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 357>: 

gl02 . soq 

1 AtgtCCGCCA AAactccgtc gctcttcggc ggcgcgatga Ttatcgccgg 

51 gaaggttatc ggcgcAGgta tgttccccaa ccccaccgcc aacttggggg 

101 acgggttaat aggctcgctg attgtgctgc tgtacocctg gtttccattc 

151 tcctccggcg ccctcatgat tttggaagtc aacacccata acCCccgagg 

201 ggcaAGtttt gacaccATGg tcAAagacct gctcgGaCGc ggctggaaca 

251 tcatcaacgg catcgccgtc gctttggTCc tatacggctc gacctacgcg 

301 cacattttag tcggcggtga cctGACCGCC AAAGGCAtcg GCAgCGCAGT 

351 AGGCGGCAAA ATTTCgctca CCGTCGGACA actcgtCttC tTCGGCATCC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTTACCGGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GGTTGCCGAT GCCAAACCGT CCGTCCTCTT CGACACCCAA GCCCCCGTCG 

551 GCACCGGCTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTTTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGcgcCc aaagtGgCGA aATCcatctg gGcaggtaca ttggTTGCCt 

701 tggtaattta cgtccTCTgg caaaccgcca tCcaaagcaa ccTGCcgcgc 

7 51 aacgagttcg cCCCcgtgat tgccgccgag aggcaactCT CCGTCCTgaa 

801 tgaaacccTG tccaaattcg cccaaaccgg cgatatggat aAaatattgt 

851 ccctatttcc ctacatggca atcgccacct cctttttagg cgTAACctta 

901 ggcctgtttg acaacatcgc cgacatcttc aaatggaacg acagtatgtc 

951 cgggcggggc accaaaaccg tcgcgctgaa cttcctgccg CCCCtgattt 

1001 cctggctgct cctccccacc ggcttcttta ccgccattgg tgcgtccggc 

1051 ctggcggcaa ccgtctggga ccaagGcatc atccccgcca tgcf.gctcta 

1101 cgtttccccc caaaaaattG gcGcaggcaa gacttataAa gtttaCGGCG 

1151 gcttgtggct gatgttagtc ttccttttcg gcatcgccaa catcgccgca 

1201 CAGGTATTGA GccaAatgGa ACtcgtCccc GTATTTAAAG GATAA 

This corresponds to the amino acid sequence <SEQ ID 358; ORF 102.ng>: 

gl02.pop 
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1 MSAKTPSLFG GAMIIAGKVI GAGMFPNPTA NLGDGLIGSL IVLLYTWFPF 

51 SSGALMILEV NTHNPRGASF DTMVKDLLGR GWNIING IAV ALVLYGSTYA 

101 YILVGGDLTA KG1GSAVGGK ISLTVGQLVF FGILAFC VWA SARLVDRFTG 

151 v l:ggm vltf iwatgglv ad akpsvlfdtq apvgtgywiy aatalpvcla 

201 sfgfhgnvss llkyfkgdap kvaksiw agt lvalviyvlw qtai qsnlpr 

251 NEFAPVIAAE RQLSVLNETL SKFAQTGDMD K ILSLFPYMA IATSFLGV TL 

301 GLFDNIADIF KWNDSMSGRG TKTVALNFLP P LISWLLLPT GFFTAIGA SG 

351 LAATVWDQGI IPAMLLYVSP QKIGAGKTYK VYGGLW LMLV FLFGIANIAA 

401 QVLSQMELVP VFKG* 



The following partial DNA sequence was 

ml02.seq 

1 ATGCCCAACA AAACCCCTTC ACTGTTCGGC 

51 CACGGTCATC GGCGCAGGCA TGCTCGCCAA 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC 

151 CTTTCCAGCG GCCTGATGAT TTTGGAAGTC 

201 CGCAAGTTTC GACACGATGG TCAAAGACCT 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC 

351 AGGCGGCGAC GTTTCACTCA CCGTCGGACA 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT 

451 GTCCTTATCG GCGGCATGGT ATTGACCTTT 

501 GATTGCCGAT GCCAAGCCGT CCGTCCTCTT 

551 GCACAAACTA CTGGATTTAC GCCGCCACCG 

601 TCCTTCGGCT TCCACGGCAA CGTCTCCAGC 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA 

751 AACGAGTTCG CCCCCGTCAT CGCCGCCGAA 

801 CGAAACCCTG TCCAAATTCG CCCAAACCGG 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT 

901 GGACTCTTCG ACTACATCGC CGACATCTTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT 

1001 GCCTGCTCTT CCCCACCGGC TTCGTTACCG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA 



identified in N. meningitidis <SEQ ID 359>: 



GGCGCGATGA 
CCCGACCGCC 
TGTACACCTG 
AACACCCATT 
GCTCGGACGC 
TATACCTGCT 
AAAGGCTTAG 
ACTCGTCTTC 
TGGTCGACCG 
ATTTGGGCGG 
CGATACCCAA 
CCCTGCCCGT 
CTGCTCAAAT 
GACGGGCACA 
TCCAAGGCAA 
GGGCAAGTCT 
CAATATGGAC 
CGTTTTTAGG 
AAATGGAACG 
CCTGCCGCCC 
CCATCGGCTA 
GCCATGCTGC 
TAAAGTTTAC 
TCGTCAACAT 
TTTAAAGGAT 



TTATCGCCGG 
ACATCCGGCG 
GTTTTCTATG 
ATCCGCACGG 
GGCTGGAACA 
TACTTACGCT 
GCAGCGCGGC 
TTCGGCATCC 
CTTCACCGGC 
CCGGCGGGCT 
GCCCCCGCCG 
CTGCCTCGCT 
ACTTTAAAGG 
CTGATTGCGC 
CCTGCCGCGC 
CCGTCCTCAT 
AAAATATTGT 
CGTAACGCTC 
ACAGCATCTC 
CTGATTTCCT 
CGTCGGCCTG 
TCTACCGTTC 
GGCGGCTTGT 
CGCCGCACAG 
AA 



This corresponds to the amino acid sequence <SEQ ID 360; ORF 102>: 

ml02-pop. . 

1 MPHKTPSLFG GAMIIAGTVI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 



51 LSS GLMILEV 
101 YIFVGGDLTA 
151 VLIGGMVLTF 



NTHYPHGASF 
KGLGSAAGGD 
IWAAGGLIAD 



201 SFGFHGNVSS 

251 NEFAPVIAAE 

301 GLFDYIADIF 

351 AATVWTGJI P 

401 VLSQMELVPV 



LLKYFKGDAP 
GQVSVL1 ETL 
KWNDSISGRT 
AMLLYRSRKK 
FKG* 



DTMVKDLLGR GWNII NGIAV AFVLYLLTYA 

VSLTVGQLVF FGILAFC VWA SARLVDRFTG 

AKPSVLFDTQ APAGT NYWIY AATALPVCIA 

KVAKS IWTGT LIALVIYVLW QT AIQGNLPR 

SKFAQTG NMD KILSLFSYMA IATS FLGVTL 

KTAALTFLPP LISCLL FPTG FVTAIGYVGL 

FGAGKTYKVY GGLWLMVWVF LFGIVNIAAQ 



ml02/gl02 86.0% identity in 415 aa overlap 

10 20 30 40 50 60 

ml02.pep M P NKT PS LFGGAM 1 1 AGT V I G AGML AN PT AT S G VWFT GS LA VLLY TWFSML S S GLMI LEV 

I I I [ I I I I I I I 11111:1111:1 : I I I I I I i I I i : I : : I I I I 1 1 

gl02 MSAKTPSLFGGAMIIAGKVIGAGMFPNPTANLGDGLIGSLIVLLYTWFPFSSGALMI LEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml02 . pep NTHYPHGAS FDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 

111 \ : I I I I I I I I I I I I I I I I I J I I I I I I I : 1 1 I I I I I I : I 1 1 I I I I I ! : I I I : I I 
gl02 NTHNPRGAS FDTMVKDLLGRGWN 1 1 NGI AVALVLYGSTYAY ILVGGDLTAKGIGSAVGGK 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml02 . pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
:l 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 i ! I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I I I I I I z 1 1 | r | I 1 1 1 1 1 1 1 I 1 1 
gl02 ISLTVGQLVFFGILAFCWASARLVDRFTGVLIGGMVLTFIWATGGLVADAKPSVLFDTQ 

130 140 150 160 170 180 



ml02.pep 



190 200 210 220 230 240 

APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIMTGTLIALVIYVXW 
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M : I I : I I! I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 ! I I : I I I : I I I I I I I I 
g l02 APVGTGYWIYAATALPVCLASFGFI1GNVSSLLKYFKGDAPKVAKSIWAGTLVALVIYVLW 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 02 . pep QTAIQGNLPRNEFAPVI AAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAI AT3FLGVTL 

I I I t I : I f I I I I t i I I I I I t 1:111 I ! t I I II I I t : I I I I I I i I I I I I I I I I I I I I I 
gl02 QTAIQSNLPRNEFAPVIAAER0LSVLNETLSKFAQTGDMDKILSLFPYMAIAT3FLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 

ral02.pep GLFDYIADIFKWNDSISGR-TKTAAT..TFT.PPLISCLLFPTGFVTAIGYVGLAATVWT-GI 
Mil 1111111111:111 111:11:1111111 I I : I I I I I I I I I I I I I I i I I 
gl02 GLFDNIADIFKWNDSMSGRGTKTVALNFLPPLISWLLLPTGFFTAIGASGLAATVWDQGI 

310 320 330 340 350 360 

360 370 380 390 400 410 

ral02 . pep I P AMLLY RS RKK FGAGKT Y KV YGGLWLMVWV FL FG I VN I AAQVL S QM EL V P V FXGX 

I j I I I I I I : I : I i I I I I I I I I I I I I I : I I I I I I : ! I I I I I I I I I I I I I I I I i I 
gl02 I P AMLLY VS PQKI GAGKTYKVYGGLWLML- VFLFGI AN I AAQVLSQMELVPVFKGX 

370 380 390 400 410 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 361 >: 

al02 . seq 

1 ATGCCCACCA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGNTCATC GGCGCAGGTA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCCATG 

151 CTCTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCACT ACCCCCACGG 

201 CGCGANCTTC GACACCATGG TTAAAGACCT GCTCGGACGG AGCTGGAACA 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCAAT GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATTC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG ATTCACCAGC 

451 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GATTGCCGAT GCCAAACTGC CCGTCCTCTT CGACACCCAA GCCCCTACCG 

551 GCACCAACTA CTGGATTTAT GTCGCCACCG CCCTGCCCGT CTGCCTTGCG 

601 TCATTCGGTT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAANGCAA CCTGCCGCGC 

751 AACGAGTTCG CCCCCGTGAT TGCCGCCGAA GGGCAAGTCT CCGTCNTGAT 

301 TGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

351 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCGTGTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCT NTAATTTCCT 

1301 GCCTGCTCTT CCCCACCGGC TTTGTTACCG CCATCGGNTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TNTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAG GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCNTCAACAT CGCCGCACAN 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 
1202 

This corresponds to the amino acid sequence <SEQ ID 362; ORF 102.a>: 

al02 -pep 

1 MPTKTPSLFG GAMIIAGTXI GAGMLANPTA TSGVW FTGSL AVLLYTWFSM 

51 LSSGLMILEV NTHYPHGAXF DTMVKDLLGR SWNIINGIAV AFVLYLLTYA 

101 YIFVGGDLTA KGLGSAAGGN VSLTVGQLVF FGILAFCVWA SARLVDRFTS 

151 VLIGGMVLTF IWATGGLIAD AKLPVLFDTQ APTGTNYWIY VATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQXNLPR 

251 NEFAPVIAAE GQVSVXIETL SKFAQTGNMD KILSLFSYMA IATSFLGVTL 

301 GLFDYIADIF KWNDSVSGRT KTAALTFLPP XISCLLFPTG FVTAIGYVGL 

351 AATVWTGIIP AMLLYRSRKK FGAGKT YKVY GGLWLMVWVF LFGIXNIAAX 

401 VLSC»1ELVPV FKG* 



ml02 / al02 



95.9% identity in 413 aa overlap 
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10 20 30 40 50 60 

ml 02. pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I 
a 102 MPTKTPSLFGGAMIIAGTXIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 02 . pep NTHYPHGASFDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYI FVGGDLTAKGLGSAAGGD 

I I I I M I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I ! I : 
al02 N TH ¥ FHG AX F DTMV KDL LG R S WN 1 1 N G I AV AF V L Y L LT Y A Y I FVGG D LT AKG LG S AAGG N 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 02 . pep V SLT VGQ L V F FG I LA FCVW AS AR L VDR FT G VL I GGM VLT F I W AAGG L I A DAK P S VL F DT Q 

II || I I I I t I I I I I I I II I I I I I I I II I I : I I I I! I I I I I I II : I I I I I I I I I I II I I 
a 102 V SLT VGQL V F FG I LAFCVWAS ARLVDRFT S VL I GGMVLT F I W ATGG L I ADAKL PVL FDTQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 02 . pep APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 
I I : I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I ! I I 
al02 APTGTN YW I YVATALPVCLAS FG FHGNVS S LLKYFKGDAPKVAKS I WTGTLIALVI YVLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I 
a 102 QTAIQXNLPRNEFAPVIAAEGQVSVXIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 

250 260 270 200 290 300 

310 320 330 340 350 360 

ml02 . pep GLFDYIADIFKWNDSISGRTKTAALTFLPPL1SCLLFPTGFVTAIGYVGLAATVWTGIIP 
I II I I I I I I I I I I I I : I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al02 GLFDYIADIFKWNDSVSGRTKTAALTFLPPX1SCLLFPTGFVTAIGYVGLAATVWTGIIP 

310 320 330 340 350 360 

370 380 390 400 410 

ml 02 . pep AMLLYRSRKKFGAGKT YKVYGGLWLMVWVFLFG I VN IAAQVL SQME LVPVFKGX 

I I M I 1 I I I I I I 1 I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al02 AML L YRS RKK FGAGKT YKV YGGLWLM VWV FL FG I XN I AAX VL S QME L V P V FKGX 

370 380 390 400 410 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 363>: 

gl05.seq 

1 Atgtccgcag aaaCATACAc acAAAtcggc tGGgtaggct taggGcaaat 

51 gGgtctgcct atgGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCcgc CAAAGGAGCA 

151 AAAGTTTACG GCagcACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

2 01 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCtTAACC TCGccgtcAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 
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This corresponds to the amino acid sequence <SEQ ID 364; ORF 105.ng>: 

g!05 -pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RAC PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

2 01 DTIVEAIGGS AMDSPMFQTK KSLWAMREFP PAFALKHASK DLNLAVKELE 

2 51 QAGNTL.PAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 365>: 
mios .seq 

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGaTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAm ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAaGC TCGCCGTCAA AGCACTTGTC GAAGCGCAGm GaCAGTTTGC 

351 CGAAGCACCC GTTTCCGGAT CGGTCGGGCC CGCCACCAAC GGCACGCTGC 

4 01 TGATTCTGTT CGGCGGCAGC GAAcCGtTTT AAACCCGCTG CAAAAAATAT 

4 51 TTTCCCTCGT CGGCAAAAAA ACCTTCCATT TCGGCGATGT CGGCAAAGGT 
501 TCGGGCGCGA AACTCGTCTT GAACTCGCTC TTGGGCATTT TCGGCGAaCG 

5 51 TAcAGCGAAs GmTgCTGATG GCGCGGCAGT TCGGCATCGA TACCGACACC 
601 ATCG TCGAAG CCATCGGsGA CTCGGCAATG GACTCGCCCA TGTTCCAAAC 
651 CAAAAAATCC CTGTGGGCAA ACCGCGAATT CCCGmCCGmC TTCGCCCTCA 
701 AACACGCCTC CAAAGACCTC AACCTCGCCG TCAAAGAGCT TGAACAGGCA 
751 GGCAACACCC TGCCCGCCGT CGAAACCGTT GCTGCCAGCT AC CGCAAAGC 
8 01 AGTCGAAGCC GGCTACGGGA CACAGGACGT TTCCGGCGTT TACCTGAAAC 
851 TGGCAGAACA CTGA 

This corresponds to the amino acid sequence <SEQ ID 366; ORF 105>: 

ml 05 . pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDG1AGX IIVNNSTISP 

101 TEKLAVKALV EAQRQFAEAP VSGSVGPATN GTLLILFGGS EPFXTRCKKY 

151 FPSSAKKPSI SAMSAKVRAR NSSXTRSWAF SANVQRXXLM ARQFGIDTDT 

201 IVEAIGDSAM DSPMFQTKKS LWANREFPXX FALKHASKDL NLAVKELEQA 

251 GNTLPAVETV AASYRKAVEA GYGTQDVSGV YLKLAEH 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 1 05 shows 79.9% identity over a 289 aa overlap with a predicted ORF (ORF 105.ng) 
from N. gonorrhoeae: 

ml05/gl05 

10 20 30 40 50 60 

9105. pep M S AE T YTQ I G WVG LGQM G LPMVTRLLD GG I EVG VYNR S P D KT A P I S AKGAKVYGS T AELV 

I I I : I : I t I I • I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I t I I I I : I I I I I 
ml05 MSANEYAOIGWrGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 30 40 50 60 



70 80 90 100 110 120 

gl05 . pep RACPV I FLMVSD YAAVCD I LNGVRDGLAGKI I VNMS T I S PTENLAVKALVEAAGGQ FAEA 

I II I I II III I II MM III I II II H I I III II II llhll II I III I Mill 
ml05 RDYP V I FLMVSDYAAVCD I LNGVRDGLAGX 1 1 VNMS T I S PTE KLAVKALVEAQR - Q FAEA 

70 80 90 100 110 



130 140 150 160 170 180 

gl 0 5 . pep PVSGSVGPATNGTLU LFGGSEAVLNPLQKI FS LVGKKTFH FGDVG KGS GAKLVLNS LLG 

IIIIIMIIIIIIIIIIIIIII I I :|l - h 

ml05 PVSGSVGPATNGTLLILFGGSEPFXTRCKKYFPSSAKKP- SISAMSAKVRARNSSXTRSW 

120 130 140 150 160 170 



WO 99/57280 



PCT/US99/09346 



314 



190 200 210 220 230 240 

gl05 . pep IFGEAYSELAMLMARQFGIDTDTIVEAIGGSAKDSPMFQTKKSLV?7\NREFPPAFALKHASK 

h llll II I I II UN MINI I II II II I II Mill I II I II I I II I 

ml05 AFSANVQRXXLMAROFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXXFALKHASK 
180 190 200 210 220 230 

250 260 270 280 289 

gl05 . pep DLN LAV KE LEQ AGNT L P AV ET V AAS Y RKAV E AG YG EQDVS G V YL KLAEH 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
ml05 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGTODVSGVYLKLAEH 
240 250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 367>: 

al05 . seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA AC AT GAG C AC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATC GAT ACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 368; ORF 105.a>: 

al05 .pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDS PMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



ml05/al05 



96.5% identity in 289 aa overlap 



10 20 30 40 50 50 

ml05.pep M S ANE Y AQ I GW I GLGQMG L PMVTRLL DG G I E VGV YNR S P DKT AP I S AKGAKV YGN T AE LV 

I I lllllll MIIIH II II I II tllllll H ill ill IM1IMMM! I! 1 

al05 MSANEYTQIGWI GLGQMG LPMVTRLLDGGIEVGVYNRSPDKTAP I SAKGAKVYGNTAELV 

10 20 30 40 50 60 

70 80 90 100 110 119 

ml 0 5 . pep RDY PVI FLMVSDYAAVCDI LNGVRDGLAGKI 1 VNMST I S PTENLAVKALVE AAG - Q FAE A 

I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I M I I I I I I I I I I f I I I I I I I I I I I 
al05 RDYPVI FLMVSDYAAVCDI LNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFASA 

70 80 90 100 110 120 

120 130 140 150 160 170 179 

ml05.pep PVSGSVGPATNGTLLILFGGSEAVLNPLOKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I I t I I i I I I I II I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I 
al05 PVSGSVGPATNGTLLTLFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
130 140 150 160 170 130 



180 



190 



200 



210 



220 



230 
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ml05 . pep IFGDV-QRXMLMARQFGIDTDTIVEAIGDSAMDSPKFQTKKSLWANREFPXAFALKHASK 
III:: : : I I I I i I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I MINIMI 
al05 IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPKFQTKKSLWANREFFPAFALKHASK 
190 200 210 220 230 240 

240 250 260 270 280 

ml05 - pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I M I I I I I I I I I I I I I I I 1 1 I 1 1 I I I I I I I I I I I I I I II I I I I I 1 1 M I I 
al05 DLNLAVKELSQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 369>: 

gl05-l . seq 

1 ATGTCCGCAG AAACATACAC ACAAATCGGC TGGGTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGAGCA 

151 AAAGTTTACG GCAGCACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCTTAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 370; ORF 105-l.ng>: 

glOS-l.pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RACP VIFLMV 5DYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 371 >: 

ml05-l . seq 

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCcG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAnCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGsGACTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCG CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAACCCGGCT ACGGCGAACA GGACGTTTCC CCCCTTTACC 

851 TGAAACTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 372; ORF 105-1>: 

ml05-l.pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVI FLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 
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101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 I FSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEXM LMARQFGIDT 

201 DTIVEAIGDS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



ml05-l/gl05-l 96.9% identity in 2B9 aa overlap 



10 20 30 40 50 60 

ml05-l .pep MSANEYAQIGWIGLGOMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
III: I : I I I I : I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I 
g 105-1 M5AETYTQIGWVGLGQMGLPHVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGSTAELV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml05-l .pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
I I I I 1 I I I I I I I I I I I I I I 1 I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I i I 
g 10 5-1 RACPVIFLMVSDYAAVCDILHGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml05-l . Dep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFH FGDVGKGSGAKLVLNSLLG 
I I I I I I ! I I I I I II I I i I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I i I 
g 10 5-1 PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFH FGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml05-l.pep IFGEAYSEXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK 

1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 

gl05-l IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
190 200 210 220 230 240 



250 260 270 280 290 

ml05-l .Dep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I) I I I i I I I 1 I I I I I 1 I i 
gl05-l DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 373>: 

al05-l.seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 374; ORF 105-l.a>: 

al05-l .pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVF.AJGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



alO5-l/ml05-l 99.0% identity in 289 aa overlap 



10 20 30 40 50 60 

a 105-1 .pep MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
I I I I I I : I I I I I I I I I I I I I I I I I I t I I I I I ! I I I I I I I I I I I I ! I I I II I I I It I I I I 



WO 99/57280 



PCT/US99/09346 



317 



ml05-l MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 20 40 SO 60 

70 80 SO 100 110 120 

al 05-1. pep RD Y P V I FLMVS DY AA VC DI LN G V RDG LAG KIIVNMSTIS PTEN L A VK AL V E AA GGQ FAE A 
I I I I I I I I I I I I 1 I I I I I I I I I I I I ! I i I I I I I I I I If I M I i I f I I I I I I I II I I I I I I 
ml 05-1 RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 

70 80 SO 100 110 120 

130 140 ISO 160 170 180 

al 05-1. pep PVSG3VGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I | j I I I I I I I I M I I I 1 i I I I I I I I I I I I I I I I I I I I f I I I I IH I ! I I I I I 1 I I I I \ I 1 
ml 05-1 PVSGSVGPATNGTLLILFGGSEAVLNPLOKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

190 200 210 220 230 2<0 

al05-l .pep IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I II I I 
ml 05 - 1 I FGEA YSEXMLMARQFG IDTDTI VEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

250 260 270 280 290 

al05-l.pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i 1 I I I I I I I I I I I 
ml 05-1 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 375>: 

gl07. seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGGTTG CCGATGCCAA 

51 ACCGTCCGTC CTCTTCGACA CCCAAGCCCC CGTCGGCACC GGCTACTGGA 

101 TTTACGCCGC CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTTT CCAGCCTGCT CAAATACTTT AAAGGCGACG cgcCcaaagt 

201 GgCGAaATCc atctggGcag gtacattggT TGCCttggta atttacgtcc 

251 TCTggcaaac cgccatCcaa agcaaccTGC cgcgcaacga gttcgcCCCc 

301 gtgattgccg ccgagaggca actCTCCGTC CTgaatgaaa cccTGtccaa 

351 attcgcccaa accggcgata tggataAaat attgtcccta tttccctaca 

401 tggcaatcgc cacctccttt ttaggcgTAA Ccttaggcct gtttgacaac 

451 atcgccggac atcttcaaat ggaacgacag tatgtccggg cggcaccaaa 

501 accgtcgcgc tga 

This corresponds to the amino acid sequence <SEQ ID 376; ORF I07.ng>: 
gl07.pep 

1 MVIiTFIWATG GLVADA KPSV LFDTQAPVGT GYWIYAATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IW AGTLVALV IYVLWQTAI Q SNLPRNEFAP 

101 VIAAERQLSV LNETLSKFAQ TGDMD KILSL FPYMAIATSF LQV TLQLFDN 

151 IAGHLQMERQ YVRAAPKPSR * 

The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 377>: 

ml 07 . seq 

1 ATGGTATTGA CCTTTATTTG GGCGGCCGGC GGGCTGATTG CCGATGCCAA 

51 GCCGTCCGTC CTCTTCGATA CCCAAGCCCC CGCCGGCACA AACTACTGGA 

101 TTTACGCCGs CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

201 GGCTAAATCC ATCTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

301 GTCATCGCCG CCGAAGGGCA AGTCTCCGTC CTCATCGAAA CCCTGTCCAA 

3 51 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 
401 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

4 51 ATCGCCCATC TTCAAATGGA ACGACAGCAT CTCCGGgCCG CACCAAAACC 
501 GCCGCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 378; ORF 107>: 

ml07 .pep. . 

1 MVLTFIWAAG GLIADA KPSV LFDTQAPAGT NYVJI YAXTAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IW TGTLIALV IYVLWQTAI Q GNLPRNEFAP 
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101 VIAAEGQVSV LIETLSKFAQ TGNMDK ILSL F5YMAIATSF LGV TLGLFDY 
151 IAHLQMERQH LRAAPKPPR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 107 shows 89.4% identity over a 170 aa overlap with a predicted ORF (ORF 107.ng) 
from N. gonorrhoeae: 

ml07/gl07 

10 2C 30 40 50 60 

ml 07 .pep MVLTFI WAAGGLI ADAKPSVLFDTQAPAGTNYW I YAXTALPVCLAS FGFHGNV SS LLKYF 

I | | | I I I I : I I | s | | | M I I I I II I I h I 1 = I I I I I I I I i I I I I I I I I M I I I I I I I I I 
gl07 MVLTFIWATGGLVADAKPSVLFDTQAPVGTGYWIYAATALPVCIiASFGFHGNVSSLLKyF 
10 2C 30 40 50 60 

70 8C 90 100 110 120 

ml 0 7 . pep KGDAPKVAKS I WTGTL I ALV I Y VLWQTA1 QGNLPRNEFAPV I AAEGQV S VLI ETLS KFAQ 

I I II III I III hll hlllll II I M I Ihllllllllll Ml I hill II Mil II 
gl07 KGDAPKVAKS I WAGTLVALV I YVLWQTAI QSNLPRNEFAP VI AAERQLS VLNETLS KFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

ml 07 . pep TGNMDKI LSLFSYMAI ATS FLGVTLGLFDY I A- HLOMERQHLRAAPKPPR 

I hi I II Ml I II II II I II MM III I II MllllhMMIM I 

gl 0 7 TGDMDKI LSL FPYMAI ATS FIX5VTLGLFDNI AGHLQMERQYVRAAPKPSR 

130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 379> : 
al07 .seq 



3CT CAAATACTTT AAAGGCGACG CGCCCAAAGT 
:gg GCACACTGAT TGCGCTGGTA atttacgtcc 
:aa GGCAACCTGC CGCGCAACGA GTTCGCCCCC 
jCA agtctccgtc ctgattgaaa CCCTGTCCAA 
\TA TGGACAAAAT attgtccctg ttttcctata 
rTT ttaggcgtaa cgctcggact cttcgactac 



1 


atggtattaa 


51 


ACTGCCCGTC 


101 


TTTATGTCGC 


151 


GGCAACGTCT 


201 


GGCTAAATCC 


251 


TCTGGCAAAC 


301 


GTGATTGCCG 


351 


ATTCGCCCAA 


401 


TGGCGATCGC 


451 


ATCGCCGACA 


501 


CGCCGCGCTG 


551 


CCGGCTTTGT 


601 


ACAGGCATCA 


651 


CGCAGGCAAA 


701 


TCTTCCTTTT 


751 


GAACTCGTCC 



This corresponds to the amino acid sequence <SEQ ID 380; ORF 107.a>: 

al07 .pep 

1 MVLTFIWATG GLIADAKLPV LFDTQAPTGT NYWIYVATAL PVCLASFGFH 

51 GNVSS LLKYF KGDAPKVAKS 1WTGTLIALV IYVLWQTAIQ GNLPRNEFAP 

101 VIAAEGQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY 

151 IADIFKWNDS VSGRTKTAAL TFLPPLISCL LFPTGFVTAI GYVGLAATVW 

201 TGIIPAMLLY RSRKKFGAGK TYKVYGGLWL MVWVFLFGIV NIAAQVLSQM 

251 ELVPVFKG* 

ml07/al07 94.8% identity in 154 aa overlap 

10 2C 30 40 50 60 

ml 07 .pep MVLTFIWAAGGLIADAKPSVLFDTQAPAGTNYWIYAXTALPVCLASFGFHGNVSSLLKYF 
I II I i I I I : I I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I M I I i I I I I I I I I I I 
a!07 MVLTFIWATGGLIADAKLPVLFDTQAPTGTNYWIYVATALPVCLASFGFHGNVSS LLKYF 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml07 . pep KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 
| | | | | I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II [ I I I I I I I I I I 
al07 KGDAPKVAKSIWTGTLIALVIYVLWQTAtQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

m!07 . pep TGNMDKILSLFSYMAIATSFLGVTLGLFDYIAHLQMERQHLRAAPKPPRX 

I I | I I I I I I I I II I I I I 1 I II I I I I t I I I i I I : 
a i07 TGNMDKILSLFSYMAIATSFLGVTLGLFDYIADIFKWNDSVSGRTKTAALT FLPPLISCL 

130 140 150 160 170 180 

al07 LFPTGFVTAIGYVGLAATVWTGIIPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIV 
190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 381>: 

gl08 . seq 

1 ATGttgccgg gCTTCAACCG GATATTCAaa cggTTTGCTC CAACACTCGG 

51 AAcggCGCAT AAAACGCCgc ccTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATCCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGCG GCAATCTTAC TCGGCGGGTG 

201 CGCCGCCGGC GGCAACACAT TCGGCAGCTT AGACGGCGGC ACGGGTATGG 

251 GTGGCAGCAT CGTCAAAATG ACGGTAGAAA gccAATGCCG TGCGGAATTG 

301 GACAGGCGCA GCGAATGGCG TTTGACCGCG CTGGCGATGA GTGCCGAAAA 

3 51 ACAGGCGGAA TGGGAAAACA AGATTTGCGG CTGCGCTACC GAAGAAGCAC 

401 CTAACCAGCT GACCGGCAAC GATGTGATGC AGATGCTGAa ccagtccacG 

451 CGCaatcagg cacTtgccgc CCtgaccgTC AAAacggtTT CcgcctgcTT 

501 CAaacgcctg tACCGCTAa 
This corresponds to the amino acid sequence <SEQ ID 382; ORF 108.ng>: 
glOS.pep 

1 MLPGFNRIFK RFAPTLGTAH KTPPFALSRT GRLIRSYRHK RRGFNRKGIE 

51 MNK TLSILPA AILLGGCAAG GN TFGSLDGG TGMGGSIVKM TVESQCRAEL 

101 DRRS EWRLTA LAMSAEKQAE WENKI CGCAT EEAPNQLTGN DVMQMLNQST 

151 RNQALAALTV KTVSACFKRL YR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 383>: 

ml08. seq 

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGA GGCGGTAACA CATTCGGCAG CTTAGACGGT GGCACAGGCA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTTG GGAGCCAATG CCGTGCGGAA 

301 TTGGACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGCCG GAGTGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

401 CACCCGAACG GATGACCGGC AACGATGTGA TGCAGATGCT GGCTCCGTCC 

451 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 384; ORF 108>: 

ml08 .pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG GG NTFGSLPG GTGMGGS I VK MAVGSQCRAE 

101 LDKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPERMTG NDVMQMLAPS 

151 TRNQALAALT AKTVSACFKH LYR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 108 shows 89.6% identity over a 173 aa overlap with a predicted ORF (ORF 108.ng) 
from K gonorrhoeae: 

ral08/gl08 
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10 20 30 40 50 60 

ML PG FNRI FKRFVPTLGTAHKTP P FALSRTGRLI RFYRHKRRGFNRKGI EMNKTLS I LPV 

M 1 1 II Ml III ■ III II M li MUM ! IMIII 1 1 1 1 M M 1 1 1 1 M 1 1 1 M 1 1 1 h 

ML PG FNRI FKRFAPTLGTAHKTPPFALSRTGRLIRS YRHKRRGFNRKGI EMNKTLS I LP A 
10 20 30 40 50 60 

70 80 90 100 110 120 

A I LLGGCAAGGGNTFGSLDGGTGMGGS IVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 

1 1 ! 1 1 1 1 1 1 1 1 MHMMMMMMMIM 1 M f I M I H 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AILLGGCAAGG-NTFGSLIX3GTCMGGSIVKMTVESQCRAELDRRSEWRLTALAMSAEKQA 
70 80 90 100 110 

130 140 150 160 170 

E WEN KI C ACVAQ E A P ERMTGNDVMQMLAPS TRNQALAALTAKWSACF KHL YRX 

Mill I IM-M ll-MIII I III I 1 1 M 1 1 1 II M M 1 1 1 M 1 1 M 1 1 

E W EN KI CGCATE E APNQLTGND VMQMLNQSTRNQALAALTVKTVS ACF KRL YRX 
120 130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 385>: 
alOS.seq 

1 ATGTT GCCGG GCTTCAACCG GATATTCAAA CGGTTT3TTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGG GGCGGTAACA CATTCGGCAG CTTAGACGGC GGCACAGGTA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTAG AAAGCCAATG CCGTGCGGAA 

301 TTGAACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAATGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

401 CACCCAACCA GCTGACCGGC AACGATGTGA TGCAGATGCT GGATCCGTCC 

451 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 386; ORF I08.a>: 
al08.pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 
51 MNKT LSILPV AILLGGCAAG GG NTFGSLDG GTGMGGSIVK MAVESQCRAE 
101 LNKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPNQLTG NDVMQMLDPS 
151 TRNQALAALT AKTVSACFKH LYR* 

ml08/al08 96.5% identity in 173 aa overlap 

10 20 30 40 50 60 

ml08.pep MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLS1LPV 
I M I I II M I! I I I I I I II II II I I I I I I I I I I II I I M I I I I II I I I I I I I I I I I I I I I 
al08 MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml08 .pep AI LLGGCAAGGGNTFGSLDGGTGMGGS I VKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 

I i I I I I I I I I I I I I I I I M I i I I I I II I I I I I I I I I I I I I: I I I I I I I I I I I I I M I I I 
al08 AI LLGG C AAGGG N T FG S LDGG TGMGG S I VKMAVE S QCRAE LNKR S EWR LT ALAM S AE KQA 

70 80 90 100 110 120 

130 140 150 160 170 

ml08 . oep EWENKICACVAQEAPERMTGNDVM(^1APSTRNQALAALTAKTVSACFKHLYRX 
I I I I I I I I I I I I I I I : : : I I I I I I I I I II I I I I I I I I I I I I I I I I II I ! I I 1 I 
al08 E W ENK 1 C AC V AQE AP N QLTGN DVMQM LD P S T RNQ ALAALT AKTV S AC FKHL YRX 

130 140 150 160 170 



tnl08 .pep 

gios 

pi 08 .pep 

gl08 

ml 08 .pep 
gl08 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 387>: 
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gl09.seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GATCGTAGGC GTATGCTTAC CGCTTTTGGA AGCGGGCATG 

101 GAAATGACGC GCAAAGGCAA AACCACCCAA TCCGCCGCCA TCGTGGTGTT 

151 CTCTTCCGTC TGGTCAATCC GGTTTTCGGC TGGGCGTTGA CGATGCTGTT 

201 GGATAATTTG GGCTTAATCG GCTGCAAAGA ACGCAGCGCG CAATTAGGTT 

251 TTGTCG GACG AGTATTGATA CCCGCAGTAG GTTTCTTAAT CTTGTGTGTG 

301 GCGATGGGTG CGGTCGGGAT GCTGCCCGGT ATCCCTCCGT TTTTGGAGCA 

351 GTTCAAATCT TTGGGCTAG 

This corresponds to the amino acid sequence <SEQ ID 388; ORF 109 .ng>: 
gl09.pep 

1 MYYRRWGLS DGLGDLAAG I DRRRMLTAFG SGHGNDAQRQ NHPIRRHRGV 
51 LFRLVNPVFG WALTMLLDNI, GLIGCKERSA QLGFVG RVLI PAVGFLILCV 
101 AMGAVGMLPG IPPFLEQFKS LG 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 389>: 

ml09 .eeq 

1 ATGTATTATC GCCGGGTTAT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CATCGTGGTG TTCTCTTCCG CCTTGTCAAT CCGGTTTTCG GCTGGGCGTT 

201 GACGATG CTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGTG 

251 CGCAATTAGG TTTCGCCGGA CGCGTGTTGA TACCCGCAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAA CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 109>: 

ml 09 .pep 

1 MYYRRVMGLS DGLGDLAAG I ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFAG R VLIPAVGFL 

101 ILCVAMGA VG MLPGI PPFLE HFKSLG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 109 shows 92.9% identity over a 126 aa overlap with a predicted ORF (ORF 109.ng) 
from N. gonorrhoeae: 

ml09/gl09 

10 20 30 40 50 60 

MYYRRVMGLS DGLGDLAAG I ERSLGRRRI LTAFGSGHGNDAQRQNHP IRRHRGVLFRLVN 

MlllhlllllMIIIMhl Ihllllllllllillllllllllllllllllll 

MYYRRWGLS DGLGDLAAG 1 DR RRMLTAFGSGHGNDAQRQNHP IRRHRGVLFRLVN 

10 20 30 40 50 

70 80 90 100 110 120 

PVFGWALTMLLDNLGLI GCKERSAQLGFAGRVL I PAVGFLI LCVAMGAVGMLPGI PPFLE 

Ml IIMMIIIllllMMIIIIIllhllllllllll II II I Mill I !MI UN I 

PVFGWALTMLLDNLGL I GCKERSAQLGFVGRVLI PAVGFL I LCVAMGAVGMLPGI PPFLE 
60 70 80 90 100 110 



ml09 .pep 
9109 

ml 09 .pep 
gi09 



HFKSLGX 

=11111 
OFKSLGX 
120 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 391>: 

al09. seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CACCGTGGTG TTCTCTTCCG CTTGGTCAAT CCGGTTTTCG GCTGGGCGTT 



ml09 .pep 
9109 
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201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGCG 

251 CGCAATTAGG TTTCACCGGA CGCGTATTGA TACCCGTAGT AGGTTTCTTG 

203. ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

251 GTTTTTGGAG CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 392; ORF 1 09>: 
al09.pep 

1 MYYRRVVGLS DGLGDLAAGI 2RSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFTG R VLIPWGFL 

101 ILCVAMGAVG MLPGIPPFLE MFKSLG* 



ial09/al09 



97.6% identity in 126 aa overlap 



10 20 30 40 50 60 

ml09.pep M Y Y RRV MG L S DG LG DL AAG I E R S LG RRR I LT A FG S G HGN DAQRQN H P I RRHRG VL FR L VN 

I I i I I I : I I I I I I I I I I I I I I i I I I I I I I I I I I I ! I II I I I I I I I I I I I I I I I I I I I I I I 
al09 MYYRRWGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 09. pep PVFGWALTMLLDNLGLIGCXERSAQLGFAGRVLIPAVGFLILCVAHGAVGMLPGIPPFLE 
I I I I I I I I I I I I I I I I I I I I I I I ! I I 1 I : I I i I I I : I I I I I I I I I I I I I ! I I I I I I I I I i 
al09 PVFGWALTMLLDNLGLIGCKERSAQLGFTGRVLIPWGFLTLCVAKGAVGMLPGIPPFLE 

70 80 90 100 110 120 



ml09.pep HFKSLGX 
i I I I I I i 

a 109 HFKSLGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 393>: 

glll.seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGaacaaacC GCGCAaaccg 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCtATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

2 01 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGtCCaCC TACCAGACCG 

251 ATTCCGAAAT CAGCCGGTTt atacagacan atgctggaga gctcttcgcg 

301 tntcatgcag nttctataac tgattccgcc gaagactgtc tgcctaatac 

351 gcctatctca tcggcgctct ga 

This corresponds to the amino acid sequence <SEQ ID 394; ORF 1 1 1 .ng>: 

gill. pep 

1 MPSETRLPNL IRALIFALGF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF IQTAGELFAH 

101 AS I TDSAEDC LPNTPISSAL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 395>: 

mill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATAyCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AwAAACG CAT 

201 CGATGACGCG CTTAAAGAAk TCAACCGGyA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
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601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG cTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGcTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 396; ORF 1 1 1>: 
mill .pep 

1 MPSETRLPNF IRVLIFALGF IFLNA CSEQT AQTVTLQGET MGTTYXVKYL 

51 SNNRDKLPSP AEIXKRIDDA LKEXNRXMST YQPDSEISRF KQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKI ILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQTIVPL 

251 NNRSLATSGD YRI FHVDKNG KRLSHIINPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

ORF 1 1 1 shows 88.7% identity over a 97 aa overlap with a predicted ORF (ORF 1 1 1 ,ng) 
from N. gonorrhoeae: 

mill .pep/glll. pep 

10 20 30 40 50 60 

mill. pep MPSETRIjPNFI rvli falgfiflnacseqtaqtvtlqgetmgttyxvkylsnnrdklpsp 

I I I I I I I I h I I : I I I I I I I I I I I I i I I I ! I I I I I I I I I I I I I I : I I I I I I I I I I I I I I 
gill MPSETRLPNLIRALIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
10 20 30 40 50 60 

70 80 90 100 110 120 

mill . pep AE IXKRIDDALKEXNRXMSTYQPDSEI SRFNQHTAGKPLRI SSDFAHVTAEAVRLNRLTH 

hi lllllllll I! Illll lllllll I :|h 
gill AKIQKRIDDALKEVNRQMSTYQTDSEISRFIQTXAGELFAXHAXSITDSAEDCLPNTPIS 
70 80 90 100 110 120 

130 140 150 160 170 180 

mill . pep GAliDVWGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKI ILKQGKD YASLSKTHPK 

gill SALX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 397>: 

alll . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 

701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
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801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 398; ORF 1 1 1 .a>: 

alll .pep 

1 MPSETRLPNF IRTLIFALSF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKI I LKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

mlll/alll 97.7% identity in 351 aa overlap 

10 20 30 40 50 60 

mill. pep MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 
I I I I I I I I I I I I : I I I I I : I I I I I I I I I I I I I II I I i I I I I I I II : I i II I I I I I M I I I 
alll MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

mill, pep AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
III 1 I I I I I I I I II I I I I I I I I I II I II I I M I I I 1 I I I I I I I II I I I I I : 1 I I I I I 
alll AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill, pep G ALD VT VG P LVN LWG FG P DKS VT RE P S PEQ I KQAAS Y T G I DK 1 1 LKQG K D Y A S L S KTH PK 

1 1 1 1 1 1 M 1 1 1 1 M 1 1 m 1 1 1 1 1 1 1 1 1 1 ii i it 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 i I 

alll GALDVTVGPLVNLWGFGPDKSVTRE PS PEQI KQAAS YTG I DKI I LKQGKD YASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

mill . pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGE PWRIGIEQPNIVQ 

1 1 1 1 1 1 1 1 1 f t i r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 f i t i i 

alll AYLDLS SI AKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGE PWRIGIEQPNIVQ 

190 200 210 220 230 240 

250 260 270 280 . 290 300 

mill, pep GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISWADSAM 
I I I I II I I I I I I I I I I II I I I I I I ! I I I : I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
alll GGNTQIIVPLNNRSLATSGDYRIFHVDKSGKRLSHIINPNNKRPISHNLASISWADSAM 

250 260 270 280 290 300 

310 320 330 340 350 

mill. pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I 
alll TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 399>: 

glll-l.seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 
51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAacCG 
101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCTATACCGT CAAATACCTT 
151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 
201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TACCAGACCG 
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251 


ATTCCGAAAT 


CAGCCGGTTC 


301 


ATTTCAAGCG 


ATTTCGCACA 


351 


CCTGACTCAC 


GGCGCACTGG 


401 


GGGGGTTCGG 


CCCCGACAAA 


451 


ATCAAACAGG 


CGGCATCTTA 


501 


AGGCAAAGAT 


TACGCTTCCT 


551 


ATTTATCTTC 


GATTGCCAAA 


601 


CTGGAAAAAT 


ACGGCATTCA 


651 


GCACGGCAAA 


GGCAAAAATG 


701 


AGCAACCCAA 


TATcatccaa 


751 


aaCaaccgtt 


cgcttgccac 


801 


TAAAAACGGC 


Aaacgccttt 


851 


CCATCAGCCA 


CAAcctcgcc 


901 


ACGGCGGACG 


GTTTATCCAC 


951 


CTTAAGGCTG 


GCAGAACAAG 


1001 


ATAAGGACGG 


CTACCCCACC 


1051 


CGCTAA 





AACCAACACA CAGCCGGCAA GCCCCTCCGC 
CGTTACCGCC GAAGCCGTCC GCCTGAACCG 
ACGTAACCGT CGGCCCTTTG GTCAACCTTT 
TCCGTTACCC GTGAACCGTC GCCGGAACAA 
TACGGGCATA GACAAAATCA TTTTGCAACA 
TGAGCAAAAC CCACCCCAAA GCCTATTTGG 
GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
AAATTATCTG GTCGAAATCG GCggcGAGTT 
CGCACGGCGA ACCGTGGCGC ATCGGTATAG 
ggcggcaata cgcAGattat cgtcccgctg 
ttccggcgAT taccgtaTTT tccacgtcgA 
CCCacATCAT CAATCCCAAC AACAAACGAC 
tcCATCAgCg TGGTCTCAGA CAGTGCAATG 
AGGATTATTT GTTTTAGGCG AAACCGAAGC 
AAAAACTCGC TGTTTTCCTA ATTGTCCGGG 
GCCATGTCTT CCGAATTTGC CAAGCTGCTC 



This corresponds to the amino acid sequence <SEQ ID 400; ORF 111-1 .ng>: 

glll-l.pep 

1 MPSETRLPNL IRALIFALGF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNKRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILQQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNAHGEPWR IGIEQPNIIQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVSDSAM 

301 TADGLSTGLF VLGETEALRL AEQEKLAVFL IVRDKDGYRT AMSSEFAKLL 

351 R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 401>: 

ml 11-1 . 8Qq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAACGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAACAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 402; ORF 1 1 1-1>: 

ml 11-1 .pep 

1 MPSETRLPNF IRVLIFALGF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



mlll-l/glll-1 96.6% identity in 351 aa overlap 

10 20 30 40 50 60 

ml 1 1- 1 . pep MPSETRLPNFTRVLI FALGFI FLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPS P 
I I I I I I I I I : I I : I ! I I I I i I M I I I I I M I M I I I I I I I I I I ! I I I I I I I f I I I I I I I ( 
glll-1 MPSETRLPNLIRALIFALGFIFLNACSECrTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
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10 20 30 40 50 60 

70 80 90 100 110 120 

mlll-1 .pep AEIOKRIDDALKEVNRQMSTYQPDSEISRFNOHTAGKPLRISSDFAHVTAEAVRLNRLTH 
I : I I ! I I I II I I I I I I I I I 1 1 I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I 
glll-1 AKIQKRIDDALKEVNRQMSTYQTDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

70 80 90 300 110 120 

130 140 150 160 170 180 

mlll-1 .pep GALDVTVGPLVNLWGFGPDKSVTPXPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I I I I i I I I I I I ! I I I I I I I I I J I I I 1 I I I I I I I I I I I I I I I I I I I : I I I I I I I II I I I I I 
glll-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILQQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

mlll-1. pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
Mill llll II I III III I MM III IMII I ill II II II II IMMI llllll II |:| 
glll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNAHGEPWRIGIEQPNIIQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

mlll-1 .pep GGNTQI IVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAM 
M II I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I 1 1 I 1 1 M M : I I II 
glll-1 GGNTQI IVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVSDSAM 

250 260 270 280 290 300 

310 320 330 340 350 

mlll-1 .pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 : 1 1 1 : i 1 1 ii 1 1 1 1 1 1 1 nmimi IMII 

glll-1 TADGLSTGLFVLGETEALRLAEQEKLAVFLIVRDKDGYRTAMSSEFAKLLRX 
310 320 330 340 350 

glll-l/p44550 

sp|P4 4550|YOJL_HAEIN HYPOTHETICAL LIPOPROTEIN HI0172 PRECURSOR >gi 1 1074292 Ipir | | C64144 
hypothetical protein HI0172 - Haemophilus influenzae (strain Rd KW20) >gi 1 1573128 (U32702) 
lipoprotein, putative (Haemophilus influenzae Rd] Length =34 6 
Score - 349 bits (885), Expect = 2e-95 

Identities = 177/328 (53%), Positives = 240/328 (72%), Gaps = 4/328 (1%) 

Query: 23 LNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSPAEIXKRIDDALKEXNRXKSTYQ 82 

L AC ++T + ++-L G+TMGTTY VKYL + S + + 1+ LK+ N KSTY+ 

Sbjct: 17 LAACQKET-KVISLSGKTMGTTYHVKYLDDGSITATS-EKTHEEIEAILKDVNAKKSTYK 74 

Query: 8 3 PDSEISRFNQHT-AGKPLRISSDFAHVTAEAVRLNRLTHGALDVTVGPLVNLWGFGPDKS 141 

DSE+SRFNQ+T Pi- IS+DFA V AEA+RLN++T GALDVTVGP+VNLWGFGP+K 
Sbjct: 75 KDSELSRFNQNTQVNTPIEISADFAKVLAEAI RLNKVTEGALDVTVGPVVNLWGFGPEKR 134 

Query: 142 VTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPKAYLDLSSIAKGFGVDKVAGEL 201 

++P+PEQ+ + ++ G1DKI L K+ A+LSK P+ Y+DLSSIAKGFGVD+VA +L 
Sbjct: 135 PEKQPTPEQLAERQAWVGIDKITLDTNKEKATLSKALPQVYVDLSSIAKGFGVDQVAEKL 194 

Query: 202 EKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQGGNTQIIVPLNNRSLATSGDY 2 61 

E+ QMY+VE1GGE+ KGKN G+PW+I IE+P + ++ LNN +A+SGDY 

Sbjct: 195 EQLNAQNYMVEIGGEIRAKGKNIEGKPWQIA1EKPTTTGERAVEAVIGLNKMGMASSGDY 254 

Query: 262 RIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAMTADGLSTGLFVLGETEALKLA 321 

RI+ ++NGKR +H I+P PI H+LASI+V+A ++MTADGLSTGLFVLGE +AL++A 
Sbjct: 255 RIY-FEENGKRFAHEIDPKTGYPIQHHLASITVLAPTSMTADGLSTGLFVLGEDKALEVA 313 

Query: 322 EREKLAVFLIVRDKGGYRTAMSSEFEKL 349 

E+ LAV+LI+R G+ T SS F+KL 
Sbjct: 314 EXNNLAVYLIIRTDNGFVTKSSSAFKKL 341 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 403>: 

alll-l ,0oq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 
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351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 

701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CT7AAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 



This corresponds to the amino acid sequence <SEQ ID 404; ORF 1 1 1-1 .a>: 

all 1-1. pep 

1 MPSETRLPNF I RTLI FALSF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YOPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRI FHVDKSG KRLSHIINPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



alll-l/ralll-1 S8.9% identity in 351 aa overlap 



10 20 30 40 50 60 

all 1- 1 . pep MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
I I | I I I I 1 I I I I : I II I I :l I I I I I I II I I I I II I I I 1 I I I I I I I I I I I I I I I I I I i I t I 
mil 1- 1 MPSETRLPNFI RVLI FALGFI FLNACSEQTAQTVTLQGETMGTrYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 



70 80 90 100 110 120 

all 1- 1 , pep AEIQKRI DDALKEVNRQMSTYQPDSE1SRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I 1 I I I I I I \ I I I : I I I I I I 
mil 1- 1 AEIQKRI DDAXKEVNRCMSTYQPDSEISRFNQHTAGKPLRI SS DFAH VTAEAVRLNRLTH 

70 80 90 100 110 120 



130 140 150 160 170 180 

all 1-1 . pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKI ILKQGKDYASLSKTHPK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I 
mlll-1 GALDVTVGPLVNLWGFGPDKS VT RE PS PEQIKQAASYTGIDKI ILKQGKDYASLSKTHPK 

130 140 150 160 170 180 



190 200 210 220 230 240 

alll- 1 . pep A YLDL SS I AKG FG VDK V AG ELEK YG I QN YLVE I GGE LHGKG KNARGE PW RI G I EQ PN I VQ 
I I I ! I I I I I I I I I I I I I I I I I I I i I I I I 1 I I I I I I I I I ! I I I I I I I 1 I I I i I I I I I I I I I 
mlll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 

190 200 213 220 230 240 



250 260 270 280 290 300 

all 1-1. pep GGNTQIIVPLNNRSLATSGDYRIFHVDKSGKRLSHIINPNNKRPISHNLASISWADSAM 
I I I I I I I I I I II I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 1 1- 1 GGNTQI I VPLNNRSLATSGDYRI FH VDKNGKRLSHI I NPNNKRP I SHNLAS I S WAD SAM 

250 260 270 280 290 300 



310 320 330 340 350 

a 11 1-1 .pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I I I I I I I I I I I I t I I I I I I i I I i I I I I I I I I I 1 I I I I II 1 I I I I 
mil 1-1 TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
310 320 330 340 350 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 405>: 

gll4 . Geq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCAGCAGG AATGCAG CAA 
51 GACTTTTTTA TGTCCGCCGG GCGGGACGAG TATGGGGCGG TCAATGTCGG 
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101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTCGAA 

151 TACGGTCAAA GCGGCTATTT TACCAGAGCC GCCGAATGTA AAACAGGGTG 

201 TCAGGGCATC AGCCCGAGCT GCCTGAACGA ACGGACGGTT TGCGAGGTAA 

251 CGATAAAATG GTCGAGCAGC GAAACATCAA CCAGCGACAT GGCCTGTGCC 

301 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAAGGTTCAG GCGAGCCGCC 

351 CGGATGGTTG TGCGCGATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

401 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 406; ORF 1 14.ng>: 

gll4 .pep 

1 MASITSPLHG AQQECSKTFL CPPGGTSMGR S MSVTVGLFC VSINLTISV E 

51 YGQSGYFTRA AECKTGCQGI SPSCLNERTV CEVTIKWSSS ETSTSDMACA 

101 SRLVNMMSSC EGSGEPPGWL CAIIRLSAYS SNASLTISRM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 407>: 

m!14.seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCACAGAG AATGCAGCAA 

51 GACTTTTTTA TGTCCACCGG GCGGGACGAG TATAGGGCGG TCAATGTCGG 

101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTTGAA 

151 TACGGTTGAA GCGGCTATTT TATCAGAGCC GCCGCATGTA AAACAGAGTG 

201 TCAGGGCATC AACCCGAGCT GTCTGAACGA ACAGACGCTT TGCGAkGTAA 

251 CGATAAAATG GTCGAGCAGC GACACATCGA CCAGCGACAT TGCCTGTGCC 

301 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAArGTTCsG GCGAGCCGcC 

351 CGgATGGTTG TGCGCAATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

401 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 408; ORF 1 14>: 

mll4.pep 

1 MASITSPLHG AHRECSKTFL CPPGGTSIGR S MSVTVGLFC VSINLTISV E 

51 YGXSGYFIRA AACKTECQGI NPSCLNEQTL CXVTIKWSSS DTSTSDIACA 

101 SRLVNMMSSC EXSGEPPGWL CAIIRLSAYS SNASLTISRM * 



mll4/gll4 90.0% identity over a 140 aa overlap 

10 20 30 40 50 60 

mll4 . pep MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSGYFIRA 

II I III II II l-l lllll I Mill Ihl I !l I II I II I IN III I I I I! II I I I I II 
gll4 - MAJSITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVE YGQSGYFTRA 

10 20 30 40 50 60 



70 80 90 100 110 120 

mll4.pep AACKTECQGINPSCIiNEQTLCXVTIKWSSSDTSTSDIACASRLVlJMMSSCEXSGEPPGWL 
I III lllhlllllhhl III llllhl III hll III Mill! II I llllll I 
gll4 AECKTGCQGISPSCLNERTVCEVTIKWSSSETSTSDMA<^ 

70 80 90 100 110 120 



130 140 
mll4 .pep CA I I RLS AYSSNAS LTI SRMX 

III III ilill M llllll II 
g 1 1 4 CAI I RLSAYS SNAS LT I SRMX 

130 140 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 409>: 

al!4 . soq 

1 ATGCCGGAGG CAAGCATCGC CTCCATCACT TCGCCGCTGC ACGGGGCGCA 

51 ACAGGAATGC AGCAAGACTT 7TTTATGTCC GCCGGGCGGG ACGAGTATGG 

101 GGCGGTCAAT GTCGGTAACG GTAGGTTTGT TTTGTGTTTC CATTAACTTA 

151 ACGATATCTG TCGAATACGG 7TGAAGCGGC TATTTTATCA GAGCCGCCGC 

201 ATGTAAAACA GGGTGTCAGG GCATCAGCCC GAGCTGCCTG AACGAACGGA 

251 CGGTTTGCGC CGTTACGATA AAATGGTCGA GCAGCGACAC ATCGACCAGC 

301 GACATTGCCT GTGCCAGCCG. CCTTGTGAAC ATGATGTCTT CCTGCGAAGG 

351 TTCGGGCGAG CCGCCCGGAT GGTTGTGCGC GATAATCAGG CTGTCGGCAT 

401 ATTCGTCCAA TGCCAGTTTG ACAATTTCAC GGATGTAA 
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This corresponds to the amino acid sequence <SEQ ID 410; ORF 1 14.a>: 
all4 .pep 

1 MPEASIASIT SPLHGAQQEC SKTFLCPPGG TSMGRSM SVT VGLFCVSINL 
51 TISV EYG*SG YFIRAAACKT GCQGISPSCL NERTVCAVTI KWSSSDTSTS 
101 DIACASRLVN MMSSCEGSGE PPGWLCAIIR LSAYSSNASL TISRM* 

mll4/all4 92.9% identity in 140 aa overlap 

1C 20 30 40 50 

MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSG 
:||||||||||::|MI1IIIIIII1|: IIIIIIIIMIt II llllll III IN I 
MPEASIASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGXSG 
10 20 30 4 0 50 60 

60 7C 80 90 100 110 

YFIRAAACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGE 
I I I I ! M I I i I I I I : I I I I I : I: I I I t I t I I I I I 1 I I I t I I I I I I I I I I I I I I I I I 
YFIRAAACKTGCQGISPSCLNERTVCAVTIKWSSSDTSTSDIACASRLVNMMSSCEGSGE 
70 80 90 100 110 120 

120 13C 140 

PPGWLCAIIRLSAYSSNASLTISRMX 
I I I I I I I I I I II I i I I I I I MINI 
PPGWLCAIIRLSAYSSNASLTISRMX 
130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 41 1>: 

gll7 . eeq 

1 atggtcgacg aactcgacCT GCTGCCCGAT GCCGTCGCCG CCACCCTGCT 

51 TGCCGACATC GGACGCTACG TCCCCGATTG GAACCTATTG GTTTCCGAGC 

101 GCTGCAACAG CACCGTCGCC GAGCTGGTCA AAGGTGtgga CGAAGTGCAG 

151 AAACTTACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 

201 CGCACAGCAA GCGGAAACCA TGCGGAAAAT GCTGCTGGCg atggttaccg 

251 Acatccgcgt cgtaTTAATC AAACTGGCGA TGCGTacgcg caccCTGcta 

301 ttTTtaaGCA ACGCCCCCGA CAGCCCTGAA AAACgcgccG TCgccaaAga 

351 aacccTCGAC ATCTTCGCCC CGCTCGCCAA CCGCTTGGGC GTGTGGCAGC 

401 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 

451 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTCG ATATCCTGCG TACGGAACTC AAAAAATACA 

551 AT AT CCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGgccTGT TCGACATCCG 

651 CGCCGTGCGG ATTCTGGTCG ATACCGTCCC CGaGTGTTAC ACCAOGCTGG 

701 gcaTCGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGagtt CGAcgactAC 

751 ATCGCCAACC CCAAAGgcaA CGgttATAAA AG t TTGCACA CCGTCATCGT 

801 cggcccGGAa gacaaaggtg tggaaGtgCA AATCCGCACC TTCGAtatGC 

851 accAATTCaa CgaatTcggT gtcgccgCCC ACTGGCGtta caaagaaggc 

901 ggcaaaggcg attccGCCtA cgaacaaAAA ATcgccTggt TGCg ccaACT 

951 CTTGGACTGG CGCGAAAATA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCTCT GCCAACGGGC GCAACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TcggCGACCG CTGCCGGGGC GCGAAAGTCG 

1151 AaggGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGCGTC 

1201 GAAATcatta cCGCcaaAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGctgGGCC aAATCCGGCA AGGCCATCGG caaAATCCGC GCCTAcatCC 

1301 GCCAGcaaAa cgCcgaCACC GTGCGCGAAG AAGGCCGTGT CCAACTCGAC 

1351 AAGCAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTgccga 

1401 aaATCTCGGC tacaaAAAGC cagaagacct ctacacCGCc gtcggacaag 

1451 gegaaattte caaccgcgcc atCcaaaaag cctgcggcac GCTgaacgaa 

1501 ccgccccCCG TGCCCGTCAG CGCAACCACC ATCGTCAAAC AGTCCAAAAT 



ml 1 4 . pep 
all4 

ml 14 .pep 
all4 

ml 14 .pep 
a!14 
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1551 CAAAAAAGGT 3GCAAAACCG GCGTGCTCAT CGACGGCGAA GACGGCTTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGATATTGCC 

1651 GGCTTCGTTA CCCGCGAGCG CGGCATTTCC GTCCACCGCA AAACCTGCCC 

1701 CTCTTTCCGA CACCTTGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

1751 GTTGGGCGGC GTTGCAGGAA GGGCAAGTGT TCGCCGTCGA TATCGAAATC 

1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

18 51 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGtCAA CGacCTCCCG 

1951 CGCGTCCTCG CCGGCCTCGG CGATGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 412; ORF 1 17.ng>: 

gll7 .pep 

1 MVDELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMR KMLLA MVTDIRWLI KLAM RTRTLL 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLDILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

251 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 

4 01 EIITAKEGHP SVNWLYEGWV KSGKAIGKIR AYIRQQNADT VREEGRVQLD 

4 51 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 

501 PPPVPVSATT IVKQSKIKKG GKTGVt.TDGE DGIiMTTTaAKC CKPAPPDDIA 

551 GFVTRERGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 

601 RAQDRSGLLR DVSDAIiARHK LNVTAVQTQS RDLEASMRFT LEVKQVNDLP 

651 RVLAGLGDVK GVLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 413>: 

mll7.seq (partial) 

1 . . GTGAAACTCA AGAAATACAA TGTCCATTTC GAAGTCGCCG GCCGCCCGAA 

51 ACACATCTAC TCCATTTACA AAAAAATGGT GAAGAAAAAA CTCAGCTTCG 

101 ACGGCCTCTT TGACATCCGC GCCGTGCGAA TTCTGGTTGA TACCGTCCCC 

151 GAGTGTTACA CCACGCTGGG TATCGTCCAC AGCCTCTGGC AGCCCATTCC 

201 CGGCGAGTTC GACGACTACA TCGCCAATCC CAAAGGCAAC GGCTATAAAA 

251 GTTTGCACAC CGTCATCGTC GGCCCGGAAG ACAAAGGCGT GGAAGTACAA 

301 ATCCGCACCT TCGATATGCA CCAATTCAAC GAATTCGGTG TCGCCGCCCA 

3 51 CTGgCGTTAC AAAGAGGGCG GCAAGGGCGA TTCCGCCTAC GAACAGAAAA 

401 TCGCCTGGTT GCGCCAACTC TTGGACTGGC GCGAAAACAT GGCGGAAAGC 

451 GGCAAGGAAG ACCTCGCCGC CGCCTTCAAA ACCGAGCTTT TCAACGACAC 

501 GATTTATGTT TTC-ACCCCGC ACGGCAAAGT CCTCTCCCTG CCCACGGGCG 

551 CGACCCCCAT CGACTTCGCC TACGCCCTGC ACAGCAGCAT CGGCGACCGT 

601 TGCCGCGGTG CGAAAGTCGA AGGGCAGATT GTGCCGCTGT CCACCCCGCT 

651 CGAAAACGGA CAGCGCGTCG AAATCATTAC CGCCAAAGAA GGGCATCCTT 

701 CCGTCAACTG GCTTTACGAA GGCTGGGTCA AATCCAACAA GGCAATCGGC 

751 AAAATCCGCG CCTACATCCG CCAGCAAAAC GCCGACACCG TGCGCGAAGA 

801 AGGCCX3CGTC CAACTCGACA AACAGCTTGC CAAACTCACG CCCAAACCCA 

851 ACCTGCAAGA GCTTGCCGAA AATCTCGGCT ACAAAAAGCC AGAAGACCTC 

901 TACACCGCCG TCGGACAAGG CGAAATTTCC AACCGCGCCA TCCAAAAAGC 

951 CTGCGGCACg CTGAACGAAC CGCCGCCCGT ACCCGTCAGC GAAACCACCA 

1001 TCGTCAAACA GTCCAAAATC AAAAAAGGCG GCAAAAACGG CGTGCTCATC 

1051 GACGGCGAAG ACGGTCTGAT GACCACGCTT GCCAAATGCT GCAAACCCGC 

1101 GCCGCCCGAC GATATTATCG GCTTCGTTAC CCGCGAGCGC GgCATTTCAG 

1151 TGCACCGCAA AwyyTkCyCG TCTTTCCAAC ACCTCGCCGA ACACGCGCCC 

1201 GAwAAAGTGC TGGACGCAAG CTGGGCGGCA TTGCAGGAAG GACAAGTATT 

1251 CGCCGTCGAT ATCGAAATCC GCGCCCAAGA CCGCTCCGGG CTTTTGCGCG 

1301 ACGTATCCGA CGCGCTCGCC CGCCACAAAC TCAACGTTAC CGCCGTGCAA 

1351 ACCCAGTCCC GCGACTTGGA AGCCAGCATG AGGTTCACGC TCGAAGTCAA 

14 01 ACAAGTCAAC GACCTCCCGC GCGTCCTCGC CAGCCTCGGC GACGTCAAAG 

1451 GCGTATTGAG CGTTACCCGG CTTTAA 

This corresponds to the amino acid sequence <SEQ ID 414; ORF 1 17>: 

mll7.pep (partial) 
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1- . 


. . VKLKKYNVHF 


EVAGRPKHIY 


SIYKKMVKKK 


51 


ECYTTLGIVH 


SLWQPIPGEF 


DDYIANPKGN 


101 


IRTFDMHQFN 


EFGVAAHWRY 


KEGGKGDSAY 


151 


GKEDLAAAFK 


TELFNDTIYV 


LTPHGKVLSL 


201 


CRGAKVEGQI 


VPLSTPLBNG 


QRVEIITAKE 


251 


KIRAYIRQON 


ADTVREEGRV 


QLDKQLAKLT 


301 


YTAVGQGEIS 


NRAIQKACGT 


LNEPPPVPVS 


351 


DGEDGLMTTL 


AKCCKPAPPD 


DIIGFVTRER 


401 


X KVLD AS W AA 


LQEGQVFAVD 


IEIRAQDRSG 


451 


TQSRDLEASM 


RFTLEVKQVN 


DLPRVLASLG 



LSFDGLFDIR 
GYKSLHTVIV 
EOKIAWLRQL 



AVRILVDTVP 
GPEDKGVEVQ 
LDWRENMAES 



PKPNLQELAE NLGYKKPEDL 
ETTIVKQSKI KXGGKNGVLI 



L* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 117 shows 97.6% identity over a 490 aa overlap with a predicted ORF (ORF 1 17.ng) 
from N. gonorrhoeae: 

mll7/gll7 

10 20 30 

mll7. pep VKLKKYNVHFEVAGRPKHIYSI YKKMVKKKL 

- I I I I I ' I I I I I I I I II I I I I I I I I I I I II 
g 1 1 7 E KYRE I ALLLDE KRT ERLE YI ENFLDI LRTELKKYNI HFEVAGRPKH I YS I YKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

mll7 . pep S FDGLFD I RAVR I L VDTVP EC YTTLG I VHS L WQ P I PGE FDD Y 1 AN P KGNG Y KS LHTV I VG 

II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I 
g 1 1 7 S FDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 

210 220 230 240 250 260 

100 110 120 130 140 150 

mll7. pep P EDKGVEVQ IRT FDMHQFNEFGVAAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAE SG 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
gll7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 

160 170 180 190 200 210 

ml 17 . pep KEDLAAAFKTELFNDTI YVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

g 1 1 7 KEDLAAAFKTEL FNDTI YVLTPHGKVLS LPTGATP IDFAYALHS S I GDRCRGAKVEGQ I V 

330 340 350 360 370 380 

220 230 240 250 260 270 

mll7 . pep PLSTPLENGQRVEI I TAKEGHPSVNWLYEGWVKSNKAIGK.1 RAY IRQQNADTVREEGRVQ 

MIMIIIIIII III II II III! IlilllllllMMI llllllllllll Mil! Ill I 

gll7 PLSTPLENGQRVEI I TAKEGHPSVNWLYEGWVKSGKAIGKI RAY IRQQNADTVREEGRVQ 

390 400 410 420 430 440 

280 290 300 310 320 330 

mll7.pep LDKQLAKLTPKPNLQEL7AENLGYKKPEDLYTAVGQGEI SNRAIQKACGTLNEPP PVPVSE 

I III IIIIIMI II Mi llllll III II IIIIIIMI I UN lllllllll II III III 
gl 1 7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEI SNRAI QKACGTLNEPP PVPVS A 

450 460 470 480 490 500 



340 350 360 370 380 390 

nill7 . pep TTIVKQSKI KKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDI IGFVTRERGI SVHRKXXXS 

I III II I IIMI Ihll III III III II llllll I II II II IIIIIIMMIIIh I 

gll7 TTIVKQSKI KKGGKTGVLIDGEDGLMTTLAKCCKPAPPDDIAGFVTRERGI SVHRKTCPS 

510 520 530 540 550 560 



m!17 .pep 



400 410 420 430 440 450 

FQHLAEHAPXKVLDASWAALQEGQV FAVDI EI RAQDRSGLLRDVSDALARHKLNVTAVQT 
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hlllllll lillllMMIIIIIIIIIIIMIMIIIIIIIIIIIMIIIIIIIIIII 

gll7 FRHLAEHAPEKVLDASWAALQEGQVFAVDI E IRAQDRSGLLRDVSDAIiARHKLNVTAVQT 

570 590 590 600 610 620 

460 470 480 490 

mll7 . pep OSRDLEASMRFTLEVKOVNDLPRVLASLGDVKGVLSVTRLX 

I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I I 
gll7 OSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLSVTRLX 
630 640 650 660 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 415>: 
all7.seq 

1 ATGGTTCATG AACTCGACCT GCTCCCCGAT GCCGTCGCCG CCACCCTGCT 

51 TGCCGACATC GGACGCTACG TCCCCGACTG GAACCTATTC- GTTTCCGAAC 

101 GCTGCAACAG TACCGTCGCC GAGCTGGTCA AAGGTGTGGA CGAAGTGCAG 

151 AAACTCACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 

201 CGCCCAGCAG GCAGAAACTA TGCGGAAAAT GCTGCTGGCG ATGGTTACCG 

251 ACATCCGCGT CGTGTTAATC AAACTGGCGA TGCGTACGCC- CACCCTGCAA 

301 TTTTTAAGCA ACGCCCCCGA CAGCCCCGAA AAACGCGCCC- TCGCCAAAGA 

351 AACCCTCGAC ATCTTCGCCC CGCTCGCCAA CCGTTTGGGC GTGTGGCAGC 

4 01 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 

451 TACCGCGAAA 7CGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTTA ATATCCTGCG TACGGAACTC AAAAAATACA 

551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGGGTTGT TCGA CAT CCG 

651 CGCCGTGCGG ATTCTGGTTG ATACCGTCCC CGAGTGTTAC ACCACACTGG 

701 GCATTGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGAGTT CGACGACTAC 

751 ATCGCCAACC CGAAAGGCAA CGGCTATAAA AGTTTGCACA CCGTCATCGT 

801 CGGCCCGGAA GACAAAGGCG TGGAAGTGCA AATCCGCACC TTCGATATGC 

851 ACCAATTCAA CGAATTCGGT GTCGCCGCGC ACTGGCGTTA CAAAGAGGGC 

901 GGCAAAGGCG ATTCCGCCTA CGAACAAAAA ATCGCCTGGT TACGCCAACT 

951 TTTGGACTGG CGCGAAAACA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCCCT GCCCACAGGC GCGACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TCGGCGACCG TTGCCGCGGT GCGAAAGTCG 

1151 AAGGGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGTGTC 

1201 GAAATCATTA CCGCCAAAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGCTGGGTC AAATCCAACA AGGCAATCGG CAAAATCCGC GCCTACATCC 

1301 GCCAGCAAAA CGCCGACACC GTGCGCGAAG AAGGCCGCGT CCAACTCGAC 

1351 AAACAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTGCCGA 

1401 AAATCTCGGC TACAAAAAGC CAGAAGACCT CTACACCGCC GTCGGACAAG 

14 51 GCGAAATTTC CAACCGCGCC ATCCAAAAAG CCTGCGGCAC GCTGAACGAA 

1501 CCGCCGCCCG TACCCGTCAG CGAAACCACC ATCGTCAAAC AGTCCAAAAT 

1551 CAAAAAAGGC GGCAAAAACG GCGTGCTCAT CGACGGCGAA GACGGTCTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGACATTGTC 

1651 GGCTTCGTTA CCCGCGATCG CGGCATTTCG GTACACCGCA AAACCTGCCC 

1701 CTCTTTCCGA CACCTCGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

1751 GTTGGGCGGC GTTGCAGGAA GGACAAGTGT TCGCCGTCGA TATCGAAATC 

1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGTTAC CGACCTCCCA 

1951 CGCGTCCTCG CCAGCCTCGG CGACGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 416; ORF 1 17.a>: 

all 7. pop 

1 MVHELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMR KMLLA MVTDIRVVLI KLA MRTRTLQ 

101 FLSNAPDSPE KRAVAKETLD I FAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLNILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

251 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTSL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 
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401 EIITAKEGHP SVNWLYEGWV KSNKAIGKIR AYIRQQNADT VREEGRVQLD 

4 51 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 

501 PPPVPVSETT IVKQSKIKKG GKNGVLIDGE DGLMTTLAKC CKPAPPDDIV 

551 GFVTRDRGI3 VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 

601 RAQDRSGLLR DVS DALARHK LNVTAVQTQS RDLEASMRFT LEVKQVTDLP 

651 RVLASLGDVK GVLSVTRL* 



117/all7 98.0% identity in 490 aa overlap 



10 20 30 

mll7 .pep VKLKKYNVHFEVAGRPKH I YS I YKKMVKKKL 

: I I I H : I I I I I I II I I I I I I I I I I I I I I I 
alll EKYRE IALLLDEKRTERLEY IENFLN I LRTELKKYN I HFEVAGRPKH I YS I YKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

mll7.pep SFDGLFDIRAVRII.VDTVPECYTTLGTVHSLWQPIPGEFDDYIANPKGNGYKSI.HTVIVG 
t I I I I I I I I I M I I I I II I I I I M I I I II I I I I I I I I I I I II I I I I I I II I I I I I I I I I I 
all7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 

100 110 120 130 140 150 

mll7 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
I I I I I I I I I I I I I I I I II I I I I I I I 1 II I I I I I II ! I I I I I II I I I I I I I ! I I I I I I I I I 
all7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 



160 170 180 190 200 210 

mll7 . pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
I I I I I II I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
all7 KEDLAAAFKTELENDTIYVLTPHGKVLSLPrGATPIDFAYAUISSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 



220 230 240 250 260 270 

mll7.pep PLST PLENGQRVEI I TAKEGHPSVNWLYEGW VKSNKAIGKI RAY IRQQNADTVREEGRVQ 

I I II I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
all7 P LS T P LENG QR VE 1 1 TAKEGH P S VN WL YEGWVKSNKA IGK I RAY I RQQNADTVREEGR VQ 

390 400 410 420 430 440 



280 290 300 310 320 330 

mil 7 . pep LDKQLAKLTPKPNLOELAENLGYKKPEDLYTAVGOGEISNRAIOKACGTLNEPPPVPVSE 
I I II I M 1 1 I 1 1 I I I I I 1 1 I II I 1 1 1 1 1 I I II M I I I 1 1 II I I M I I I I I I I I I 1 1 II I I 

all7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
450 460 470 480 490 500 



340 350 360 370 380 390 

mll7 . pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
I I I I I I I I I I I I I I I I I I I I I I II I ! I II I I I I I I II II I I : II I II : II I I I I I I : I 
all7 TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKFAPPDDIVGF7TRDRGISVHRKTCPS 
510 520 530 540 550 560 



400 410 420 430 440 450 

mll7 . pep FQH L AE HA PXKV LD A SW AALQEGQ V F AV D I E I RAO DR SGLLRD V S D ALARH K LN VT AVQT 

I : I II I I I I I 11 I I I I I I M I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I 
all7 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 



460 470 480 490 

ml 17 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 
I I I I 1 1 I I I I I I II ! I I I : I I I I ! I I I I I I I I I I 1 1 I I ( I I 
all7 QSRDLEASMRFTLEVKQVTDLPRVLASLGDVKGVLSVTRLX 
630 640 650 660 
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The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 41 7>: 

gll7-l.seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CCCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTCGCTGG CGCAGGAACA TTATCCTGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTCGACGAAC TCGACCTGCT GCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGATTGGAA CCTATTGGTT 

301 TCCGAGCGCT GCAACAGCAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTTACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC ACAGCAAGCG GAAACCATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT ATTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCTGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG CTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCGATA TCCTGCGTAC GGAACTCAAA 

7 51 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTGTTCG 

851 ACATCCGCGC CGTGCGGATT CTGGTCGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGCA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGagttCGA 

951 cgactACATC GCCAACCCCA AAGgcaACGg ttATAAAAGt TTGCACACCG 

1001 TCATCGTcgg cccGGAagaa aaaggtgtgg aagtgcAAAT CCGCACCTTC 

1051 GATATGCacc AATTCaaCga ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAAGGCGGC AAAGGC GATT CCGCCTACGA ACAAAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAATATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTrT 

1251 GACCCCGCAC GGCAAAGTCC TCTCTCTGCC AACGGGCGCA ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGCTG CCGGGGCGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAAC GG AC A 

14 01 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCGGCAAGG CCATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGTGTCCA 

1551 ACTCGACAAG CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTGC CCGTCAGCGC AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGTGGC AAAAC CGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGCTTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

18 51 TATTGCCGGC TTCGTTACCC GCGAGCGCGG CATTTCCGTC CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTTGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGG CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCG GCCTCGGCGA TGTC AAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 418; ORF 117-l.ng>: 

gll7-l.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WSLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VDELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRK MLLAM 

151 VTDIRVVLIK LAM RTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLDILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVH5LW0 PIPGEFDDYI ANPKGNGYKS LHTVIVGPEE KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

4 01 LAAAFKTELF NDTIYVLTPH GKV1SLPTGA TPIDFAYALH SSIGDRCRGA 

4 51 KVEGQIVPLS TPLENGQRVE I ITAKEGHPS VNWLYEGWVK SGKAIGK1RA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGErSNRAI QKACGTLNEP PPVPVSATTI VKQSKIKKGG KTGVLIDGED 

601 GLMTTLAKCC KPAPPDDIAG FVTRERGISV HRKTCPSFRH LAEHAPEKVL 

651 DAS WAAL QEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLAGLGDVKG VLSVTRL* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 419>: 

mll7-1.8©q 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 
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51 


ATTGCGCGAA 


101 


AAAACCTCAT 


151 


GATGCCGCCA 


201 


GGCGCAAATG 


251 


CCCTGCTTGC 


301 


TCCGAACGCT 


351 


AGTGCAGAAA 


401 


AAGAACGCGC 


451 


GTTACCGACA 


501 


CCTGCAATTT 


551 


CCAAAGAAAC 


601 


TGGCAGCTCA 


651 


CGAAAAATAC 


701 


GCCTCGAATA 


751 


AAATACAATG 


801 


CATTTACAAA 


851 


ACATCCGCGC 


901 


ACGCTGGGTA 


951 


CGACTACATC 


1001 


TCATCGTCGG 


1051 


GATATGCACC 


1101 


AGAGGGCGGC 


1151 


GCCAACTCTT 


1201 


CTCGCCGCCG 


1251 


GACCCCGCAC 


1301 


ACTTCGCCTA 


1351 


AAAGTCGAAG 


1401 


GCGCGTCGAA 


1451 


TTTACGAAGG 


1501 


TACATCCGCC 


1551 


ACTCGACAAA 


1601 


TTGCCGAAAA 


1651 


GGACAAGGCG 


1701 


GAACGAACCG 


1751 


CCAAAATCAA 


1801 


GGTCTGATGA 


1851 


TATTATCGGC 


1901 


CCTGCCCGTC 


1951 


GACGCAAGCT 


2001 


CGAAATCCGC 


2051 


CGCTCGCCCG 


2101 


GACTTGGAAG 


2151 


CCTCCCGCGC 


2201 


TTACCCGGCT 



TGGTTCGACA 
CGGTACCGCA 
CGCCGTATGG 
GTTCATGAAC 
CGACATCGGA 
GCAACAGTAC 
CTCACCCACT 
CCAGCAGGCA 
TCCGCGTCGT 
TTAAGCAACG 
CCTCGACATC 
AATGGCAGCT 
CGCGAAATCG 
CATCGAAAAC 
TCCATTTCGA 
AAAATGGTGA 
CGTGCGAATT 
TCGTCCACAG 
GCCAATCCCA 
CCCGGAAGAC 
AATTCAACGA 
AAGGGCGATT 
GGACTGGCGC 
CCTTCAAAAC 
GGCAAAGTCC 
CGCCCTGCAC 
GGCAGATTGT 
ATCATTACCG 
CTGGGTCAAA 
AGCAAAACGC 
CAGCTTGCCA 
TCTCGGCTAC 
AAATTTCCAA 
CCGCCCGTAC 
AAAAGGCGGC 
CCACGCTTGC 
TTCGTTACCC 
TTTCCAACAC 
GGGCGGCATT 
GCCCAAGACC 
CCACAAACTC 
CCAGCATGAG 
GTCCTCGCCA 
TTAA 



GCTACTGCGC 
TGG7TGCTGG 
CGAGCCGCTG 
TCGACCTGCT 
CGC7ACGTCC 
CGTCGCCGAG 
TCGCCCGGGT 
GAAACTATGC 
GTTAATCAAA 
CCCCCGACAG 
TTCGCCCCGC 
CGAAGATTTG 
CGC7GCTTTT 
TTCCTCAACA 
AGTCGCCGGC 
AGAAAAAACT 
CTGGTTGATA 
CCTCTGGCAG 
AAGGCAACGG 
AAAGGCGTGG 
ATTCGGTGTC 
CCGCCTACGA 
GAAAACATGG 
CGAGCTTTTC 
TCTCCCTGCC 
AGCAGCATCG 
GCCGCTGTCC 
CCAAAGAAGG 
TCCAACAAGG 
CGACACCGTG 
AACTCACGCC 
AAAAAGCCAG 
CCGCGCCATC 
CCGTCAGCGA 
AAAAACGGCG 
CAAATGCTGC 
GCGAGCGCGG 
CTCGCCGAAC 
GCAGGAAGGA 
GCTCCGGGCT 
AACGTTACCG 
GTTCACGCTC 
GCCTCGGCGA 



CGCTCTGCCG 
CGCAGGAACA 
CCCGACCACT 
CCCCGATGCC 
CCGACTGGAA 
CTGGTCAAAG 
GGACAGCCTC 
GGAAAATGCT 
CTGGCGATGC 
CCCCGAAAAA 
TCGCCAACCG 
GGCTTCCGCC 
GGACGAAAAA 
TCCTGCGCGG 
CGCCCGAAAC 
CAGCTTCGAC 
CCGTCCCCGA 
CCCATTCCCG 
CTATAAAAGT 
AAGTACAAAT 
GCCGCCCACT 
ACAGAAAATC 
CGGAAAGCGG 
AACGACACGA 
CACGGGCGCG 
GCGACCGTTG 
ACCCCGCTCG 
GCATCCTTCC 
CAATCGGCAA 
CGCGAAGAAG 
CAAACCCAAC 
AAGACCTCTA 
CAAAAAGCCT 
AACCACCATC 
TGCTCATCGA 
AAACCCGCGC 
CATTTCAGTG 
ACGCGCCCGA 
CAAGTATTCG 
TTTGCGCGAC 
CCGTGCAAAC 
GAAGTCAAAC 
CGTCAAAGGC 



GACAACGATA 
TTACCCCGCC 
TCCTCGGCGC 
GTCGCCGCCA 
CCTATTGGTT 
GTGTGGACGA 
GCCACGCCGG 
GCTGGCGATG 
GTACGCGCAC 
CGCGCCGTCG 
TTTGGGCGTG 
ATCAAAAGCC 
CGCACCGAAC 
TGAACTCAAG 
ACATCTACTC 
GGCCTCTTTG 
GTGTTACACC 
GCGAGTTCGA 
TTGCACACCG 
CCGCACCTTC 
GGCGTTACAA 
GCCTGGTTGC 
CAAGGAAGAC 
TTTATGTTTT 
ACCCCCATCG 
CCGCGGTGCG 
AAAACGGACA 
GTCAACTGGC 
AATCCGCGCC 
GCCGCGTCCA 
CTGCAAGAGC 
CACCGCCGTC 
GCGGCACGCT 
GTCAAACAGT 
CGGCGAAGAC 
CGCCCGACGA 
CACCGCAAAA 
AAAAGTGCTG 
CCGTCGATAT 
GTATCCGACG 
CCAGTCCCGC 
AAGTCAACGA 
GTATTGAGCG 



This corresponds to 

mll7-l .pep 

1 MTA1SPIQDT 

51 DAATPYGEPL 

101 SERCNSTVAE 

151 VTDIRVVLIK 

201 WQLKWQ1EDL 

251 KYNVHFEVAG 

301 TLGIVHSLWQ 

351 DMHQFNEFGV 

401 LAAAFKTELF 

451 KVEGQIVPLS 

501 YIRQQNADTV 

551 GQGEISNRAI 

601 GLMTTLAKCC 

651 DASWAALQEG 

701 DLEASMRFTL 



the amino acid sequence <SEQ ID 420; ORF 1 1 7-1 > 



QSATLQELRS 
PDHFLGAAQM 
LVKGVDEVQK 
LAMRTRTLQF 
GFRHQKPEKY 
RPKHIYSIYK 
PIPGEFDDYI 
AAHWRYKEGG 
NDTIYVLTPH 
TPLENGQRV3 
REEGRVQLDK 
QKACGTLNEP 
KPAPPDDIIG 
QVFAVDIEIR 
EVKQVNDLPR 



WFDSYCAALP 
VHELDLLPCA 
LTHFARVDSL 
LSNAPDSPEK 
REIALLLDEK 
KMVKKKLSFD 
ANPKGNGYKS 
KGDSAYEQKI 
GKVLSLPTGA 
IITAKEGHPS 
QLAKLTPKPN 
PPVPVSETTI 
FVTRERGISV 
AQDRSGLLRD 
VLASLGDVKG 



DNDKNLIGTA 
VAATLLADIG 
ATPEERAQQA 
RAVAKETLDI 
RTERLEYIEN 
GLFDIRAVRI 
LHTVIVGPED 
AWLRQLLDWR 
TPIDFAYALH 
VNWLYEGWVK 
LQELAENLGY 
VKQSKIKKGG 
HRKTCPSFQH 
VSDALARHKL 
VLSVTRL* 



WLLAQEHYPA 
RYVPDWNLLV 
ETMRKMLLAM 
FAPLANRLGV 
FLNILRGELK 
LVDTVPECYT 
KGVEVQIRTF 
ENMAESGKED 
SSIGDRCRGA 
SNKAIGKIRA 
KKPEDLYTAV 
KNGVLIDGED 
LAEHAPEKVL 
NVTAVQTQSR 



mll7-l/gll7-l 98.2% identity in 737 aa overlap 

10 20 30 40 50 60 

mll7-l . pep MTAISP1QDTQSATLQELREWFDSYCAALPDNDKHLIGTAWLLAQEHYPADAATPYGEPL 
I I I I I I I I I I I I I II I I I II I I I I II I I I II I I I I I I I I If I I I I I I I I I I I I I I I I I I 
g 117-1 r4TAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWSLAQEHYPADAATPYGEPL 

10 20 30 40 50 60 



70 



80 



90 



100 



110 



120 



WO 99/57280 



PCT/US99/09346 



336 



ml 1*7-1 - pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 

I I I I I I I I I I t I I I I I I I I I I I I I 1 I 1 I i I I I I I I I I I I I I I I I I I I I 1 I I II I I I I t I 
g 1 17 - 1 PDHFLGAAQMVDELDLLPDAVAATLLADI GRY VPDWNLLVS ERCNSTVAELVKGVDEVQK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 17-1. pep LTHFARVDSLATPEERAOOAETMRKMLLAMVTDIRVVLIKLAMRTRTLOFLSNAPDSPEK 

I I ii 1 1 1 1 n 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 

g 1 1 7 - 1 LTHFARV DSLATPEERAQQAETMRKMLLAMVTDI RVVLI KL AMRTRTLQFLSNAPDS PEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREI ALLLDEKRTERLEYI EN 
I | I I It I I I I I I I I I I I I I I I II I I I I I I I I I I M :l I I I I 1 I I I I I I ! I I I I II I I I I I 
g 1 17 - 1 RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEK YREI ALLLDEKRTERLEY IEN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 1 7 - 1 . pep FLNI LRGELKK YNVH FEVAGRPKHI YS I YKKMVKKKLSFDGLFDI RAVRILVDTVPEC YT 
I I : I I I I I I I I I : I I I I II I I I I I I I i I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I 
gl!7-l FLDILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 17-1 .pep TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I I I I I I I I I I I I I I I I I I 1 I I I I P I I I I I I I I II I : I I I I I I I I I I I II i I I I II I 
gll7-l TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEEKGVEVQ1RTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 17-1 . pep AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I II I I I 
gll7-l AAHWRYKEGGKGDSAYEQKIAWLROLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 17-1. pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
g 117-1 GK VT, SL P TG AT P I D F AY ALH S S I G DRC RGAKV EGOI V PL ST PL ENGQRV E 1 1 T AKEGH PS 

430 440 450 460 470 480 

490 500 510 520 530 540 

mll7-l .pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I I I I I I I I I : I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
g 1 1 7 - 1 VNWL YEGWVKSGKAIGKI RAYIRQQNADTVREEGRVQLDKQLAKLTFKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

mll7-l.pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKN3VLIDGED 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I : I I I I I I I I 
gll7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSATTIVKQSKIKKGGKT3VLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

mil 7-1. pep GLMTTLAKCCKPAPPDDI 1GEVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I i I I I I I I I I I I I 
gll7-l GLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 17-1 . pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 

I I I H 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I I I 1 1 1 1 I Ml 1 1 1 I 1 1 1 1 II 1 1 1 1 i 1 1 1 1 I 1 1 1 1 1 1 

gll7-l QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
670 680 690 700 710 720 

730 

ml 1 7 - 1 . pep VLASLGDVKGVLS VTRLX 
I II :||| II II Mi II II 
g 11 7-1 VLAGLGDVKGVLS VTRLX 

730 



mll7-l/RelA 
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spl P55133|RELA_VIBSS GTP PYROPHOSPHOKINASE { ATP : GTP 3 ' -PYROPHOSPHOTRANSFERASE) (PPGPP 
SYNTHETASE I) >gi|537617 (U13769) ppGpp synthetase I IVibrio sp.) Length = 744 
Score = 536 bits (1366), Expect = e-151 

Identities = 288/685 (42%), Positives = 432/685 (63%), Gaps = 31/685 (4%) 

Query: 74 LDLLPDAVAATLLADI GRYVPDWNLLVSERCNSTVAELVKGVDEVQKLTHFARVDSL 130 

L+D + ALL+ GYD 4 E 4 T4 LV4GV4++ + 4 ++ S 
Sbjct: 68 LSMDADTLIAALLYPLVEGGCYSTD A1KEEYSGTILHLVQGVEQMCAI S QLKST 121 

Query: 131 ATPEERAQQAETMRKMLLAMVTDIRWLIKLAMRTRTLQFLSNAPDSPEKRAVAKETLDI 190 

A +A 0 + +R+MLL+MV D R V+IKLA R L+ + + PD +RA A+E +1 
Sbjct: 122 AEETAQAAQVDNIRRMLLSMVDDFRCVVIKLAERICNLREVKDQPDEV-RRAAAQECANI 180 

Query: 191 FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIENFLNILRGELK 250 

+APLANRLG+ QLKW4+ED FR4Q P4 Y44IA L E+R 4R +YI +F++ L +K 
SbjCt: 181 YAPLANRLGIGQLKWETEDYAFRYQHPDTYKQIAKQLSERRIDREDYITHFVDDLSDAMK 240 

Query: 251 KYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQ 310 

N4 EV GRPKHI YSI++KM KK L FD LFD+RAVRI+ 4 4 +CY LG4VH4 44- 
SbjCt: 241 ASNIRAEVQGRPKHIYS1WRKMQKKSLEFDELFDVRAVRIVAEELQDCYAALGWHTKYR 300 

Query: 311 PIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEG- 369 

4P EFDDY+ANPK NGY+S+HTV++GPE K 4E4QIRT MH4 4E GVAAHW4YKEG 
Sbjct: 301 HLPKEFDDYVANPKPNGYQSIHTVVLGPEGKTIEIQIRTKQMHEESELGVAAHWKYKEGT 360 

Query: 370 — GKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPHGKVLSLP 427 

G SAY+4KI WLR+LL W4E M+4SG 44 4444F4D 4Y TP G V4 LP 
SbjCt: 361 ASGGAQSAYDEKINWLRKLLAWQEEMSDSG — EMLDELRSQVFDDRVYAFTPKGDVVDLP 418 

Query: 428 TGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPSVNWLYE- 486 . 

4 ATP4DFAY 4HS 4G RC GAKVEG4IVP 4 L4 G 4VEIIT KE 4PS 4WL 
Sbjct: 419 SNATPLDFAYHIHSEVGHRCIGAKVEGRIVPFTYHLQMGDQVEIITQKEPNPSRDWLNPN 478 

Query: 487 -GWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKL — TPKPNLQELAENLGYKKP 543 

G4V S44A K4 A+ R4Q4 D G4 L4 4L K4 T K A4 K P 

Sbjct: 479 LG FVTS S RARAKVH AW FRKQ DRDKN 1 1 AGKE I LEAEL VK I HAT LK DAQ Y Y AAK RFN VKS P 538 

Query: 544 EDLYTAVGOGEIS-NRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGV 594 

E4LY 4G G4+ N4 I 4N+P + 4 K S4 KK 4+ V 

Sbjct: 539 EELYAGIGSGDLRINQVINHINALVNKPTAEEEDQQLLEKLSEASNKQATSHKKPQRDAV 598 

Query: 595 LI DGEDGLMTTLAKCCKPAP PDDI IGFVTRERGI S VHRKTCPS FQHLAEHAPEKVLDAS W 654 

444G D LMT LAfCC4P P DDI GFVT4 RGISVHR C 4 L HAPE444D W 
Sbjct; 599 W EG VDN LMTHL ARC CQ P I P G D D I QGFVTQG RG I S VH RMDC EQLEELRHHAPE RI I DT VW 658 

Query: 655 AALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQ--SRDLEASMRFTLEV 712 

G 4 + + + A 4R+GLL4++++ L K4 V 4+++ + 4 M F LE4 
SbjCt: 659 GGGFVGN-YTITVRVTASERNGLLKELTNTLMNEKVKVAGMKSRVDYKKQMSIMDFELEL 717 

Query: 713 KQ VN DL PRVL AS LG D VKGVL S VT RL 7 37 

4 L RVL 4 VK V RL 
SbjCt: 718 TDLEVLGRVLKRIEQVKDVAEAKRL 742 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 42 1>: 

all7-l. s©q 

1 ATGACCGCCA 

51 ATTGCGCGAA 

101 AAAAACTTGT 

151 GATGCCGCCA 

201 GGCGCAAATG 

251 CCCTGCTTGC 

301 TCCGAACGCT 

351 AGTGCAGAAA 

401 AAGAACGCGC 

451 GTTACCGACA 

501 CCTGCAATTT 

551 CCAAAGAAAC 

601 TGGCAGCTCA 

651 CGAAAAATAC 

701 GCCTCGAATA 

751 AAATACAATA 

801 CATTTACAAA 



TCAGCCCGAT TCAAGACACG 
TGGTTCGACA GCTACTGCAC 
CTTAGCCGCC CGTTCGCTGG 
CGCCGTATGG CGAACCGCTG 
GTTCATGAAC TCGACCTGCT 
CGACATCGGA CGCTACGTCC 
GCAACAGTAC CGTCGCCGAG 
CTCACCCACT TCGCCCGGGT 
CCAGCAGGCA GAAACTATGC 
TCCGCGTCGT GTTAATCAAA 
TTAAGCAACG CCCCCGACAG 
CCTCGACATC TTCGCCCCGC 
AATGGCAGCT CGAAGATTTG 
CGCGAAATCG CCCTGCTTTT 
CATCGAAAAC TTCCTTAATA 
TCCACTTTGA AGTCGCCGGC 
AAAATGGTGA AGAAAAAACT 



CAAAGCGCGA CTCTGCAAGA 
CGCGCTGCCG AACAACGATA 
CGGAAGCACA TTACCCCGCC 
CCCGACCACT TCCTCGGCGC 
CCCCGATGCC GTCGCCGCCA 
CCGACTGGAA CCTATTGGTT 
CTGGTCAAAG GTGTGGACGA 
GGACAGCCTC GCCACGCCGG 
GGAAAATGCT GCTGGCGATG 
CTGGCGATGC GTACGCGCAC 
CCCCGAAAAA CGCGCCGTCG 
TCGCCAACCG TTTGGGCGTG 
GGCTTCCGCC ATCAAGAACC 
GGACGAAAAA CGCACCGAAC 
TCCTGCGTAC GGAACTCAAA 
CGTCCGAAAC ACATCTACTC 
CAGCTTCGAC GGGTTGTTCG 
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851 ACATCCGCGC CGTGCGGATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACACTGGGCA TTGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAACCCGA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTGCAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCGCACT GGCGTTACAA 

1101 AGAGGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTAC 

1151 GCCAACTTTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACAGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGTGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC CTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 CATTGTCGGC TTCGTTACCC GCGATCGCGG CATTTCGGTA CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTCGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGA CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTTACCGA 

2151 CCTCCCACGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 422; ORF 117-1 .a>: 

all7-l.pep 

1 MTAISPIQDT QSATLQELR3 WFDSYCTALP NNDKKLVLAA RSLAEA^YPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLNILRTELK 

251 KYNIHFEVAG RPKHI YSIYX KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGXIRA 

501 YIRQQNADTV REEGRVQLDX QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIVG FVTRDRGISV HRKTCPSFRH LAEHAPSKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVTDLPR VLASLGDVKG VLSVTRL* 

all7-l/mll7-l 97.7% identity in 737 aa overlap 

10 20 30 40 50 60 

MT A I S P I QDTQS ATLQELREW FD S YC AA L P DN DKNL I GT AWLLAQEH Y PA DAAT P YG E PL 
f I I I I I I I t I I I I I I I 1 1 I I I ! I I I I : I I h I I I : I : I II: I I I I I I I I I I I I i I 
MTAISPIQDTQSATLQELREWFDSYCTALPNNDKKLVLAARSLAEAHYPADAATPYGEPL 
10 20 30 40 50 60 

70 80 90 100 110 120 

PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVOK 
I i I I I! I I f I I I I I t I I 1 I I I I I I I I 1 I I I t I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
70 80 90 100 110 120 

130 140 150 160 170 180 

LTHFARVDSLATPEERAQQAETMRKMLLAMVTDI RVVLI KLAMRTRTLQFLSHAPDSPEK 
1 i I I I I I I I I I I I I I I I t I I I I i t I I i I I I I I I I t I I I I I I I I I ! I I I I I I I I I I I I I I I 
LTHFARVDSLATPEERAQQAETMRKMLLAMVTDI RVVLI KLAMRTRTLQFLSHAPDSPEK 
130 14U 150 160 170 180 

190 200 210 220 230 240 

RAVAKETLDIFAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIEN 
Mil II MllllllllllllllllillllllllliMlllllllllilltllllllllM 
RAVAKETLDIFAPLANRLGVWQLKWQLEDLGFRHQEPEKYREIALLLDEKRTERLEYIEN 
190 200 210 220 230 240 



mll7-l .pep 
all7-l 

mll7-l .pep 
all7-l 

mll7-l .pep 
a!17-l 

mll7-l .pep 
all7-l 
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250 260 270 280 290 300 

mll7-l pep FLNILRGELKKYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDI RAVRILVDTVPECYT 

I 1 1 1 1 1 l I i l I I : I I 1 1 1 1 l 1 1 1 1 i I i 1 1 1 1 1 1 1 1 I I I 1 1 i 1 1 1 1 1 1 I I I 1 1 I 1 1 1 1 1 1 

ain-2 FLNILRTELKKYNIHFEVAGRPKHI YS I YKKMVKKKLSFDGLFDI RAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

mill -1. pep TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHOFNEFGV 
I | I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
all*?-! TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 



370 380 390 400 410 420 

mll7-i .pep AAHVJRYKEGGKGDSAYEQKIAWLROLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 

i 1 1 1 1 1 j 1 1 r 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

dlll-l AAHVJRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
370 380 390 400 410 420 

430 440 450 460 470 480 

iain-1. pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEI ITAKEGHPS 
I I I I 1 1 I I I I I I 1 1 I I I I I I I I I I I I I ! I I I I I I I I I II I I I 1 I I ! I I I I I I ! I II I I I I 
all7-i GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEI ITAKEGHPS 

430 440 450 460 470 480 



490 500 510 520 530 540 

mll7-l.pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVOLDKOLAKLTPKPNLQELAENLGY 

I || I 1 1 1 I I 1 1 I M II 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 1 1 1 I I 1 1 I I 1 1 1 II III II 1 1 
all7-l VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
490 500 510 520 530 540 

550 560 570 580 590 600 

mll7-l.pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGVLIDGED 

llllltl llll IIIIIIIIII llllll HIM IMIIIIIIIIMI llllllllllllll 
all7-l KKPEDLYTAVGOGEISNRAIQKACGTLWEPPPVPVSETTIVKQSKI KKGGKNGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 17 - 1 . pep GLMTTLAKCCKPAPPDDI IGFVTRERGI SVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I I I I I I I I I I I I I I I I : I I I M : i I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I 
all7-l GLMTTLAKCCKPAPPDDIVGFVrRDRGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 



670 680 690 700 710 720 

mll7-l.pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMFJTLEVKQVNDLPR 
I | | I | i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I : I I I I 
a 11 7 - 1 QVFAVDIEIRA0DRSGLLRDV5DALARIIKLNVTAVQTQSRDLEASMRFTLEVKQVTDLPR 

670 680 690 700 710 720 



730 

mll7-l.pep VliASLGDVKGVLSVTRLX 
I I I I I I I I I I I I I I I I I I 
a 11 7-1 VLASLGDVKGVLSVTRLX 
730 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 423>: 

ATCCCTTGTT TTGAAGAGTA 
TGATGACGGG GTGTGGGATG 
TAATcgaGGT TAGGAGAAAA 
ATTGTGATTG GAATCGGTAC 
GGAGCTTTTT GAAATTAAAG 
TTCATGAACG TTATGAAAGA 
GAGAAAGACA TAGTCAACGT 

This corresponds to the amino acid sequence <SEQ ED 424; ORF 1 18.ng>: 

gll8.pep 

1 MCEFKDFRRN I PC FEE Y DEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRRK 
51 YFYPMDIPRD IVIGIGTIIP FLMVPNW ELF EIKASPWLPD SVGIHERYER 
101 FTTMLRYIFT EKDTVNVRFD YYNKK* 



gll8 . seq 








1 


ATGTGCGAGT 


TCAAGGATTT 


TAGAAGAAAC 


51 


TGACGAAAAT 


TCATTTATTG 


GCAAATGGTA 


101 


AT GAAGAAT A 


TTGGAAGCTG 


GAGAATGATT 


151 


TATCCTTATC 


CGATGGATAT 


ACCAAGGGAT 


201 


CATTATTGAT 


TTTTTAATGG 


TTCCAAATTG 


251 


CTTCCCCTTG 


GTTGCCTGAT 


AGCGTGGGAA 


301 


TTCACAACGA 


TGCTCCGTTA 


TATTTTTACC 


351 


GCGATTTGAT 


TATTACAaCA 


AAAAATAG 
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The following partial DNA sequence was identified in K meningitidis <SEQ ID 425>: 

mll8 . seq 

1 ATGTGTGAGT TCAAGGATAT TATAAGAAAC GTTCCTTATT TTGAGGGGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGTTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGACAT ACCAAGATAT GTTGTCATTG GAATCGGTAC 

201 CATTATTGAT TTCTTAATGG TTCCAAATTG GAAACTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 426; ORF 1 1 8>: 

mil 8. pep 

1 MCEFKDIIRN VPYFEGYDEK SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRY WIGIGTIID FLMVPNW KLF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 118 shows 92.8% identity over a 125 aa overlap with a predicted ORF (ORF 1 18.ng) 
from N. gonorrhoeae: 

mll8/gll8 

10 20 30 40 50 60 

ml 18 . pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY 

Mill!: ||:| II I I M I I I I I I I i t I I I II I 1 ! I I I I I I i I I I t : I I I I I I I I I I 
gll8 MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRRKYPYPMDIPRD 

10 20 30 40 50 60 

70 80 90 100 110 120. 

mll8.pep WIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: I I I I I I I II I f I I If I : I I I I! I I I I I I I I I I I I II I I M t I I i ! i I I I ! I I I I I I I II 
gll8 IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

70 80 90 10C 110 120 



ml 18. pep YYNKKX 
I I I I II 

gll8 YYNKKX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 427>: 

all8 . seq 

1 ATGTGTGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAATTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 428; ORF 1 18.a>: 

all8.pop 

1 MCEFKDFRRN I PC FEE Y DEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNWE LF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 



mll8/all8 93.6% identity in 125 aa overlap 

10 20 30 40 50 60 

mll8.pep MCEFKDI I RN V P YFEGYDENS F I GKW Y D DGVW DDEE YWKLEN DL I E VRKKY PY PM D I PRY 

I I I t I I : II:! I I I I I i I i I I I I I I I I I I I I I I I I ! I I I I I I II M 1 I I I I I I I I I 
al 1 8 MCEFKDFRRN I PCFEEYDENS FIGKWYDDGVWDDEEYWKLENDL IEVRKKYPYPMDI PRD 

10 20 30 40 50 60 
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70 80 90 100 110 120 

mll8.pep VVIGIGTI I DFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYI FTEKDI VNVRFD 

: I I I I I I I I II I I I I II : I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I | I M I I I I ■ II I 
all8 IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYI FTEKDI VNVRFD 

70 80 90 100 110 120 



ml 18. pep YYNKKX 
I I I I I I 

all 8 YYNKKX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 429>: 

gl20. seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAAGGCTACC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAATCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTGCCTACT 

251 ATAAAGACAT ACGCAGGGGC AAACTG TATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AG CGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

4 01 CGAAACTCCC CCCGGGTCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

451 GTCGGCGGCC TGAATAAGGC GGGTACGGGA AAATACAGCA Taggcggcgt 

501 gGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATACGGTAA 

551 CGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGAcgaCG GCAAAACCTA TACGCTGAAG CTCAAATCGG TGCAGATCAA 

651 CGGACAGGCC GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 430; ORF 120.ng>: 

gl20.pep 

1 MMKTFKNIFS AAILSAALPC AYA ARLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PAYYKDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETB WKYRVRRGD DTVTYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 43 1>: 

m!20. seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGmACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTG TATG CGGAAGCcAA ATTCGCCGAC 

301 GGCAGCGTAA CTTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

451 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCG 

This corresponds to the amino acid sequence <SEQ ID 432; ORF 120>: 

ml 20. pep 

1 MMKTFKNIFS AAILSAALPC AYA AGLPQSA VLXYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETB WKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A r . gonorrhoeae 

ORF 120 shows 97.3% identity over a 223 aa overlap with a predicted ORF (ORF 120.ng) 
from AT. gonorrhoeae: 

ml20/gl20 

10 20 30 40 50 60 

ml2 0 . pep MMKTFKNI FSAAILSAALPCAYAAGLPQSAVLXYSGS YGI PATMTFERSGNAYKIVSTIK 

ii 1 1 ii iii i ii mi i ii i ii it i in n i ii ii ii ii him ii milium i 

gl20 MMKTFKNI FSAAILSAALPCAYAARLPQSAVLHYSGS YGI PATMTFERSGNAYKIVSTIK 

10 20 30 40 50 60 

70 .80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 = 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 f I f ! 1 1 f 1 1 1 1 1 1 1 1 1 

g 12 0 VPLYNIRFESGGTVVGNTLHPAYYKDI RRGKLYAEAKFADGSVTYGKAGES KTEQS PKAM 

70 80 90 100 . 110 120 

130 140 150 160 170 180 

ml2 0 . pep DLFTLAWQliAANDAKLP PGLKI TNGKKLYSVGGLNKAGTGKYS I GGVETEWKYRVRRGD 

1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j r 1 1 1 1 1 1 1 1 1 1 f i f i r 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 

g 1 2 0 DLFTLAWQLAANDAKLP PGLKI TNGKKLYSVGGLNKAGTGKYS I GGVETEWKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ttll20 - pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKP 

hi I M I M I i 1 1 1 1 1 ! 1 1 1 f 1 1 1 1 1 1 i 1 1 M M 1 1 1 1 M 1 1 

g 120 DTVTYFFAPSLNNI PAQ I GYTDDGKT YTLKLKSVQI NGQAAKPX 

190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 433>: 

al20. seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCG TATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

4 51 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 434; ORF 120.a>: 
al20.pep 

1 MMKTFKNIFS AAILSAALPC AYA AGLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDI RRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYFFAPS LNNI PAQIGY 

201 TDDGKTYTLK LKSVQ1NGQA AKP* 



CGGAAGCCAA ATTCGCCGAC 
AGCAAAACCG AGCAAAGCCC 



ml20/al20 



99.6% identity in 223 aa overlap 



10 20 30 40 50 60 

ml20.pep MMKTFKNI FSAAILSAALPCAYAAGLPQSAVLXYSGS YGI PATMTFERSGNAYKIVSTIK 

I I I I I I I I II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 1 
a!20 MMKTFKNI FSAAILSAALPCAYAAGLPQSAVLHYSGSYG3 PATMTFERSGNAYKIVSTIK 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTWGNTLHPTYYRDIR1*GKLYAEAKFADGSVTYGKAGESKTEQSPKAM 
II 1 I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I i I I I I I I M I I 
al20 VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml20 . pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEWKYRVRRGD 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I 
al20 DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLMKAGTGKYSIGGVETEWKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml20 . pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I i I I I I I I 
al20 DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 435>: 

gl21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

401 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 

601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 

651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

7 01 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc 

751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtar tgcggacgct 

801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 436; ORF 121.ng>: 

gl21.pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFT? YPDRLRRKLL 

51 DLQDTGTDEL HRSRWLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRVVLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGN3LP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ TD 437>: 

ml21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 
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151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTAT 


251 


fITCAAAACCT 


CGCACCGTCC 


301 


ACCGTCCGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGxxxxxxx 


401 


xxxxxxxxxx 

A AAAAAAAAA 


xxxxxxxxxx 


451 


xxxxxxxxxx 


xxxxxxxxxx 


501 




xxxxxxxxxx 


551 


xxxxxxxxxx 


xxxxxxxxxx 


601 


xxxxxxCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACT3CTCG 


701 


AACGCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGTTTT 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTG3CAGA 


951 


CACCGCCGAC 


CTGAACCTCG 


1001 


CGTGGTTGGC 


GGCGT3TTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 
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AGACGAACTG CACCGCAGCA GGATTTTGTC 
ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACGGTTAC AGCATACAGC TTGCCGATTT 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
CAAAAACGGT GCAAAGTCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGACG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTGGAAGCC GCCGnATTTG 
ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG AnCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 438; ORF 121>: 

ml 2 1 . pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLC SQN LAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 XXQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

Computer analysis of the amino acid sequences gave the following results: 

Homology with a predicted ORF from N. meningitidis menA with menB 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 

from N. gonorrhoeae: 

ml21/gl21 



10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I II I I I I I I I I I I I I I I I : I I I I I I II I I I I I I I I I I I I I I I : I I I I I I I I : I I I 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I |:| I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: : I I I I I 11 I I I : I I I I I I I I I I I I I I I I ! I : I I I I I I 

gl21 PAFGFDTGPGNMLMDAWTOAHWQLPYDKNGAKAACGNILPOLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21 . pep GRELFA1NWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
t I II I I : I I I I I I E I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I it 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 
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310 320 330 340 350 360 

ml 21. pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I 
g 121 LMADLAE CFGTRV S LH 5 TAELN L DPQWVEAAA FAW LAAC W I NR I PG SPHKATGAS K PCI L 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 439>: 

al21.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC G TACT GAT AC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC G AC ATT AC CG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

7 01 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

7 51 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT AT TAT TAT TG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 440; ORF 121.a>: 
al21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAP3 DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21/al21 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

m!21.pep METOLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I I I 1 1 It I , I I I I I I I I I I I I I I I I 1 1 I I I I I I I It I I \ I I I I : i 1 1 I I I I I I I I I 
al21 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml21 . pep HRSRILSOELSRLYAOTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
1111:1111111 IIIIIIIIMIIIIilMII HUM illllll Ml: 11:11111111 
al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDrTALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml 21. pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

a!21 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDEOIGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: I I I I I I I I I I : II I I I I I I I I I I I I 11 I I I I I I I I I I 

al21 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAOTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I I I : I I I I I I I I I I I I I 1 I I I M I I I I I I i I I I I I I I I I I II I I I II I I II I I I I 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m!21 . pep LMA DLAECFGTRVSLHS T AD LN LD PQW VE AAX FAWLAAC W I NR I ?G S P HKATG A SK P C I L 

I I I I I I I I I I I I I II I I I I : I I I I I I I II I I I I I : I ! I I: I II I I If I I I I I I I I I I I I 
a 121 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml 21. pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 441>: 

ml21-l.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCA7G TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 RCCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 442; ORF 121- 1>: 

ml21-l.pep 

1 NETQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLC SQN LAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRI FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AX FAWLAAC W INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 95.6% identity in 366 aa overlap 



10 20 30 40 50 60 
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ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
1 | I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I II I I I I I : I I I I M I 1 : 1 1 I 
gl2 1 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

I I I I : I I I I I I t I ! I I I i I 1 I I i I I I I i I I I I I I I I I I I I I I i 1 I I ! I I t I I I I I I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPL.L 

70 80 90 100 110 120 

130 140 150 160 170 180 

nl 2 1 - 1 . pep AERTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGI AN I SVLPPDA 

II | | | I I ! i I I I ( I t i 1 t I I I I I I I I I I I I I I I I I I :( t i I: ! I i I I I I I I I I I I I I I 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

H121-1 . pCp PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I t 
gl21 pafgfdtgpgnmlmdawtqahwqlpydkngakaaqgnilpqllgrllahpyfsqphpkst 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml2l-l . pep grelfalnwletyldggenrydvlrtlsrftaqtvcdavshaaadarqmyicgggirnpv 
I I I I I I I I I I I I I I I I I 1 I I M I I I I I I I M I I I I I I I I I 1 I i I I I I I I I I I I I I I I I I 

gi2i grelfalnwletyldggenrydvlrtlsrftaqtvwdavshaaadarqmyicgggirnpv 

250 260 270 200 290 300 

310 320 330 340 350 360 

ml21-l . pep LMADLAEC FGTRV S LH ST ADLN LDPQWV EAAX FAWL AACW I NRI PG S PHKAT GAS K PC I L 
I I I I I I 1 I I I I I I I I I I I I :| I I I I I I I I I I I I t I I I 1 I I I ! II I I I I I I I i I I I I II I 
gl21 LMADLAEC FGTRVSLH ST AELNLDPQWVEAAA FA WLAACWI NRI PGSPHKATGASKPC I L 

310 320 330 340 350 360 



ml 2 1 - 1 . pep XAGYYYX 
I I I I I I 

g!21 GAGYYYX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 443>: 

al21-l.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 444; ORF 121-l.a>: 

al21-l.p«p 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 
51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 
101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 
151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 
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201 HWQLPYDKNG AKAAQGNILP QLLDRLLAKP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 



ml21-l/al21-l 96.4% identity in 366 

10 20 
n\121-l . pep METQLYIGIMSGTSMDGADAVLI 
. I I I I I I I I I I I I I I I I I I I I I I i 
al21~l METQLYIGIMSGTSMDGADAVLI 

10 20 



aa overlap 

30 40 50 60 

RM DGGKWLG AEGH A FT P YPG RLRRQLL DLQDTGADE L 
I I I I I I I I I I I I I I I I I I I I 1 1 I I : I I I I I I I I I I I I 
RMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

30 40 50 60 



70 80 90 100 110 120 

ral21-l . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

I I I I : I I I I I ! I I I I I I I I I I t I I I t I I I I I I I i I I I ! I I I I : I I : I I I I I I I I 

al21-l H RS RML S QE LS RL Y AQT AAEL LC S QNLAPS D I T ALGCHGQTVRH APEH S Y S V QLADL P LL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml2 1 - 1 . pep A5RTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGI ANISVLPPDA 
I I I I : I I I I I I I I I I I I I I I I I I I I I I I I II I I I I! : I I I I I II I I I I It 1 I I I ! I I I I 
a 12 1-1 AERTQI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANI SVLPPDA 

13U 140 150 160 170 180 

190 200 210 220 230 240 

ml2 1 -1 . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I I I I I I I I I I I I I I I I I I I I II I I t I I I I I I I I M I I I I I I I I I I I I I I I I I I I II 
al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

Itll21-1 . pep GRELFALNWLETYLDGGENRYDVLRTLSRFT AQT VCDAVSHAAADARQMY ICGGGIRNPV 
I I I I I I I M I I I I I i I I I i I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 12 1-1 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRI PGSPHKATGASKPCIL 
I I I I I I I II I I I I I I I I I I : I I I 1 I I I I \ I II I : I I I I : I II I I I I I I I I I I I I I I II 
al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRI PGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 

a 121 GAGYYYX 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 445>: 

gl22.seq 

1 ATGGCTTTAC TGAGCATCCG CAAGCTGCAC AAACAATACG GCAGCGTAAC 

51 CGCCATCCAA TCCTTAGACT TGGACTTGGA AAAAGGCGAA GtcatCGTAC 

101 TGCTGGGCCC gTccggctgc ggCAAATCCA CCCTcctgcg ctgcgtcaaC 

151 GGTTTGGAGC CGCACCAagg cgGCAGCATC GTGATGGACG GTgtcgGCGA 

201 ATTCgg c AAA GACGTTTCCT GGCAAACCGC CCGGCAAAAa gtcggtatgg 

251 tctttcaaag taacgAactg Tttgcccaca* tgaccgtcat cgAaaacatc 

301 ttcttAggcC CGGTAAagga aCAAAAcCgc gaccgtgccg aagcaGAGGC 

351 gCAAGCCGGC AAactGttgg aacgcgTCGG actgctAGAC CGCAAAAACG 

4 01 CCTATCCGCG CGAACTTTCC GGCGGTCAGA AACAGCGCAT CGCCATTGTC 

451 CGCGCCCTGT GCCTGAATCC GGAAGTCATC CTGCTGGACG AAATCACCGC 

501 CGCACTTGAC CCCGAAATGG TGCGCGAAGT CTTGGAAGTG GTTTTGGAAC 

551 TCGCCCGCGA AGGGATGAGT ATGCTCATCG TAACCCACGA AATGGGGTTC 

601 GCACGCAAAG TTGCCGACCG CATCGTCTTT ATGGACAAAG GCGGCATCGT 

651 CGAATCGTCC GACCCCGAAA CCTTTTTTTC CGCACCAAAA AGCGAACGCG 

701 CCCGCCAATT TCTGGCAGGT ATGGACTACT GA 

This corresponds to the amino acid sequence <SEQ ID 446; ORP 122.ng>: 
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g!22 .pep 

1 MALLSIRKLH KQYGSVTAIQ SLDLDLEKGE VIVLLGPSGC GKSTLLRCVN 

51 GLEPHQGGSI VMDGVGE FGK DVSWQTARQK VGMVFQSNEL FAHMTVIENI 

101 FLGPVKEQNR DRAEAEAQAG KLLERVGLLD RKNAYPRELS GGQKQRIAIV 

151 RALCLNPEVI LLDEITAALD PEMVREVLEV VLELAREGMS MLIVTHEMGF 

201 ARKVADR I VF MDKGGIVESS DPETFFSAPK SERARQFLAG MDY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 447>: 

ml22 .seq 

1 GTTGTCATGA TTAAAATCCG 

51 TATTTTGCGC GGCATCGATT 

101 TCCTCGGGCC TTCCGGCTCA 

151 GCGTTGGAAA TGCCCGAAGA 

201 GCTGAAAATC GATTTTTCTA 

251 TGCGCCGCAA ATCAkGCATG 

301 AAAACCGCCT TGGAAAACGT 

351 GCCTGCCGCC CAAGCGCGCG 

4 01 GCTTGGGCGA CAAAGTGGAT 

4 51 CAGCAGCGCG TCGGCATTGC 

501 GCTGTTTGAC GAACCGACTT 

551 TTTTGGATmC CATGAAGGAA 

601 GTTACGCATG AAATCAAGTT 

6 51 GATGGACrGC GGCGTTATTG 

701 ACCACCCCAA ACACGAACGG 

751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 448; ORF 122>: 

ml 2 2 .pep 

1 WMIKIRNIH KTFGENTILR GIDLDVCKGQ VWILGPSGS GKTTFLRCLN 
51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSXM VFQQYNLFPH 
101 KTALENVMEG PVAVQGKPAA OAREEALKLL EKVGLGDKVD LYPYQLSGGQ 
151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLDXMKE LAQEGWTMW 
201 VTHEIKFALE VATTXWMDX GVIVEQGSPQ DLFDHPKHER TRRFLSQIQS 
251 TKI* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 122 shows 47.2% identity over a 246 aa overlap with a predicted ORF (ORF 1 22 .ng) 
from N. gonorrhoeae: 

ml22/gl22 

10 20 30 40 50 60 

WMI KIRNIHKTFXjENTILRGIDLDVCKGQVWIIjGPSGSGKTTFLRCIjNALEMPEDGQI 
:::::||::|| :| | ::::|||> IhhMIIM ||:|:||h|:|| : hi 
MAl^SIRKLHKQYGSVTAIQSLDLDl^KGEVIVLLGPSGCGKSTLLRCVNGLEPHQGGSI 
10 20 30 40 50 60 

70 80 90 100 110 120 

E FDNE R PL KI D FSKKPS KHDI LALRRKSXMV FQQYNLFPHKTALENVMEG PVAVQGKPAA 

= |: : I I : : hi lllh =11 I l-ll- III |:: I 

VMDGVGEFGKDVSWOTA RQKVGMVFQSNELFAHMTVI ENI FLGPVKEQNRDRA 

70 80 90 100 110 

130 140 150 160 170 180 

QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 
:h :| I I I I I M h = II : I I I I I * I I : : I : I I I -ll-hll hllllh 
EAEAQAGKLI^RVGLLDRKNAYPRELSGGQKQRIAIVRALCI^PEVILIJDEITAAIJ5PEM 
120 130 140 150 160 170 

190 200 210 220 230 240 

VQDVLDXMXELAQEGWTMVWTHEI KFALEVATTXVVMDXGVI VEQGS PQDLFDHPKHER 

l-ll: = llhll H-lllh II :|| I II | ||h>:|: :| = II II 
VREVLEWLELAREGMSMLIVTHEMGFARKVADRIVFMDKGGIVESSDPETFFSAPKSER 



CAATATCCAT AAGACCTTTG GCGAAAACAC 
TGGATGTGTG CAAAGGGCAG GTGGTCG TCA 
GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 
CGGACAAATC GAGTTCGACA ACGAGCGACC 
AAAAACCAAG CAAACACGAT ATTTTGGCAC 
GTGTTTCAAC AATACAAyCT CTTTCCGCAC 
AATGGAAGGA CCGGTTGCCG TACAgGGCAA 
AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 
TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 
CCGCGCATTG GCGATTCAGC CTGAACTGAT 
CCGCGCTCGA TCCTGAATTG GTGCAAGATG 
TTGGCGCAAG AAGGCTGGAC CATGGTTGTC 
CGCCTTAGAA GTGGCAACCA CCGwCGTCGT 
TCGAACAAGG CAGCCCGCAA GATTTGTTCG 
ACG CGGAGAT TTTTAAGCCA AATCCAATCT 



ml 22 .pep 
gl22 

ml22 .pep 
gl22 

ml22 .pep 
gl22 

m!22 .pep 
gl22 
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180 190 200 210 220 230 

250 

ml22 . pep TRRFLSQIQSTKIX 
=1=11= 

gl22 ARQFLAGMDYX 
240 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 449>: 

al22.seq 

1 GTTGTCATGA TTAAAATCCG CAATATCCAT AAGACCTTCG G C AAAAAT AC 

51 CATTTTGCGC GGCATCAATT TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 

101 TCCTCGGGCC TTCCGGCTCA GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 

151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC GAGTTCGACA ACGAGCGACC 

201 GCTGAAAATC GATTTTTCTA AAAAACCAAG CAAACACGAT ATTTTGGCAC 

251 TGCGCCGCAA ATCAGGCATG GTGTTTCAAC AATACAACCT CTTTCCGCAC 

301 AAAACCGCCT TGGAAAACGT GATGGAAGGA CCGGTTGCCG TACAGGGCAA 

351 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 

401 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 

4 51 CAGCAGCGCG TCGGCATTGC CCGAGCATTG GCGATTCAGC CCGAGCTGAT 

501 GTTGTTTGAC GAACCCACTT CCGCGCTTGA CCCCGAGTTG GTGCAAGACG 

551 TGTTGAACGC CATGAAGGAA TTGGCGCGGG AAGGTTGGAC GATGGTCGTC 

601 GTTACCCACG AAATCAAGTT CGCGCTGGAA GTTGCCACGA CCGTTGTCGT 

651 GATGGACGGC GGCGTTATCG TAGAGCAGGG CAGCCCGAAA GAGTTGTTCG 

701 ACCACCCCAA ACACGAACGG ACGCGGAGAT TTTTAAGCCA AATCCAATCT 

751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 450; ORF 122.a>: 

al22.pep 

1 VVMIKIRNIH KTFGKNTILR GINLDVCKGQ VWILGPSGS GKTTFLRCLN 

51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD I LALRRKSGM VFQQYNLFPH 

101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLS GGQ 

151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLNAMKE LAREGWTMW 

201 VTHEIK FALE VATTVWMDG GVIV EQGSPK ELFDHPKHER TRRFLSQIQS 

251 TKI* 

ml22/al22 96.0% identity in 253 aa overlap 

10 20 30 40 50 60 

ml22.pep VVMIKIRNIHKTFGENTILRGIDLDVCKGQWVILGPSGSGKTTFLRCLNALEMPEDGQI 
I I I II I I I I I I t II : I I I I I I I : I I I I I I I I 1 I I II I I I I I I I I I I I I I I I I I I I I ! I II 
al22 VVMIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 22 . pep EFDNERPLKI DFSKKPSKHDILALRRKSXMVFQQYNLFPHKTALENVMEG PVAVQGKPAA 

I I I I II I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I 
al22 EFDNERPLKI DFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml22 . pep QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 

II I I I I I 1 I I I II I 1 I 1 I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
al22 QAREEALKLLEKVGLGDKVDLYPYQLSGGQQORVGIARALAIQPELMLFDEPTSALDPEL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml22 . pep VQDVLDXMKELAQEGWTMVWTHEIKFALEVATTXVVMDXGVIVEQGSPQDLFDHPKHER 
Mill: 11111:1111 I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I :: I I [ I I i I I I 
al22 VQDVLNAMKELAREGWTMWVTHEIKFALEVATTVWMDGGVIVEQGSPKELFDHPKHER 

190 200 210 220 230 240 



ml 22. pep 



250 

TRRFLSQIQSTKIX 
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a!22 



I I I I I I I I I I I I I I 
TRRFLSQIQSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 1>: 



1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACCATTTT 

51 GCGCGGCATC GATTTGGATG TGGGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCGGGTAAA ACAACATTTC TGCGCTGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGCGC GGCCGTTACG 

201 CATTGATTTT TCCAAAAAAA CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAGTCCGG AATGGTATTC CAACAATACA ACCTCTTCCC GCATAAAACC 

301 GTGTTGGAAA ACGTGATGGA AGGGCCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTTTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGATAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGTGTCGGTA TCGCCCGCGC ACTGGCGATT CAGCCTGAAT TGATGCTGTT 

501 TGACGAACCC ACTTCCGCGC TGGACCCCGA GTTGGTGCAA GACGTGTTGG 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCACGCT GGAAGTTGCC ACGAACGTCG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

"701 TCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTGCCAAG 

"751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 452; ORF 122-1. ng>: 

gl22-l .pep 

1 MIKIRNIHKT FGENTILRGI DLDVGKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNARPLRIDF SKKTSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 VLENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDAMKELA REGWTMVWT 

201 HEIKFTLEVA TNWVMDGGV IVEQGSPKEL FDHLKHERTR RFLSQIQSAK 

251 I* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 453>: 

ml22-l .soq 

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACTATTTT 

51 GCGCGGCATC GATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTAATGGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGCGTCGGCA TTGCCCGCGC ATTGGCGATT CAGCCTGAAC TGATGCTGTT 

501 TGACGAACCG ACTTCCGCGC TCGATCCTGA ATTGGTGCAA GATGTTTTGG 

551 ATACCATGAA GGAATTGGCG CAAGAAGGCT GGACCATGGT TGTCGTTACG 

601 CATGAAATCA AGTTCGCCTT AGAAGTGGCA ACCACCGTCG TCGTGATGGA 

651 CGGCGGCGTT ATTGTCGAAC AAGGCAGCCC GC AAGATTTG TTCGACCACC 

701 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

T51 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 454; ORF 122-1>: 

nl 22-1 .pep 

1 MIKIRNIHKT FGENTILRGI DLDVCKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDTMKELA QEGWTMVWT 

201 HEIKFALEVA TTWVNDGGV IVEQGSPQDL FDHPKHERTR RFLSQIQSTK 
251 I* 

ial22-l/gl22-l 94. B% identity in 251 aa overlap 



gl22-l .seq 



ml22 



gl22 



1 



l.pep 



10 20 30 40 50 60 

MI K I RNIHKTFGENT I LRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNAL EMPEDGQIEF 
I I I I i I > ! I ( I I I I i I I I I I I I ! I I I I I II I I t I I I I I I I I I I I I I | | | | | | | | | | | | I 
MI KIRN I HKTFGENTILRGIDLDVGKGQVVVILGPSGSGKTTFLRCLNAL EMPEDGQIEF 
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10 20 30 40 50 60 

70 80 90 100 110 120 

n»122-l . pep DNERPLKIDFSKKPSKHDILALRRKSGKVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
II 111:111111 I I I I I 1 I I I 1 I I I li I I I I I I I I II I : I I I I I I II I I I I I I I I 1 ! I 
gl22-l DNARPLRIDFSKKTSKHDILALRRKSGKVFQQYNLFPHKTVLENVMEGPVAVQGKPAAQA 

70 80 90 100 110 120 

130 140 150 160 170 180 

nil 2 2- 1 . pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
I I I I I I I I I I I I I I I I II li II I I I I I I i M 1 II II I I I I I I I II I I I I I I I I I I I I I I I 
gl22-l REEALKLLEKVGLGDKVDLYPYOLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml22-l . pep DVLDTMKELAQEGWTMVVVTHEIKFALEVATTWVMDGGVIVEQGSPQDLFDHPKHERTR 

||||:lll!l:lllllll lllllll:|MII:|IHIIIIHIIIII::IHI Mill 
g 12 2-1 DVLDAMKELAREGWTMVWTHEIKFTLEVATNVVVMDGGVIVEQGSPKELFDHLKHERTR 

190 200 210 220 230 240 

250 

ral22-l.pep RFLSQIQSTKIX 
11111111:111 
g!22-l RFLSQIQSAKIX 
250 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 455>: 

al22-l. seq 

1 ATGATTAAAA TCCGCAATAT CC AT AAGACC TTCGGCAAAA ATACCATTTT 

51 GCGCGGCATC AATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTGAT GGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGCGTCGGCA TTGCCCGAGC ATTGGCGA1T CAGCCCGAGC TGATGTTGTT 

501 TGACGAACCC ACTTCCGCGC TTGACCCCGA GTTGGTGCAA GACGTGTTGA 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCGCGCT GGAAGTTGCC ACGACCGTTG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 456; ORF 122-l.a>: 

a!22-l.pap 

1 MIKIRNIHKT FGKNTILRGI NLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSG3QQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLNAMKELA REGWTMVWT 
201 HEIKFALEVA TTWVMDGGV IVEQGSPKEL FDHPKHERTR RFLSQI3STK 
251 I* 

al22-l/ml22-l 97.2% identity in 251 aa overlap 

10 20 30 40 50 60 

al22-l .pep MIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 

I I | | | I I I i I I I : f I 1 I I I I : I I I t I I I I I M I I I I I ! I I I I I I 1 I I I II I I I I I I I i 
ml22-l MIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQ1EF 

10 20 30 40 • 50 60 

70 80 90 100 110 120 

al22-l - pep DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 

i in ii li I M 1 1 1 1 1 li 1 1! i : 1 1 li li 1 1 li i ill 1 1 li li ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ml22-l DNERPLKIDFSKKPSKHDILALRRKSGMVFQOYNLFPHKTALENVMEGPVAVQGKPAAQA 
70 80 90 100 110 120 

130 140 150 160 170 180 

al22-l . pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFOEPTSALDPELVQ 

I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1 

ml 22-1 REEALKLLEKVGLGDKVDLYPYQLSGGOQQRVGIARALAIQPELMLFDEPTSALDPELVO 
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130 140 150 160 170 180 

190 200 210 220 230 240 

al 22-1 .pep DVLNAMKELAREGWTMWVTHEIKFALEVATTWVMDGGVIVEQGSPKELFDHPKHERTR 
I I | : : I i I I I : I I I I I i I I I II I I I I I I I I I I 1 I I I I I I I I I I I I I I : : I I I I I I I I I I I 
ml 22-1 DVLDTMKELAQEGWTMWVTHEIKFALEVATTVWMDGGVIVEQGSPQDLFDHPKHERTR 
190 200 210 220 230 240 

250 

al22-l.pep RFLSQIQSTKIX 
lllllltlllll 
ml22-l RFLSQIOSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 7>: 

gl25.seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGGT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TCGCCCCCTT GGGCTGGCAG CGCGGTCTGG CGGCCCTGCT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GTGTGCGCCT GTCGTTCGGC AAATGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGTCGGCGC AacggTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACggc gaATC^TTTG TCTGGTGGGC ATTGGCAAAC GGCGCACTGA 

401 TCGTGCTGTG GCTGGTTTTC GGCGCACGCA GAACGGGCGG GCTGAAAACC 

451 GTTTCGATGC TGCTGATGCT GCTTGCCGTG TTGTGGTTGA GCGTCGAAGT 

501 GTTCGCTTCG TCCGGCACAA ACGCCGCGCC CGCCGTTTCA GACGGCATGA 

551 CCTTCGGAAC GGCAGTCGAA CTGTCCGCCG TCATGCCGCT TTCCTGGCTG 

601 CCGCTGGCCG CCGACTACAC GCGCCAAGCA CGCCGCCCGT TTGCGGCAAC 

651 CCTGACGGCA ACGCTCGCCT ATACGCTGAC CGGCTGCTGG ATGTATGCCT 

701 TGGGTTTGGC GGCGGCTCTG TTTACCGGAG AAACCGACGT GGCGAAAATC 

751 CTGTTGGGCG CGGGCTTGGG CATAACGGGC ATTCTGGCAG TCGTCCTCTC 

801 CACCGTTACC ACAACGTTTC TCGATACCTA TTCCGCCGGC GCGAGTGCGA 

851 ACAACATTTC CGCGCGTTTT GCGGAAATAC CCGTCGCTGT CGGCGTTACC 

901 CTGatccgca ccgtgcttgc cgtcatgctg cccgttaccg aatataaaaa 

951 cttcctgctg cttatccgct cggtatttgg gccgatggcg ggtggttttg 

1001 attgccgaCT TTTttgtctt AAAACGGCGT GA 

This corresponds to the amino acid sequence <SEQ ID 458; ORF 125.ng>: 



The following partial DNA sequence 



ml25 . seq 






1 


ATGTCGGGCA 


ATGCCTCCTC 


51 


TTGGTTCGGC 


GCGGCGGTAT 


101 


TTGCGCCTTT 


GGGCTGGCAG 


151 


GCCGTCGGCG 


GCGCGCTGTT 


201 


CGGACGCAGC 


TCGATGGAAA 


251 


CAGTGCTGTT 


TTCCGTGGCG 


301 


GTGATGATTT 


ACGCCGGCGC 


351 


GTGGGACGGC 


GAATCTTTTG 


401 


TTGTGCTGTG 


GCTGGTTTTC 


451 


GTTTCGATGC 


TGCTGATGCT 


501 


CTTTTCCACG 


GCAGGCAGCA 


551 


TCGGAACGGC 


AGTCGAGCTG 


601 


CTTGCCGCCG 


ACTACACGCG 


651 


GACGGCAACG 


CTCGCCTACA 


701 


GTTTGGCAGC 


GGCGTTGTTC 


751 


CTGGGCGCAr 


GTTTGgGTGC 


801 


CGTTACCACA 


ACGTTTCTCG 



identified in N. meningitidis <SEQ ID 459>: 

TCCTTCATCT TCCTCCGCCA TCGGGCTGAT 
CGATTGCCGA AATCAGCACG GGTACGCTGC 
CGCGGTCTGG CGGCTCTACT TTTGGGTCAT 
TTTTGCGGCG GCGTATATCG GCGCACTGAC 
GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 
AATATGCTGC AACTGGCCGG CTGGACGGCG 
AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 
TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 
GGCGCACGCA AAACAGGCGG GCTGAAAACC 
GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 
CCGCCGCACA GGTTTCAGAC GGCATGAGTT 
TCCGCCGTGA TGCCGCTTTC CTGGCTGCCG 
CCACGCGCGC CGCCCGTTTG CGGCAACCCT 
CGCTGACCGG CTGCTGGATG TATGCCTTGG 
ACCGGAGAAA CCGACGTGGC AAAAAT CCTG 
GGCAGGCATT TTGGCGGTCG TCCTCTCCAC 
ATGCCTATTC CGCCGGCGCG AGTGCGAACA 
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851 


ACATTTCCGC 


GCGTTTTGCG 


GAAACACCCG 


TCGCTGTCrG 


CGTTACCCTG 


901 


ATCGGCACGG 


TACTTGCCGT 


CATGCTGCCC 


GTTACCGAAT 


ATGAAAACTT 


951 


CCTGCTGCTT 


ATCGGCTCGG 


TATTTGCGCC GATGGCGGgC 


GGTTTTGATT 


1001 


GCCGACTTTT 


TCGTCTTGAA 


ACGGCGTGA 






This corresponds to the amino acid sequence <SEQ ID 460; ORF 


125>: 


ml 25 .pep 












1 


MSGNASSPSS 


SSAIGLIWFG 


AAVSIAEIST 


GTLLAPLGWQ 


RGLAALLLGH 


51 


AVGGALFFAA 


AYIGALTGRS 


SMESVRLSFG 


KRGSVLFSVA 


NMLQLAGWTA 


101 


VMI YAGATVS 


SALGKVLWDG 


ESFWVJALAN 


GALIVLWLVF 


GARKTGGLKT 


151 


VSMLLMLLAV 


LWLSAEVFST 


AGSTAAQVSD 


GMSFGTAVEL 


SAVMPLSWLP 


201 


LAADYTRHAR 


RPFAATLTAT 


LAYTLTGCWM 


YALGLAAALF 


TGETDVAKIL 


251 


LGAXLGAAGI 


LAWLSTVTT 


TFLDAYSAGA 


SANNISARFA 


ETPVAVXVTL 


301 


IGTVUVVMLP 


VTEYENFLLL 


IGSVFAPMAG 


GFDCRLFRLE 


TA* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 125 shows 92.1% identity over a 343 aa overlap with a predicted ORF (ORF 125.ng) 
from iV. gonorrhoeae: 
ml25/gl25 

10 20 30 40 50 60 

MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLlJVPLGWQRGLtAAIiLLGHAVGGALFFAA 

llll I IM llhll IMMMIIMM MIMIIM MMIIII Ml IMIMMMIM 

MS GNAS S PS S S AA I GL VWFGAAVS I AE 1 S TGTLLAPLGWQRGLAAL LLGHAVGGAL FFAA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AY I GALTGRS SMES VRLSFGKRGS VLFS VANMLQI^GWTAVMI YAGAWSSAIiGKVLWDG 

I I I I I I I I I I I I I I I I I IN I II II I I I I I I I I I I I I I t I I I I : I I I I I I I I I I I I I I I 
AYIGALTGRSSMESVRLSFGKCGSVLFSVANMLOI^GWTAVMIYVGAWSSALGKVLWDG 

70 80 90 100 110 120 

130 140 150 160 . 170 179 

ESFWWAIJINGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQ--VS 

mmiimimiii MiiiMihiiiMiiiiiiiiiiiiiihii h-i-n n 

ESFVWWALANGALIVLWLVFGARRTGGLKTVSMLLI^IJ^VLWLSVEVFASSGTNAAPAVS 
130 140 150 160 170 180 

180 190 200 210 220 230 239 

DGMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAAT^ 

Mhllll II llll II II llllll II I hi IMIIIIMM MUM II llll MIMM 

DGMTFGTAVELSAVMP15WLPLAADYTRQARRPFAATL^ 

190 200 210 220 230 240 

240 250 260 270 280 290 299 

FTGETDVAKI LLGAXLGAAG I LAWLS TVTTT FLDA YS AGASANN I S AR FAETP V AVXVT 

IMIIIIIIIIIM II MIIIIMMIIIIIIhllMIIIIIIMIIII Mil II 

FTGETDVAKI LLGAGLGI TGI LAWLS TVTTT FLDTYS AGASANN I SARFAE I PVAVGVT 
250 260 270 280 290 300 

300 310 320 330 340 

LIGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 

II Mlllllimhllllll llhllMIIIMII hill 

LI RTVLAVMLPVTEYKNFLLLI RS VFGPMAGGFDCRLFCLKTAX 
310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 461 >: 

al25.seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACACTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTGCT TTTGGGTCAT 



ml 25 .pep 
gl25 

ml25 -pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml 25 .pep 
gl25 
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151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

20 i CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGCCGGCGC AACCGTCAGC TCCCCTTTGG GCAAAGTGTT 

351 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 

401 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACASGCGG GCTGAAAACC 

4 51 GTTTCGATGC 7GCTGATGCT GTTGGCGGTT CTGTG3CTGA GTGCCGAAGT 

501 CTTTTCCACG GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 

551 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC TTGGCTGCCG 

601 CTGGCCGCCG ACTACACGCG CCACGCGCGC CGCCC3TTTG CGGCAACCCT 

651 GACGGCAACG CTCGCCTACA CGCTGAC CGG CTGCT3GATG TATGCCTTGG 

701 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG 

751 CTGGGCGCAG GTTTGGGTGC GGCAGGCATT TTGGCGGTCG TCCTGTCGAC 

801 CGTTACCACC ACTTTTCTCG ATGCCTACTC CGCCGGCGTA AGTGCCAACA 

851 ATATTTCCGC CAAACTTTCG GAAATACCCA TCGCC3TTGC CGTCGCCGTT 

901 GTCGGCACAC TGCTTGCCGT CCTCCTGCCC GTTACCGAAT ATGAAAACTT 

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCG . GC GGTTTTGATT 

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 



This corresponds to the amino acid sequence <SEQ ID 462; ORF 125. a>: 

al25.pep 

1 MSGNASSPSS SAAIGLIWFG AAVSIAEIST GTLLAPLGWC RGLAALLLGH 

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSMLLMLLAV LWLSAEV FST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL 

251 LGAGLGAAGI LAVVL STVTT TFLDAYSAGV SANNISAKLS EIPIAVAVAV 

301 VGTLLAVLLP VTEYEN FLLL IGSVFAPMAX GFD CRLFRLE TA* 



ml25/al25 95.6% identity in 342 aa overlap 

10 20 30 40 50 60 

ml 2 5 . pep MSGNAS S PS S S S AIGLI WFGAAVS I AEI STGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

I I I I I I I I I I I : I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I 
al25 MSGNASSPSSSAAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 25 . pep AYIGALTGRS SMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG 

I I I II I I I I I II I II I I I I I I I I I I I I ! I I I I I I I I I I I I I II II II I I I I I I I I I I I 1 I 
al25 AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml25 . pep ESFVWWALANGAL I VLWLVFGARKTGGLKTVSMLLMLLAVliWLSAEVFST AGSTAAQVSD 

I I I I I I I I I I I II I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I ! I I I I I I It I I I I I I I I 
al25 ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml25 . pep GMS FGT AVE L S A VM P L S WL P LAA D YT RHARRP FAAT LT ATLA YT LT GC WM Y ALGL AAAL F 

I I I I I I II I I I I I I II I I I 1 I I I I II I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I 
al25 GMS FGTAVELSAVMPLSWLPLAADYTRHARRP FAAT LTATLAYTLTGCWMYALGLAMLF 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml25 . pep TGETDVAKILLGAXLGAAGIIAWLSTVTTTFLDAYSAGASANNISARFAETPVAVXVTL 
1 I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : II I I I I I : : : I I : I I I : : 
al25 TGET DV AK I L LG AG LG AAG I LA W LS T VTT T FL DAY S AG VS ANN I S AK L SE I P I A V AVA V 

250 260 270 280 290 300 



310 320 330 340 

ml 25 . pep IGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 
: I I: I i I : I I I I I I I I I I I I I I I I I II I I I i I I I I I M I I I I 
al25 VGTLLAVLLPVTEYEN FLLLIGSVFAPMAXGFDCRLFRLETAX 
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310 320 330 340 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 463>: 

gl26 . seq 

1 AtgccgtcTG AAaccCcaaa ggcACGCCGC CGGCTTTCAG ACGGCATCGC 

51 GTCCGACAAC CATACCAAAG AATCCATCAT GCTCACCctg tacggcGAAA 

101 CTTTCCCTTC GCGGCTGCTg ctcggcacgG cggcctacCC GAC CCCTGAA 

151 ATCCTCAAAC AATCCGTCCG AACCGCCCGG CCCGCGATGA ttaccGTCTC 

201 GCTGCGCCGC ACGGGATGCG GCGGCGAGGC GCACGGTCAG GGGTTTTGGT 

251 CGCTGCTTCA AGAAACCGGC GTTCCCGTCC TGCCGAACAC GGCAGGCTGC 

301 CAAAGCGTGC AGGAAGCGGT. AACGACGGCG CAAATGGCGC GCGAAGTGTT 

3 51 TGAAACCGAT TGGATAAAAT TGGAACTCAT CGGCGACGAC GACACCTTGC 

4 01 AGCCGGACGT GTTCCAACTC GTCGAAGCGG CGGAAATCCT GATTAAAGAC 
451 GGCTTCAAAG TGCTGCCTTA TTGCACCGAA GACCTGATTG CCTGCCGCCG 
501 CCTGCTCGAT GCGGGCTGTC AGGCGTTGAT GCCGTGGGCG GCTCCCATCG 
551 GCACGGGTTT GGGGGCGGTT CACGCCTATG CGCTCAAAAT CCTGCGCGAA 
601 CGCCTGCCCG ACACGCCGCT GATTATCGAC GCGGGCTTGG GTTTGCCTTC 
651 CCAAGCGGCA CAAGTGATGG AATGGGGTTT TGACGGCGTA TTGTTAAACA 
701 CCGCCGTTTC CCGCAGCGGC GACCCCGTCA ACATGGCGCG CGCCTTCGCA 
751 CTCGCCGTCG AATCCGGACG GCTGGCATTT GAAGCCGGGC CGGTCGAAGC 
801 GCGAACCAAA GCCCAAGCCA GCACGCCGAC AGTCGGACAA CCGTTTTGGC 

. 851 ATTCGGCGGA ATATTGA 

This corresponds to the amino acid sequence <SEQ ID 464; ORF 126.ng>: 

gl26 .pep 

1 MPSETPKARR RLSDGIASDN HTKES IMLTL YGETFPSRLL LGTAAYPTPE 

51 ILKQSVRTAR PAMITVSLRR TGCGGEAHGQ GFWSLLQETG VPVLPNTAGC 

101 QSVQEAVTTA QMAREVFETD WIKLELIGDD DTLQPDVFQL VEAAEILIKD 

151 GFKVLPYCTE DLIACRRLLD AGCQALMPWA APIGTGLGAV HAYALKILRE 

201 RLPDTPLI ID AGLGLPSQAA QVMEWGFDGV LLNTAVSRSG DPVNMARAFA 

251 LAVE S GRLAF EAGPVEARTK AQASTPTVGQ PFWHSAEY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 465>: 

ml26.seq (partial) 

1 . . CACTATACAA AGGAACCCAT TATGCTCACC CTA7ACGGCG AAACTTTCCC 

51 CTCGCGGCTG CTGCTCGGCA CGGCTGCCTA CCCGACCCCC GAAATCCTCA 

101 AACAATCCAT CCAAACCGCC CAGCCTGCGA TGA7TACCGT CTCGCTGCGC 

151 CGCGCGGGAA GCGGCGGCGA GGCGCACGGT CAGGGGTTTT GGTCGCTGCT 

201 TCAAGAAACC GGCGTTCCCG TCCTGCCGAA CACGGCAGGC TGCCAAAGCG 

251 TGCAGGAAGC GGTAACGACG GCGCAAATGG CGCGCGAAGT GTTTGAAACC 

301 GATTGGATAA AATTGGAACT CATCGGAGAT GACGACACCT TGCAGCCGGA 

3 51 TGTGTTCCAG CTTGTCGAAG CGGCGGAAAT CCTGATTAAA GACGGCTTCA 
401 AAGTGCTGCC TTATTGCACC GAAGACCTGA TTGCCTGCCG CCGCCTGCTC 

4 51 GACGCGGGCT GTCAGGCGTT GATGCCGTGG GCGGCCCCGA TCGGCACGGG 
501 TTTGGGCGCG GTTCACGCCT ACGCGTTGAA CGTCCTGCGC GAACGCCTGC 
551 CCGACACGCC GCTGATTATC GACGCGGGCT TGGGTTTGCC CTCACAGGCG 
601 GCACAAGTGA TGGAATGGGG CTTTGACGGC GTGCTTTTGA ATACTGCCGT 
651 TTCCCGCAGC GGCGATCCGG TCAATATGGC ACGCGCCTTC GCACTCGCCG 
701 TCGAATCCGG ACGGCTGGCA TTTGAAGCCG GACCGGTCGA AGCACGCGAC 
751 AAAGCGCAAG CCAGCACGCC GACAGTCGGA CAACCGTTTT GGCATTCGGC 
801 GGAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 466; ORF 126>: 

ml26.pep (partial) 

1 . . HYTKEPIMLT LYGETFPSRL LLGTAAYPTP EILKQSIQTA QPAMITVSLR 

51 RAGSGGEAHG OGFWSLLQET GVPVLPNTAG CQSVQEAVTT AQMAREVFET 
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101 DWIKLELIGD DDTLQPDVFQ LVEAAEILIK DGFKVLPYCT EDLIACRRLL 

151 DAGCQAIJ^PW AAPIGTGLGA VHAYALNVLR ERLPDTPLII DAGLGLPSQA 

201 AQVMEWGFDG VLLNTAVSRS GDPVNMARAF ALAVESGRLA FEAGPVEARD 

251 KAQASTPTVG QPFWHSAEY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 126 shows 95.9% identity over a 269 aa overlap with a predicted ORF (ORF 126.ng) 
from N. gonorrhoeae: 

ml26/gl26 



10 



20 



30 



40 



ml26.pep HYTKEPIMLTLYGETFPSRLL1K3TAAYPTPEILKQSIQTAQ 

-III illMIII IIIMIIIIII IN III II |: 
gl26 MPSETPKARRRLSDGIASDNHTKESIMLTLYGETFPSRLLLGTAAYPTPEILKQSVRTAR 
10 20 30 40 50 60 



50 60 70 80 90 100 

ml26 . pep PAMITVSLRRAGS3GEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

MM II I'll hi II II I II II III III III Mill II HIM hi III 1 1 1 1 III II II 

gl26 PAMITVSLRRTGCGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 
70 80 90 100 110 120 



110 120 130 140 150 160 

ml 2 6 .pep WIKLELIGDDDTLQPDVFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALKPWA 

1 1 1 1 1 1 1 1 1 1 1 1 i E 1 1 1 1 i 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f ] I E 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g 1 2 6 WI KLELIGDDDTLOPDVFQLVEAAEI LI KDGFKVLPYCTEDLI ACRRLLDAGCQALMPWA. 

130 140 150 160 170 180 



ml26 .pep 
gl26 



170 180 190 200 210 220 

AP I GTGLGAVHAYALNVLRERLPDTPL 1 1 DAGLGLPSQAAQVMEWGFDGVLLNTAVSRS G 

II M 1 1 II Ml M hH I Ml III III III Mil 1 1! 1 1 1 II Ml 1 , 1 1 1 MIMIIII 

API GTGLGAVHAYALKI LRERLPDTPL I IDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 
190 200 210 220 230 240 



230 240 250 260 * 270 

ml 2 6 . pep D PVNMARAFALAVE SGRLAFEAG P VE ARDKAQAS TP TVGQP FWHS AE YX 

1 1 1 E 1 1 1 1 i E 1 1 f i E f 1 1 1 1 1 1 E E E 1 1 1 I M II I II II i III I II II 

g!26 DPVNMARAFALAVESGRLAFEAGPVEARTKAQASTPTVGQPFWHSAEYX 
250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 467>: 

al26. seq 

1 TTGTTAATCC ACTATACAAA GGAACCCATT ATGCTCACCC TGTACAGCGA 

51 AACTTTCCCT TCGCGGCTGC TGCTCGGCAC AGCCGCCTAC CCGACCCCTG 

101 AAAT CCT CAA ACAATCCGTC CGAACCGCCC GGCCCGCGAT GATTACCGTC 

151 TCGCTGCGCC GCGCGGGATG CGGCGGCGAG GCGCACGGTC AGGGGTTTTG 

201 GTCGCTGCTT CAAGAAACCG GCGTTCCCGT CCTGCCGAAC ACGGCAGGCT 

251 GCCAAAGCGT GCAGGAAGCG GTAACGACGG CGCAAATGGC GCGCGAAGTG 

301 TTTGAAACCG ATTGGATTAA ACTCGAACTC ATCGGOGACG ACGACACCTT 

351 GC AG CCGG AT GTGTTCCAAC TTGTCGAAGC GGCGGAAATC CTGATTAAAG 

401 ACGGCTTCAA AGTGCTGCCT TATTGCACCG AAGACCTGAT TGCCTGCCGC 

4 51 CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG ATGCCGTGGG CGGCCCCGAT 

501 CGGCACGGGT TTGGGCGCGG TTCACGCCTA CGCGTTGAAC GTCCTGCGCG 

551 AACGCCTGCC CGACACGCCG CTGATTATCG ACGCGGGCTT GGGTTTGCCC 

601 TCACAGGCGG CACAAGTGAT GGAATGGGGC TTTGACGGCG TGCTTTTGAA 

651 TACTGCCGTT TCCCGCAGCG GCGATCCGGT CAATATGGCA CGCGCCTTCG 

701 CACTCGCCGT CGAATCCGGA CGGCTGGCAT TTGAAGCCGG ACCGGTCGAA 

751 GCACGCGACA AAGCGCAAGC CAGCACGCCG ACAGTCGGAC AACCGTTTTG 

801 GCATTCGGCG GAATATTGA 
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This corresponds to the amino acid sequence <SEQ ID 468; ORF 126.a>: 

al26.pep 

1 LLIHYTKEPI MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV 

51 SLRRAGCGGE AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV 

101 FETDWIKLEL IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR 

151 RLLDAGCQAL MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP 

201 SQAAQVMEWG FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE 

251 ARDKAQASTP TVGQPFWHS A EY » 



ml26/al26 98.1% identity in 269 aa overlap 

10 20 30 40 50 

ml26.pep HYTKEPIMLT L YG ET FPS R LL LGT AA Y PT PE I LKQ S I QT AQP AM I T VS LRRAG SG G E 

I | | I I I I I I I I I : I II I I I I II I I I I I I I I I I I I I I : : i I : I I I I I I I I I I I I III 
a 126 LL I H Y T KE P IMLT LYS ET FPS RLLLGTAAYPT PE I LKQ S VRT AR PAM I TVS LRRAGCGGE 

10 20 30 40 50 60 

60 70 80 90 100 110 

ml26.pep AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQI4AREVFETDWIKLELIGDDDTLQPD 
I | t I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I M I I I I II I II 
a 126 AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQI4AREVFETDWIKLELIGDDDTLQPD 
70 80 90 100 110 120 



120 130 140 150 160 > 170 

ml26 . pep VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I II I 
a 1 26 VFQLVE AAE I L I K DG FKVL P Y CTEDLIACRRLL DAGCQ ALM PW AA P I GT GLG AVHA Y ALN 

130 140 150 160 170 180 

180 190 200 210 220 230 

ml26 . pep vlrerlpdtpliidaglglpsqaaqvmewgfdgvllntavsrsgdpvnmarafalavesg 

1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
al26 vlrerlpdtpliidaglglpsqaaqvmewgfdgvllntavsrsgdpvnmarafalavesg 

ISO 200 210 220 230 240 



240 250 260 270 

ml26 .pep RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

i mi iiii iii iii iii ii ii ii nii ii i iii 

al26 RLAFE AGPVEARDKAQAST PT VGQP FWH S AE YX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 469>: 

gl26-l. seq 

1 ATGCTCACCC TGTACGGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 G3CCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCACGGGATG CGGCGGCGAG 

151 GZGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CSCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAC GTGTTCCAAC TCGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ATGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCTCCCAT CGGCACGGGT TTGGGGGCGG TTCACGCCTA 

501 TGCGCTCAAA ATCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCT TCCCAAGCGG CACAAGTGAT GGAATGGGGT 

601 TTTGACGGCG TATTGTTAAA CACCGCCGTT TCCCGCAGCG GCGACCCCGT 

651 CAACATGGCG CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG GCCGGTCGAA GCGCGAACCA AAGCCCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 470; ORF 126-1. ng>: 

gl26-l.pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRTGCGGE 
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51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALK ILRERLPDTP LIIDAGLGLP SOAAQVMEWG 

201. FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARTKAQASTP 

251 TVGQPFWHSA EY* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47 1>: 

ml26-l. seq 

1 ATGCTCACCC TATACGGCGA AACTTTCCCC TCGCGGCTGC TGCTCGGCAC 

51 GGCTGCCTAC CCGACCCCCG AAATCCTCAA ACAATCCATC CAAACCGCCC 

101 AGCCTGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGAAG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATT GG AT AAA ATTGGAACTC 

301 ATCGGAGATG ACGACACCTT GCAGCCGGAT GTGTTCCAGC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 



This corresponds to the amino acid sequence <SEQ ID 5; ORF 126-1>: 

ml26-l.pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSI QTAQPAMITV SLRRAGSGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 



ml26-l/gl26-l 96.9% identity in 262 aa overlap 

10 20 30 40 50 60 

ral2 6- 1 . pep MLTLYGETFPSRLLLGTAAYPTPEILKQS IQTAQPAMITVSLRRAGSGGEAHGQGFWSLL 
I I I t M I f I I I I I I I I I I I 1 I t I I I I I I I :: I I : I I I I I I I I i f : I I I I I I I I I I I I I I 
gl26-l MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGEAHGQGFWSLL 

10 ' 20 30 40 50 60 



70 80 90 100 110 120 

ml 2 6-1 . pep qetgvpvlpntagcqsvqeavttaqmarevfetdwikleligdddtlqpdvfqlveaadi 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I i I I ! I I I I f II II I I I I 
g 12 6-1 QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEr 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml26-l .pep likdgfkvlpyctedliacrrlldagcqalmpwaapigtglgavhayalnvlkeklpdtp 
I I I I I i I [ III I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I II I I I I I t 
g 12 6- 1 LI kdgfkvlpyctedli acrrlldagcqalmpwaapi gtglgavhayalki LRERLPDTP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml26-l.pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
I I I ! I I I I II I I I I I M I I I M 1 I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I 
g!26-l LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

190 200 210 220 230 240 



250 260 
ml2 6- 1 . pep ARDKAQASTPTVGQPFWHSAEYX 
I I I I I I I II II I II I I I I I I I I 
gl26-l ARTKAQASTPTVGQPFWHSAEYX 

250 260 



The following partial DNA sequence was identified in K meningitidis <SEQ LD 473>: 

al26-l.seq 

1 ATGCTCACCC TGTACAGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 AGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGATG CGGCGGCGAG 
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151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATTAA ACTCGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 474; ORF 126-1. a>: 

al26-l.pep 

1 MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRAGCGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIXLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

.201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 

al26-l/ml26-l 98.1% identity in 262 aa overlap 

10 20 30 40 50 60 

al2 6-l .pep MLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPANITVSLRRAGCGGEAHGQGFWSLL 
I I I I I r I I 1 1 I I I I I j I I I I I I I I i I I I I :: I I : I I I I I I I I I 1 I I I I I 1 1 I ] I I I I I I 
ml2 6-l MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPANITVSLRRAGSGGEAHGQGFWSLL 

10 20 30 40 50 60 

70 80 90 100 110 120 

a 12 6-1. pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETTDWIKLELIGDDDTLQPDVFQLVEAAEI 
( I I I I I I I I I I I 1 I I I I ! I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
ml 2 6- 1 QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWrKLELIGDDDTLQPDVFQLVEAAEI 
70 80 90 100 110 120 

130 140 150 160 170 180 

a 12 6- 1 . pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 

I i I I I I 1 1 II I I I I I I I II I 1 1 I I I I I I I 1 1 1 I I II I I 1 1 1 1 I I I I I! 1 1 1 1 I I I I I I I I 

ml 2 6- 1 LI K DGFKVLPYCTEDLI ACRRLLDAGCQALMPWAAPI GTGLGAVHAYALNVLRERLPDT P 

130 140 150 160 170 180 

190 200 210 220 230 240 

a 12 6-1 .pep LIIDAGLGLPSQAAQVMEWGFIXVLl^TAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I 
ml 2 6-1 LI IDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

190 200 210 220 230 240 

250 260 
a 12 6- 1 . pep ARDKAQASTPTVGQPFWHSAEYX 
I I I ! I I I I I I I I I I I I I I I I I I I 
ml2 6-1 ARDKAQASTPTVGQPFWHSAEYX 

250 260 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 475>: 
gl27 .seq 

1 ATGGAAATAT GGAATATGTT GAACACTTGG CCCGATGCCG TCCCGATACG 

51 CGCGGAGGCG GCCGAATCCG TGGCGGCGGT CGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAGAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATT CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACAAAAGAA 

3 01 CTGATTATGT GTCTGTCGGG CAGTATTTTA aggtctGCCA CCCAGCAATA 
351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

4 01 ACATCAATCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 
4 SI GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 
501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GT CATC CAT A 
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551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 TCAGCGGTAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGGCC GCGCGTTACC CGCGTACCGT ACGACGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCATC 

851 CCGCCGgctc cgAAACACTT TAA 

This corresponds to the amino acid sequence <SEQ ID 476; ORF 127.ng>: 

gl27 .pep 

1 MEIWNMLNTW PDAVPIRAEA AES VAAVAAL LLARALLLNI HFRRHPDFGI 

51 ESKRRFLVAS RNITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRY LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 477>: 

CTCGGTGCCG TCCCGATACG 
TGCGGCTTTG CTGCTGGCGC 
GGCATCCGGA TTTCGGCATC 



GACGAAGGAA 
CCCAGCAATA 
CGCGTGGTGG 
GAACCCCTTG 
GCCTGTTGTT 
GTCATCCATA 
AGCCGTATGC 
TCCCCGCCAT 



ml 2 7 . seq 








1 


ATGGAAATAT 


GGAATATGTT 


GGACACTTGG 


51 


TGCGGAGGCG 


GTCGAATCCG 


TGGCGGCGGT 


101 


GCGCCCTTCT 


GTTGAATATC 


CACTTCAAAC 


151 


GAAAGCAAGC 


GGCGGTTTTT 


GGTTGCCAGC 


201 


GGTGCTGTTT 


TCGCTGGCAT 


TTATCTGGTC 


251 


CTTTGTCGAT 


GTTTGCGGTG 


GCGGCGGCGG 


301 


CTGATTATGT 


GTCTGTCGGG 


CAGTATTTTA 


351 


CTCGGTCGGC 


GACTATATCG 


AAATCAACGG 


401 


ACATCAACCT 


GTTGAACACG 


CTGATGATGC 


451 


GTCGGACAGC 


TTGCGGGAAC 


CACCGTTTCT 


501 


GAGCCACCCC 


GTGCGCCGCG 


ACAATATTTT 


551 


CGGTCGAAAT 


CCCCGTTCCC 


ATCCATTTGG 


601 


CGTCTGAAAG 


CCGTACTCGA 


GCCCTTGTGC 


651 


CCAACGGsAT 


TTGGAAAACG 


TGCAGGCGGA 


701 


CCGCCAGACC 


GCGCGTTACC 


CGCGTGCCGT 


751 


ATCATCGTCC 


GCTTCGCTTC 


CCCCGTTTCA 


801 


GGCGGTTATG 


GACGAATTTT 


TGCGCGTACA 


651 


CCGCCGGCTC 


CGAAACACTT 


TAA 



This corresponds to the amino acid sequence <SEQ ID 478; ORF 127>: 

ml27 .pep 

1 MEIWNMLDTW LGAVPIRAEA VES VAAVAAL LLARALLLNI HFKRHPDFGI 

51 ESKRRFLVAS RNITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRX LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 1 27 shows 97.9% identity over a 290 aa overlap with a predicted ORF (ORF 127.ng) 
from N. gonorrhoeae: 
ml27/gl27 

10 20 30 40 50 60 

ml 27 . pep ME IWNMLDTWLGAVPIRAEAVESVAAVAALLLARALLLNI HFKRHPDFGI ESKRRFLVAS 

MINIMI II II 1 1 INI Mill MHII III II Ml Mi III I IN, i III 

g 1 2 7 ME I WNMLNTWPDAVPI RAEAAESVAAVAALLLARALLLNIHFRRKPDFG I ESKRRFLVAS 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml27 .pep RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVWATKELIMCLSGSIZiRSATQQYSVG 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

gl27 RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVWATKELIMCLSGSILRSATQQYSVG 
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70 80 90 100 110 120 

130 140 150 160 170 180 

ml27 . pep DYIEINGLRGRWDINLLNTLMMQVGPNPLVGQLAGTTVS FPNSLLLSHPVRRDNILGDY 

I I I I I I I I I I I t I I I I I I I I I I I I I I I II I II I I I I I I I I I I I M I I I I I I I I I I I I I I I 
gl27 DYIEI NGLRGR WD INLLNTLMMQVG PNPLVGQLAGTTVS FPNSLLLSHP VRRDNI LGDY 

130 140 150 160 170 180 

190 200 210 220 230 240 

VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIORXLENVOAEKLFITPAARPRVT 

II 1 1 11 1 1 1 II I II I II 1 1 1 1 II: III II I II 1 1 1 1 1 1 IIIMIMIIIIIIIIIIII 

V I HTVE I P VP I HLD SD E AVCRIiKAVL E PLCAP Y I PA I QR Y LENVQ AE KL F I TP AAR P RVT 
190 200 210 220 230 240 

250 260 270 280 290 

ml27 . pep RVPYDDKAYRI I VR FAS PVS KRLE I QQ AVMDE FLRVQ YRLLiNH P AGS ETLX 

I I I I I I I I ! I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl27 RVPYDDKAYRI I VRFA5 PVS KRLE IQQAVMDEFLRVQYRLLNHPAGS ETLX 

250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 479>: 

al27 . seq 

1 ATGGAAATAT GGAATATGTT GGACACTTGG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGTGGT 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG 

301 CTGATTATGT GTCTGTCGGG CAGCATTTTA 

351 CTCGGTCGGC GACTATATCG AAATCAACGG 

401 ACATCAACCT GTTGAACACG CTGATGATGC 

451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT 

501 GAGCCACCCC GTGCGCCGCG ACAATATTTT 

551 CGGTCGAAAT CCCGGTTCCC ATCCATTTGG 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC 

651 CCAACGGCAT TTGGAAAACG TGCAGGCGGA 

701 CCGCCAAACC GCGCGTTACC CGCGTGCCGT 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA 

851 CCGCCGGCTC CGAAACACTT TAA 

This corresponds to the amino acid sequence <SEQ ID 480; ORF 127.a>: 

a .127 .pep 

1 MEIWNMLDTW LGAVPIRAEA VES VAWAAL LLARALLLN I HFKRHPDFGI 

51 ESKRRFLVAS RN ITLLLVLF SLAFIWSA QI QTLALSMFAV AAAWVAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRKDNILGDY VIHTVEIPVP 1HLUSDEAVC 

201 RLKAVLEPLC APYIPAIQRH LENVQAEKLF ITPAAKPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNYPAGSETL * 

ml27/al 27 98.6% identity in 290 aa overlap 

10 20 30 4C 50 60 

ml27.pe P MRIWNMLDTWLGAVPIRAEAVESVAAVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS 
I I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I I I I II I I I I M I I I I I It I I I I I I I I I 
a!27 MEIWNMLDTV7LGAVPIRAEAVESVAVVAALLLARALLLNIHFKRRPDFGIESKRRFLVAS 

10 20 30 4C 50 60 

70 80 90 10C 110 120 

ml27 . pep RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVWATKELIMCLSGSILRSATQQYSVG 

1 1 1 ii 1 1 1 1 1 1 n ii 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 minim iiiiiiiim 

al27 RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQOYSVG 
70 80 90 10C 110 120 



ml27 . pep 
gl27 



CTCGGTGCCG 
CGCGGCTTTG 
GGCATCCGGA 
CGCAATATAA 
GGCGCAAATC 
TCGTCGTGGC 
AGGTCTGCCA 
CCTGCGCGGG 
AGGTCGGTCC 
TTCCCCAACA 
GGGCGACTAC 
ATTCGGATGA 
GCGCCCTACA 
AAAACTGTTT 
ACGATGACAA 
AAGCGGCTGG 
ATACCGCCTG 



TCCCGATACG 
CTGCTGGCGC 
TTTCGGCATC 
CGCTGCTTTT 
CAAACGCTGG 
GACGAAGGAA 
CCCAGCAATA 
CGCGTGGTCG 
GAACCCCTTG 
GCCTGTTGTT 
GTCATCCATA 
AGCCGTATGC 
TCCCCGCCAT 
ATCACGCCCG 
GGCATACCGC 
AAAT CCAACA 
TTAAATTACC 
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130 140 150 160 170 180 

ml27 . pep DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 
I I I I I II I 11 I I ! II II I I I I I 1 I I I I I I I I I I I I I I I I I I II I I I I I I I 1 I I I I I 1 I ! I 
al27 DYIEINGLRGRWDINLLNTLMMQVGPNPLVGQLAGTTVSFPNGLLLSHPVRRDNILGDY 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml27 . pep VI HTVE I P VPI HLDS DE AVCRLKAVLE PLCA PY I PAI QRX LENVQAEKLFITPAAR PRVT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I 1 I 11 I I I I I II I I I I I I : I I I I 
al27 VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRHLENVQAEKLFITPAAKPRVT 

190 200 210 220 230 240 

250 260 270 280 290 

ml27 . pep RVPYDDKAYRI IVRFASFVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX 

I I I I I i I I I I I I I I I I I I I I I I I I I I I M M I I 1 I I I I I I I I : I I I I t I I I 
a 1 2 7 RVPYDDKAYRI I VRFAS PVSKRLE IQQAVMDEFLRVQYRLLNY PAGSETLX 

250 260 270 280 290 



The following partial DNA sequence wa* 

gl28 . seq 

1 atgattgaca acgCActgct 

51 aatccaaacc gaagACAtca 

101 CGCGCGGACA AATCGCCGCC 

151 AACACCGTCG AGCGTCTGAC 

201 GGGCGTCGTG TCCCATCTCA 

251 CCGTCTATAA CGAACTGATG 

301 GGACAAGACA TCGAACTGTA 

351 CGAATTTGCA ACGCTTTCCC 

401 TGCGCGATTT CX3TATTGAGC 

451 GAACTGGCAA AACTGCAAAC 

501 CCAAAACGTC CTAGACGCGA 

551 CCGCACCGCT TGCCGGCATT 

601 GCCGCGCAAA GCGAAGGCAA 

651 GCACTACCTT GCCGTTATCC 

701 AAATCTACCG CGCCTACGTT 

751 AAATTCGACA ACACCGCCAA 

801 AACCGccaaa cTGCTCGGCT 

851 CCAAAATGGC GGACACGCCC 

901 GCCCGCCGCG CCAAACCCTA 

951 CTTCGCCCGC GAACACCTCG 

1001 GCTACGCCGG CGAAAAACTG 

1051 GAAGTCAAAA AATACTTCCC 

1101 CCAAATCAAA AAACTCTACG 

1151 TCTGGCACAA AGACGTGCGC 

1201 ATCGGCGGCG TTTATATGGA 

1251 CGCGTGGATG AACGACtaca 

1301 TGCAACTGCC CACCGCCTAC 

1351 GGCAAAGAAG CGCGTTTAAG 

14 01 AacCGGCCAC GGACTGCACC 

14 51 TGTCCGGCAT CAAcggcgtA 

1501 TTTATGGAAA ACTTCGTTTG 

1551 CCACGAAGAA AccgGCGAGC 

1601 TcgcCGCCAA AAACTTCCAG 

1651 TTCGCCCTCT TCGATATGAT 

1701 GAAAAACTGG CAGCAGGTTT 

1751 TCCAACCGCC CGAATACAAC 

1801 GCcggcGGCT ATTCCGCAGG 

1851 CAGCACCGAT GCCTACGCCG 

1901 CAGGCAAACG CTTCTGGCAA 

1951 gcgGCGGAAT CCTTCAAAGC 



identified in K gonorrhoeae <SEQ ID 48 1>: 

ccacttgggc gaagaaccCC GTTTTaatca 
AACCCGCCGT CCAAACCGCC AT CGCCGAAG 
GTCAAAGCGC AAACGCACAC CGGCTGGGCG 
CGGCATCACC GAACGCGTCG GCAGGATTTG 
ACTCCGTCGT CGACACGCCC GAACTGCGCG 
CCTGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAGCTC GATCACGACC 
GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
AACAGGTTAC AAAATCGGCT TGCAGATTCC 
AATACGCCGG CAACCGCGAA CTGCGCGAAC 
ACCCGTGCCA GCGAACTTTC AAACGACGGC 
CATCGACCGC ACGCTCGAAA ACGCATTGAA 
TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
GAACAGGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 
CGCGAAGCCA AATACGCATT CAGCGAAACC 
CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 
GCATCGGATT CGCCGAAAAA ACCGTTCCCG 
TATTTTGAAT TGCAACAAAA CGGCAAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGCTTTGCC GACGgcacGC 
CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 
CCACGACGAA ATCCTCACCC TCTTCCACGA 
ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
GAATGGGACG CGGTCGAACT GCCCAGCCAG 
GGAATACAAT GTATTGGCAC AAATGTCCGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
GATTTACAGT GAAAGCGACG AATGCCGTCT 
TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
CGCTTCGCCA ACAGCTTCGG CCacatCtTC 
CTATTACAGC TACGCATGGG CCGAAGTCCt 
CCTTTGAAGA AAGcGACGac gtCGCCGCCA 
GAAAtccttg ccgtcggcgg ctCCCGCAGC 
CTTCCGCGGA CGCGAACCGA GCATAGACGC 
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2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 482; ORF 1 28.ng>: 



gl28 .pep 








1 


MIDNALLHLG 


EEPRFNQIQT 


EDIKPAVQTA 


51 


NTVERLTGIT 


ERVGRIWGW 


SHLNSWDTP 


101 


GQDIELYNRF 


KTIKNSPEFA 


TLSPAQKTKL 


151 


ELAKLQT EGA 


QLSAKFSQNV 


LDATDAFGIY 


201 


AAQSEGKTGY 


KIGLQIPHYL 


AVIQYAGNRE 


251 


KFDNTANIDR 


TLENALKTAK 


LLGFKNYAEL 


301 


ARRAKPYAEK 


DLAEVKAFAR 


EHLGLADPQP 


351 


EVKKYFPVGK 


VLAGLFAQIK 


KLYGIGFAEK 


401 


IGGVYMDIjYA 


REGKRGGAWM 


NDYKGRRRFA 


451 


GKEARLSHDE 


ILTLFHETGH 


GLHHLLTQVD 


501 


FMENFVWEYN 


VLAQMSAHEE. 


TGEPLPKELF 


551 


FALFDMMIYS 


ESDECRLKNW 


QQVLDSVRKE 


601 


AGGYSAGYYS 


YAWAEVLSTD 


AYAAFEESDD 


651 


AAESFKAFRG 


REPSIEALLR 


QSGFDNAA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 483>: 

ml28.seq (partial) 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAG CTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 

1 TACGCCAGCG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 

51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 

101 AAmTCAAAAA ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 

151 TGGCACAAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG GCGAAmCCAT 

201 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG 

251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 

3 01 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 

3 51 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 

4 01 CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 
451 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
501 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 
551 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 
601 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 
701 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
751 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
951 nGCAGAATCC TTCAAAGCCT TCCGCGGCCG CGAACCGAGC ATAGACGCAC 

1001 TCTTGCGCCA CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 484; ORF 128>: 

ml28.pep (partial) 

1 MTDNALLHLG EEPRFDQIKT EDI KPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



// 



* 1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRFSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAO XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 
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301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/gl28 

10 20 30 40 50 60 

MIDNALLHLGEEPRFNQI QTEDI KPAVQTAI AEARGQIAAVKAQTHTGWANTVERLTGI T 

i iiiiiiiiiiiihihiiiiiihiiiiiiii iiihiiiiiiiiimi inn 

MTDNALLHLGEE PRFDQ I KTED I KPALQTAI AEAREQ I AAI KAQTHTGW ANTVE PLTG I T 
10 20 30 40 50 60 

70 80 90 100 110 120 

ERVGRI WGWSHLNSWDTPELRAV YNELMPE ITVFFTE I GQDI EL YNRFKTI KNS PEFA 

iimiiiiiim hiiiiiiMMiiMiiiiimMiiiiitiiiiiniim 

ERVGRI WGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNS PEFD 
70 80 90 100 110 120 



gl28 .pep 
m!28 

gl28 .pep 
ml28 



130 140 150 160 170 180 

gl28 . pep TLSPAQKTKLDHDLRDFVLSGAELPPER0AELAKLQTEGAOLSAKFSQNVLDATDAFGIY 

mi ii mi: i 

ml 2 8 TLS PAQKTKLNH 

130 

// 

340 350 360 

gl 2 8 . pep YAGEKLREAKYAFSETEVKKYFPVGKVLAG 

I hi II I IP III II I I II I MM II I 

ml28 YAS E KLREAKYAFS ETX VKKY F PVGXVLNG 

10 20 30 



370 380 390 4C0 410 420 

gl28 .pep LFAQI KKL YG I G FAE KTVPVWHKDVRYFELQQNGKT I GGVYMDL YAREGKRGGAWMND YK 

Mil I II Ml MM II I II MIMM Mill hH II; II I II 1 1 MM III II I II I 

ml28 LFAQXKKLYG I GFTEKTVPVWHKDVRYXELQQNGEX I GGVYMDL YAREGKRGGAWMND YK 

40 50 60 70 80 90 



430 440 450 460 470 480 

gl28 . pep GRRRFADGTLQLPTAYLVCNFAP PVGGKEARLSHDE I LTLFHETGHGLHHLLTQVDELGV 

I I II I : II II I I I I I I II I II I II I I h I II II M I M II I I I II I II II I M I I I M 
ml 2 8 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDE I LI LFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 



490 500 510 520 530 540 

gl 2 8 . pep S G INGVEWD AVEL PS QFMEN FVWE YNVLAQMS AHE E TGE P LP KE L FDKMLAAKN FQRGMF 

I Mill Mil III Ml Ml I HUM I II I II II 1 1 Mill II II Ml II III 

ml 2 8 SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVP 

160 170 180 190 200 210 



550 560 570 560 590 600 

gl28 . pep LVRQmFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

Ml I Mill Ml I II Ml 1 1 Mill II I III hill III I II II II MM MUM 

ml 2 8 XVRQXE FALFDMM I YSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALS FGHI FAGGY 

220 230 240 250 260 270 



610 620 630 640 650 660 

gl 2 8 . pep SAGYYS YAWAEVLSTDAYAAFEESDDVAATGKRFWQE I LAVGGSRSAAES FKAFRGREPS 

lh IIMMMIhlMMMIMMIMMMMMIIM MhlMIMMIMM 

ml 2 8 S AAX YS Y AW AE V LS ADA Y AA FE E SDD VAATGKR FWQ E I LAVGXSRSGAES FKAFRGREPS 

280 290 300 310 320 330 
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670 679 

g 12 8. pep . IDALLRQSGFDNAAX 

II I I I hi II l!h 

ml 2 8 IDALLRHSGFDNAVX 
340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 485>: 
a!28 .seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATG 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCG CC AAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTT7TTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAAC7CTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAG C GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 486; ORF 128.a>: 

al28 .pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA 1AEAREQIAA IKACTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EW DAVE LP SQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

128/al28 66.0% identity in 677 aa overlap 

20 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

I I 1 M M I ! 1 1 I I I I I I I I I I I I 1 1 I I I I I I I I I I I 1 1 I I I I I I I 1 I I I 1 I 1 1 I I I 1 1 M 
al28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml28 . pep ERVGRIWGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I II I I I I I I I : I I I 1 I I I : I I I I I I I I I I I I I I t I I I I I I I I I I I I I I | || | | | 
al28 ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

TO 80 90 100 110 120 

130 

ml 28 . pep TLS PAQKTKLNH 

III I I M I I I I 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



ml 28. pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 
190 200 210 220 230 240 



ml 28. pep 



al28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 

140 150 

ml 28 .pep YASEKLREAKYAFSETXVKKYFPVGX 

I I : I I I I i I II I I I I I I I I I I I I I 
al28 ARRAK P Y AE K DLAE VKAFARES LG L A DLQ PW D LG Y AGE KLRE AK Y A FSE TE VKKY FP VG K 

310 320 330 340 350 360 

160 170 180 190 200 210 

ml 28 . pep VLNGLFAQX KKLYG I G FT EKT V PVWHKD VR YXE LQQNG EXI GGV YM D L Y AREGKRGG AWM 

I I I I I I I I I I I I I 1 I II I II I I I I I I I I I I II I I I I I : I I I I I I I I I I I I II I I I I I I 
al20 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I I I I I I I I I I I I I I I I ! I I I I I I I I : I I I I I : E I I I I I I I I I I I II I I I I II I I I I I I I 
al28 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

280 290 300 310 320 330 

ml28.pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAOXSAHEETGVPLPKELXDKXLAAKNFQ 
1 I I I I I I II I I I I I II I I 1 I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I 
a 128 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

340 350 360 370 380 390 

ml28.pep XGM FXVRQXEFALFDMMI YSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALS FGH I F 

HI Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I :: I I | I i I I I IMIII 
al28 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF 
550 560 570 580 590 600 
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400 410 420 430 440 450 

AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWCEILAVGXSR3GAESFKAFRG 
MINI: j M I | I I I I I I ( I I I II I I I I I | I I I I I I I I | || | | | | |||:||||||||l 
AGGYSAGYYSYAWAEVLSA0AYAAFEESDDVAATGKRFWCBILAVGGSR5AAESFKAFRG 
610 620 630 640 650 660 

4 60 470 
REPSIDALLRHSGFDNAVX 
I I I I I I I I I I I I I I I I I : 
RE PS I DALLRHSG ?DNAAX 
670 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 487>: 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCG CCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 G3GCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

2S1 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 G3ACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCT ACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TGTCCGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ED 488; ORF 128-l.ng>: 

gl2 8-l.pep (partial) 

1 MIDNALLHLG BEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT BRVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFPTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDAIAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHbGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM MDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 489>: 

ml28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCG CCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 



m!28 . pep 
al28 

ml 28 . pep 
al28 
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201 


GGGCGTGGTG 


TCGCACCTCA 


251 


CCGTCTATAA 


CGAACTGATG 


301 


GGACAAGACA 


TCGAGCTGTA 


351 


CGAATTCGAC 


ACCCTCTCCC 


401 


TGCGCGATTT 


CGTCCTCAGC 


451 


GAACTGGCAA 


AACTGCAAAC 


501 


CCAAAACGTC 


CTAGACGCGA 


551 


CCGCACOGCT 


TGCCGGCATT 


601 


GCCGCGCAAA 


GCGAAAGCAA 


651 


ACACTACCTC 


GCCGTCATCC 


701 


AAATCTACCG 


CGCCTACGTT 


751 


AAATTCGACA 


ACACCGCCAA 


801 


AACCGCCAAA 


CTGCTCGGCT 


851 


CCAAAATGGC 


GGACACGCCC 


901 


GCCCGCCGCG 


CCAAACCCTA 


951 


CTTCGCCCGC 


GAAAGCCTGA 


1001 


GCTACGCCAG 


CGAAAAACTG 


1051 


GAAGTCAAAA 


AATACTTCCC 


1101 


CCAAATCAAA 


AAACTCTACG 


1151 


TCTGGCACAA 


AGACGTGCGC 


1201 


ATAGGCGGCG 


TTTATATGGA 


1251 


CGCGTGGATG 


AACGACTACA 


1301 


TGCAACTGCC 


CACCGCCTAC 


1351 


GGCAGGGAAG 


CCCGCCTGAG 


1401 


AACCGGACAC 


GGGCTGCACC 


1451 


TATCCGGCAT 


CAACGGCGTA 


1501 


TTTATGGAAA 


ATTTCGTTTG 


1551 


CCACGAAGAA 


ACCGGCGTTC 


1601 


TCGCCGCCAA 


AAACTTCCAA 


1651 


TTCGCCCTCT 


TTGATATGAT 


1701 


GAAAAACTGG 


CAACAGGTTT 


1751 


TCCAGCCGCC 


CGAATACAAC 


1B01 


GCAGGCGGCT 


ATTCCGCAGG 


1851 


GAGCGCGGAC 


GCATACGCCG 


1901 


CAGGCAAACG 


CTTTTGGCAG 


1951 


GCGGCAGAAT 


CCTTCAAAGC 


2001 


ACTCTTGCGC 


CACAGCGGTT 



ACTCCGTCGC CGACACGCCC GAACTGCGCG 
CCCGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAACTC AACCACGATC 
GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 
CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
AACAGGCTAC AAAATCGGCT TGCAGATTCC 
AATACGCCGA CAACCGCGAA CTGCGCGAAC 
ACCCGCGCCA GCGAACTTTC AGACGACGGC 
CATCGACCGC ACGCTCGCAA ACGCCCTGCA 
TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
GAACAAGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
ACCTCGCCGA TTTGCAACCG TGGGACTTGG 
CGCGAAGCCA AATACGCGTT CAGCGAAACC 
CGTCGGCAAA GTATTAAACG GACTGTTCGC 
GCATCGGATT TACCGAAAAA ACCGTCCCCG 
TATTTTGAAT TGCAACAAAA CGGCGAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGTTTTTCA GACGGCACGC 
CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 
CCACGACGAA ATCCTCATCC TCTTCCACGA 
ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
GAATGGGACG CGGTCGAACT GCCCAGCCAG 
GGAATACAAT GTCTTGGCAC AAATGTCAGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGCATGT TCCTCGTCCG GCAAATGGAG 
GATTTACAGC GAAGACGACG AAGGCCGTCT 
TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 
CGCTTCGCCT TGAGCTTCGG CCACATCTTC 
CTATTACAGC TACGCGTGGG CGGAAGTATT 
CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
CTTCCGCGGC CGCGAACCGA GCATAGACGC 
TCGACAACGC GGTCTGA 



This corresponds to the amino acid sequence <SEQ ID 490; ORF 128-1>: 

ml28-l.pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQGA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRB LREQIYRAYV TRASELSDDG . 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL S1ATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FKENFVWEYN VLAQMSAHEE TGVPLPKELP DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAV* 



m!28-l/gl28-l 94.5% identity in 491 aa overlap 



10 20 30 40 50 60 

gl28 - 1 . pep MI DNALLHLGEEPR FNQ I KTED I K PAVQT AI AEARGQ I AAVKAQTHTGWANTVERLTG IT 

I I I I I I I I I 1 I I I I : I 1 I I I I I ■ I h ! I I I 1 I I I I I i h I I I i I I i I I I I I I I I I I I 
TO128- 1 MTDN ALLHLGE E PRFDQ I KTED I K PALQTAI AEAREQI AAI KAQTHTG WANT VE PLTG IT 

10 20 30 40 50 60 



70 80 90 100 110 120 

g!28-l - pep ERVGR IWG WSHLNSWDT PE LRAVYNELMPE I TVFFT E IGQD I ELYNRFKT I KNS PE FA 

I Mil II II Nil III: II II Ml till M III III! Illlllllllil II Mllllll 
ml28-l ERVGR IWG WSH LNSVADT PE LRAVYNELMPE I TVFFT E IGQD I ELYNRFKT I KNS PE FD 

70 30 90 100 110 120 
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130 140 150 160 170 180 

gl28-l.pep TLS PAQKTKLDHDLRDFVLSGAELPPE^QAEIJUCLQTEGAQLSAKFSQNVLDATDAFG I Y 

Mil II III UN III II M M Mlhl IMtllll II HM llllll II II III II II 

ml 2 8 - 1 TLS P AQKTKLNHDLRD FVLSGAE LPPBQQAELAKLQTEG AQLSAKF SQNVLD ATDAFG I Y 

130 140 150 160 170 160 



190 200 210 220 230 240 

gl28-l .pep FDDAAPLAGI PEDAIAMFAAAAQSEGKTGYKIGLQI PHYLAVIQYAGNRELnEQIYRAYV 
Mill lltlll I IIMIIIII II IMIIII lillllllill I I II lllllilllllll 
ml28-l FDD AA P LAG I PEDALAMFAAAAQS ES KTG YKI G LQI PHYLA VI QYADNRELREQ I YRA YV 

190 200 210 220 230 240 



250 260 270 280 290 300 

gl 28 - 1 . pep TRASELSNDGKFDNTANIDRTLENALKTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

inn ihi 1 1 1 1 1 hi n 1 1 1 iihiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

ml 28 -1 TRASELSDDGKFDNTANIDRTIJWAUZrAKLUSF^ 

250 260 270 280 290 300 



310 320 330 340 350 360 

gl28-l .pep ARRAKPYAEKDLAEVKAFAREKLGLADPQPWDLSYAGEKLREAKYAFSETEVKKYFPVGK 

I I I I I I I I I I I I Ell III III hill I i I I I : I I i I I I I I I I I I I I I i I I I I I I I I I I 
ml 2 8 -1 ARRAKPY AE KDLABVKAFARES LNLADLQ PWDLGYAS E KLREAKYAFS ETE VKKYF PVG K 

310 320 330 340 350 360 



370 380 390 400 410 420 

gl28-l .pep VLAG LFAQ I KKL YG I G FAEKTVP VWHKDVRY FELQQNG KTI GGV YMDLY AREGKRGG AWM 

ii ii iii m 1 1 ii mmm i iiiiii mil i mm ii ii iiiiMimiiii 

ml 2 8 -1 VLNGLFAQIKKLYGIGFTEKTVPVWHKDWYFELC^3NGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 



430 440 450 460 470 480 

gl2 8 -1 . pep NDYKGRRRFAIXSTLQLPTAYLVCNFAPPVGGKEARI^HDEILTLFHETGHGLHHIJjTQ 

I I I I I I I I I = I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 2 8 - 1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEI LI LFHETGHGLHHLLTQVD 

430 440 450 460 470 480 



490 

gl28-l .pep ELGVSGINGVK 



lllllllll 

E LG VSG I NGVEWDA VE L PSQFMENFVWE YNVLAQMSAHE ETGVPLPKELFDKMLAAKN FQ 
490 500 510 520 530 540 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 49 1>: 

al28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATOGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
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1251 


CGCGTGGATG 


1301 


TGCAACTGCC 


1351 


GGCAAAGAAG 


14 01 


AACCGGACAC 


14 51 


TATCCGGCAT 


1501 


TTTATGGAAA 


1551 


CCACGAAGAA 


1601 


TCGCCGCCAA 


1651 


TTCGCCCTCT 


1701 


GAAAAACTGG 


1751 


TCCGACCGCC 


1801 


GCAGGCGGCT 


1851 


GAGCGCGGAC 


1901 


CAGGCAAACG 


1951 


GCGGCAGAAT 


2001 


ACTCTTGCGC 



AACGACTACA 
CACCGCCTAC 
CCCGCTTGAG 
GGCCTGCACC 
CAACGGCGTA 
ATTTCGTTTG 
ACCGGCGTTC 
AAACTTCCAA 
TTGATATGAT 
CAACAGGTTT 
CGAATACAAC 
ATTCCGCAGG 
GCATACGCCG 
CTTTTGGCAG 
CCTTCAAAGC 
CACAGCGGCT 



AAGGCCGCCG 
CTCGTCTGCA 
CCATGACGAA 
ACCTGCTTAC 
GAATGGGACG 
GGAATACAAT 
CCCTGCCGAA 
CGCGGAATGT 
GATTTACAGC 
TAGACAGCGT 
CGCTTCGCCA 
CTATTACAGC 
CCTTTGAAGA 
GAAATCCTCG 
CTTCCGCGGA 
TCGACAACGC 



CCGTTTTTCA 
ACTTCACCCC 
ATCCTCACCC 
CCAAGTCGAC 
CAGTCGAACT 
GTCTTGGCGC 
AGAACTCTTC 
TCCTCGTCCG 
GAAGACGACG 
GCGCAAAGAA 
ACAGCTTCGG 
TACGCGTGGG 
AAGCGACGAT 
CCGTCGGCGG 
CGCGAACCGA 
GGCTTGA 



GACGGCACGC 
GCCCGTCGGC 
TCTTCCACGA 
GAACTGGGCG 
GCCCAGTCAG 
AAATGTCCGC 
CACAAAATGC 
CCAAATGGAG 
AAGGCCGTCT 
GTCGCCGTCG 
CCACATCTTC 
CGGAAGTATT 
GTCGCCGCCA 
ATCGCGCAGC 
GCATAGACGC 



This corresponds to the amino acid sequence <SEQ ID 492; ORF 128-La>: 

al28-l.pep 



1 


MTDNALLHLG 


51 


NTVEPLTGIT 


101 


GQDI ELYNRF 


151 


ELAKLQTEGA 


201 


AAQSEGKTGY 


251 


KFDNTANIDR 


301 


ARRAKPYAEK 


351 


EVKKYFPVGK 


401 


IGGVYMDLYA 


451 


GKEARLSHDE 


501 


FMENFVWEYN 


551 


FALFDMMIYS 


601 


AGGYSAGYYS 


651 


AAESFKAFRG 


l/a!28-l 97.8% 



EEPRFDQIKT 
ERVGRIWGW 
KTIKNSPEFD 
QLSAKFSQNV 
KIGLQIPHYL 
TLENALQTAK 
DLAEVKAFAR 
VLNGLFAQIK 
REGKRGGAWM 
ILTLFHETGH 
VLAQMSAHEE 
EDDEGRLKNW 
YAWAEVLSAD 
REPSIDALLR 



EDI KPALQTA 
SHLNSVTDTP 
TLSHAOKTKL 
LDATDAFGIY 
AVIQYADNRK 
LLGFKNYAEL 
ESLGLADLOP 
KLYGIGFTEK 
NDYKGRRRFS 
GLHHLLTQVD 
TGVPLPKELF 
QQVLDSVRKE 
AYAAFEESDD 
HSGFDNAA* 



IAEAREQIAA 
ELRAAYNELiM 
NHDLRDFVLS 
FDDAAPLAGI 
LREQIYRAYV 
SLATKMADTP 
WDLGYAGEKL 
TVPVWHKDVR 
DGTLQLPTAY 
ELGVSGINGV 
DKMLAAKNFQ 
VAWRPPEYN 
VAATGKRFW0 



IKAQTHTGWA 
PEITVFFTEI 
GAELPPEQQA 
PEDALAMFAA 
TRASEL3DDG 
EQVLNFLHDL 
REAKYAFSET 
YFELQQNGET 
LVCNFTPPVG 
EWDAVELPSQ 
RGMFLVRQME 
RFANSFGHIF 
EILAVGGSRS 



identity in 677 aa overlap 



10 20 30 40 50 CO 

al28 -1 . pep MTDNALLHLGEEPRFDQIKTEDIKPALCTTAI 

MMMMMMM MIMMMIMIIIMMMM MIMMMMMMMIMI 

ml28-l MTDNALIaHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANT\'EPLTGIT 
10 20 30 40 50 60 

70 80 90 100 110 120 

al28 -1 . pep ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

1 1 1 1 1 1 I i 1 1 1 M I M • I I i 1 1 1 1 : 1 1 ! 1 1 1 1 1 1 1 1 1 ( I i 1 1 1 1 1 1 ! I M I i 1 1 i 1 1 1 1 1 

ral28 -1 ERVGRIWGWSHI^SVADTPELRAVYNEI^PEITvFFTEIGQDIELYWRFKTIKNSPEFD 
70 80 90 100 110 120 



al28-l .pep 



ral28-l 



130 140 150 160 170 180 

TLSHAQKTKLNITOLRDFVI^GAELPPEQQ^ 

Ml IIIMtlilliMlltllllliliiilllllillllllilllllltllillillll 

TLS PAQKTKLNHDLRDFVIjSGAELP PEQQAE LAKLQT EGAQLS AKFSQNVLDATDAFGI Y 
130 140 150 160 170 180 



al28-l .pep 



ml28-l 



190 200 210 220 230 240 

FDDAAPLAGI PEDALAMFAAAAQS EGKTG YKI G LQI PHYLAVI QYADNRKLRBQI YRAYV 

ill MIIIIMII lllll) Mhlllllill Mil Illlll MM h Mill I 

FDDAAPLAG I PEDALAMFAAAAQSESKTGYKIGLQI PHYLAVI QYADNRELRBQI YRAYV 
190 200 210 220 230 240 



250 260 270 280 290 300 

al2 8 - 1 . pep TRASELSDDGKFDNTAN IDRTLENALQTAKLLG FKNYAELS LATKMADTPEQVLNFLHDL 

MIIMMIilllllMMMI Ml M MMM II MMM MUM II MM MINI 

ml 2 8 - 1 TRASELSDDGKFDOTANIDRTLANALQTAKLLG FKNYAELS LATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 



310 320 330 340 350 360 

a 12 8 - 1 . pep ARRAKPYA E KDLA EVKA FARES LG LADLQ PWDLG YAG EKLR EA KYAF S ETEVK KY F P VG K 

I 1 1 1 M 11 1 M 1 1 M M I M M I M M I M M II 1 1 M M I M 1 1 1 1 1 1 1 1 1 1 I M I 1 1 I 
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ml 2 8 - 1 ARRAKPYAEKDLAEVKAFARESLNIoADIjQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

310 380 390 400 410 420 

S128-1 pep VLNGLF AQI KKLYG IGFTE KTVP VWHKD VRYF E LQQNGET I GG VYMDL YAREG KRGG A WM 

lllllllllllllllllllllllllllllllilinMlllllllllllilliMillll 

ml 2 8 - 1 VLNGIiFAQl KKLYG I GFTE KTVP VWHKD VRYFE LQQNGET I GGVYMDLYAREG KRGG AWM 

370 380 390 400 410 420 

430 440 450 460 470 480 

al2 8 - 1 . pep NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEI LTLFHETGHGLHHLLTQVD 

IMIIIIIMMIIIIMilMMhlllthlMIMIMI III! IIIIIMIIIII 

ml2 8 - 1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILI LFHETGHGLHHLLTQVD 

430 ' 440 450 460 470 480 

490 500 510 520 530 540 

al2 8 - 1 . pep ELGVSGINGVEl^AVELPSQmENFWEYNVI^QMSAHEETGVPLPKELFDKKl^AKNFQ 

I II ! II llllll I DIM II Mil [III! II III till MM! till i llllll III 1 1 ! 

ml 2 8 - 1 ELGVSGINGVEWDAVELPSQFMENFWEYNVLAQMSAHEETGVPLPKELFDKFJjAAKNFQ 
490 500 510 520 530 540 

550 560 570 580 590 600 

3128-1. pep RGMFLVRQMEFALFDMM I YSEDDEGRLKNWQQVLDS VRKEVAWRP PE YNRFANSFGH I F 

I M I M 1 1 M 1 1 1 1 1 1 II 1 1 M II II 1 1 1 1 i II M 1 1 M M I h M M 1 1 M I llllll 

ml 2 8 - 1 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
550 560 570 580 590 600 

610 620 630 640 650 660 

al 2 8 - 1 . pep AGG YS AGYY S YAWAEVLS ADAYAAFEE SDDVAATGKR FWQE I LAVGGS RSAAES FKAFRG 

I IM II IIIM MIM II II M Mill M MIM Ml I MM1M M I MIMMM M I 

ml 2 8 - 1 AGGYS AGYYSYAWAEVLS ADAYAAFEESDDVAATGKRFWQE I LAVGGS RSAAES FKAFRG 

610 620 630 640 650 660 

670 679 
al28-l.pep REPSIDALLRHSGFDNAAX 

I M ! I! MMM 1 1 M M 

ral2 8 - 1 REPSIDALLRHSGFDNAVX 
670 

al28-l/ P44573 

sp| P44573 |OPDA_HAEIN OLIGOPEPTIDASE A >gi | 1075082 | pir | | C64055 oligopcptidaae A (prlC) homolog 
- Haemophilus influenzae (strain Rd KW20) 

>gi| 1573174 (U32706) oligopeptidase A (prlC) [Haemophilus influenzae RdJ Length = 681 
Score = 591 bits (1507) , Bxpect = e-168 

Identities = 309/677 (45%), Positives = 415/677 (60%), Gaps * 4/677 (0%) 

Query: 4 NALLHLGEEPRFDQIKTEDIKPALQTXXXXXXXXXXXXXXXTHTGWANTVEPLTGITERV 63 

N LL++ P F QIK E I + PA+4 H W N + PLT +R+ 

Sbjct: 5 NPLIiNIQGLPPFSQIKPEHIRPAVBKLIQDCPOTIEQVLKQPHFTWENFILPLTETNDRL 64 

Query: 64 griwgwshi^svtdtpelraaynblmpeitvffteigqdielynrpkt;knspefdtls 123 

R W VSHLNSV 44 ELR AY +P +4 4 T +GQ LYN -» + KNS EF S 
Sbjct: 65 NRAWSPVSH1^SVKNSTELREAYQTCLPLI£EYSTWVT3QHKGLYNAY1ALKNSAEFADYS 124 

Query: 124 HAQKTKLNHDIiRDFVL£GAELPPEQQAELAKJjQTEGAQLS 183 

AQK + + LRDF LSG L E+Q 44 ++L+++FS NVLDAT + 4+ 
Sbjct: 125 IAQKKAIENSIJlDFBLSGIGLSEEKQQRYGEIVARLSELNSQFS^INVLDAT^IGWEKLIEN 184 

Query: 184 AAPLAGI PEDALAI4FAAAAQSEGKTGYKIGLQI PKYTAVIQYADNRKLREQI YRAYVTRA 243 

A LAG+PE AL 4A+S+G GY+ L+IP YL V+ Y +NR LRE++-YRAY TRA 

Sbjct: 185 EAE LAGLPE SALQ AAQQS AES KGLKG YRFTLEI PS YLP VMT YC ENRALRE EMYRA YAT RA 244 

Query: 244 SELSDD GKFDlTrANIDRTLEN7u^TAKLlJGFKNYAEI^lATKMADTPEQVI^FLHDI^ 302 

SE + GK+DN+ +4 L ++ AJCLLGF Y ELSLATKMA+ P+QVL+FL LA 
Sbjct: 24 5 S EQG PNAGKWDNS KVMEE I LTLRVB LAKLLG fDTVTELS LATKMAENPQQVLD FLDHLAE 304 



Query: 303 RAKPYAEKDI^EVKAFAlU^SI^i^ljQPWDLGYAGEKLREAKYAFSETEVKKYFPVGKVL 362 

RAKP EK-fL E+K 4 + G+ +L PWD+G4 EK 4+ YA 44 E+4 YFP +V+ 
Sbjct: 305 PAKPC^EKEliQFXKGYCEKEFGVTBIAPWDIGFYSEKQKQHLYMNDEELRPYFPENRVI 364 
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Query: 363 NG LFAQ I KKLYG I G FTE - KT VPVWHKDVRYFEL - QQNG ET IGG VYMDL YAREGKRGGAWM 420 

+GLF IK+++ I E K V WHKDVR+F+L -+N + G Y+DLYARE KRGGAWM 
Sbjct: 365 SGLFELIKRIFNIRAVERKGVDTWHKDWFFDLIDENDOLRGSFYLDLYAREinCRGGAVW 424 

Query: 421 ND YKGRRRFSDGTLQLPTAYLVCNFT PPVGGKEARLSHDE IXXXXXXXXXXXXXXXXQVD 480 

+D GR+R DG+ + + P AYL CNF P+G K A +H+E+ Q+D 
Sbjct: 425 DDCIGPJ<RKLIX5SIETPVAYIjTCNFNAPrGNKPALFTHNEVTTLraEFGHGIHHI'lLTQID 4 84 

Query: 481 E LGVSG INGV EWD A VE LPSQ FME^FVWEYNVLAQMS AH EETG VPLPKELFDKMLAAKN FQ 540 

V+GINGV WDAVELPSQ FMEN +• WE LA +S H ETG PLPKE ++L AKNFQ 
Sbjct: 485 VSDVAGINGVPWDAVELPSQTOENVOffiEEALAFISGHYETGEPLPKEKLTQl^KAKNFQ 544 

Query: 541 RGMFLVRQME FALFDMMI YS EDDEGRLKNWQQVLDS VRKEVAWRP PEYNRFANSFGH I F 600 

MF++RQ+EF +FD ++ D + L SV+ +VAV++ ++ R +SF HIF 

Sbjct: 545 AAMFILRQLEFGIFDFRLHHTFDAEKTNQIIJ5TLKSVKSQVAVIKGVDWARAPHSFSHTF 604 

Query: 601 XXXXXX3GCXXXXWAEVLSADAYAAFEESDD Y- AATGKRFWQE I LAVGGS RS AAESFKAFR 659 

WAEVL5ADAY+ FEE TGK F EIL GGS E FK FR 

Sbjct: 605 AGGYAAGYYSYIiWAEVLSADAYSRFEEEGIFNPITGKSFLDEILTRGGSEEPMELFKRFR 664 

Query: 660 GREPSIDALLRHSGFDN 676 

GREP -+DALLRH G N 
Sbjct: 665 GREPQLDALLRHKGIMN 681 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 493>: 

gl29.seq 

1 ATGCTTTCAC CTCCTCGGCG TAAAACGGCG GCACATCAAT CAAGCCGTCT 

51 TTCATTTGCG TGCGGAAAAA ATGCGGCGTG TTGCCGTGAT CAAAATCAAT 

101 ATCGTGCAGC ATCCAGCCCA AATCGCGGTT TGCCTCGCTT TCCGATAACG 

151 CCGACGGCGG CAGCGGTTCA CCCTTATCCG CGCTTTCGCC ATTTGCCCTT 

201 TCAGGCTGCG GGCATAGGGG CGGAACAGGC GGCGGT CGAA TCCTGTTTCA 

251 TCCGGACAAA CGCGTTGGCA GTCGGAAAAT CCGGCCGGCC GTGTCAAATA 

301 ATGCGTTACT TTGGCCGGGT CTTGTCCTTT GTAAGCGGCG GTCTTTTTTT 

351 GCGCGCCATC CGCATCTGTT TGGGCGCATG GCAAACGGCG GCTGCCGTAC 

401 AATCAAAATG TTTGGCGATT TCATGCAGAC AGGCATCCGG ATGCCGCCCG 

4 51 ACATATCGAG CCGGTTTTTG CCTATCCGAT TTGGCGGCAT TTAGGCCGGT 

501 AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 494; ORF 129.ng>: 

gl29 .pep 

1 MLSPPRRKTA AHQSSRLSFA CGKNAACCRD QNQYRAASSP NRGLPRFPIT 

51 PTAAAVHPYP RFRHLPFQAA GIGAEQAAVE SCFIRTNALA VGKSGRPCQI 

101 MRYFG RVLSF VSGGLFLRAI RIC LGAWQTA AAVQSKCLAI SCRQASGCRP 

151 TYRAGFCLSD LAAFRPVT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 495>: 

ml29.seq (partial) 

1 . .TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCGG CTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG G CG ATTT CAT 

251 GCAGATAGGC ATCCGGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 496; ORE 129>: 

ml29 .pep (partial) 

1 . . YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGRLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSAXQ MVAAVQSKCL AISCRXASGC CPTYXAGFCL 

101 SDLTAFRPVT * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 129 shows 79.1% identity over a 1 10 aa overlap with a predicted ORF (ORF 129.ng) 
from N. gonorrhoeae: 

ml29/gl29 

10 20 30 

ml29 .pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINT 

I II" II I II III hi hi hll I h h 

gl29 RDQNQYRAASSPNRGLPRFPITPTAAAVHPYPRFRHLPFQAAGIGAEQAAVESCFIRTNA 
30 40 50 60 70 80 

40 50 60 70 80 90 

ml29.pep LWGKFGRLCQIMRYFGRVLFFVSGGLFLRVIPICliSAXQMVAAVQSKCLAISCRXASGC 

hill ii milium nm mm nm i milium m im 

gl29 I^VGKSGRPCX5IMRYFX3RVLSFVSGGLFI^IRICLGAWQTAAAVOSKCLAISCROASGC 
90 100 110 120 130 140 

100 110 
ml2 9 . pep C P T YXAGFCLS DLTAFR PVTX 

III lllllllhlllllll 
gl29 RPTYRAGF CLSDLAAFR PVTX 

150 16 0 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 497>: 
al29.seq (partial) 

1 TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 
51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 
101 GAAAATTCGG CCAGCTGTGT CAAATAATGC GTTACTT TGG CCGGGTCTTG 
151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 
201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 
251 GCAGATAGGC ATCCTGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 
301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 498; ORF 129.a>: 

al29.pep (partial) 

1 YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGQLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSA*Q MVAAVQSKCL AISCR*ASWC CPTY+AGFCL 

101 SDLTAFRPVT * 

ml29/al29 98.2% identity in 1 10 aa overlap 

10 20 30 40 50 60 

ml29.pep Y LRFH YLP FQAAG I G TE QVAVKS C FI Q I NT LWGK FG RLC Q IMR YFG RVL FFV S GG L JLR 

I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I i : I I I I I I I I I I I I I I I i I I I I I I 
al29 Y LRFHYL P FQAAG I G TE QVAVKS CFIQ I NT LWGK FGQLCQIMRYFG RVL FFVSGGLFLR 

10 20 30 40 50 60 

70 80 90 100 110 

ml29.pep VIPICLSAXQMVAAVQSKCLAISCRXASGCCPTYXAGFCLSDLTAFRPVTX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al29 VIPICLSAXQMVAAVQSKCLAISCRXASWCCPTYXAGFCL3DLTAFRPVTX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 499>: 

gl30 .seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG TGGATCGTTC GGCGATGTCG ATGCCACTAC GGAAGCGGCA 

151 ACGCAGACCC GCATCCAGCC TGTCGGACAA TTGACGATGG GTGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 
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251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGACT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGCAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACCTACAT GGCGAATAAA 

4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTG CGGTCGG CGTTGACGGT 

601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTAT CAAAA 

701 AAGGCAAAGA AACCTTGCAC AAACATGCCC TTGAAGGCTT TAACGCGATG 

751 CCGGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 500; ORF 130.ng>: 

gl30.pep 

1 MKQLRDNKAQ GSALF TLVSG IVIVIAVLYF LIKLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQI FGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAADLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 50 1>: 



ml30. seq 


(partial) 


1 


. . GGCGAACAGA TTTTCGGCAA 


51 


CAATGTGCCG AACGCTCCGA 


101 


GTATCGgCAA GGCTTCGATA 


151 


ACGCCATGCC TGCAAAAGGC 


201 


AAACGGGCGA TTACTTACAT 


251 


TCCTGATGAG GCTGCGCCTG 


301 


CTCCTGCCGA TAGTGCAGCT 


351 


GCGGCACCCC TGCGGTCGGC 


401 


TGTCAGGTGT GCCACGGCGG 


451 


AAAAGACGAT TGGGCACCGC 


501 


AACACGCCCT TGAAGGCTTT 


551 


GGTTTGAGCG ATGACGAAgT 


601 


ATCCGGTGCA AAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 502; ORF 130>: 

ml30.pep {partial) 

1 . .GEQIFGKICI QCHAADSNVP NAPKLEHNGD XAPRIQGFDT LFQHALNGFN 
51 AMPAKGGAAD LTDQELKRAI TYMANKSGGS FPNPDEAAPA DNAASGTASA 
101 PADSAAPAEA KAEDKGAAPA VGVDGKKVFE ATCQVCHGGS IPGIPGIGKK 
151 DDWAPRIKKG KETLHKHALE GFNAMPAKXG NAGLSDDEVK AAVDYMANQS 
201 GAKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 1 30 shows 98. 1% identity over a 206 aa overlap with a predicted ORF (ORF 1 30.ng) 
from N. gonorrhoeae: 

ml30/gl30 

10 20 30 

ml30 pep GEQIFGKICI OCHAADSNVPNAPKLEHNGD 

Mil Hi II I III II I I I II I IN II llll 
g 1 3 0 DATTE AATQTRI Q PVGQLTMGDG I PVGERQGEQI FGKICI QCHAADSNVPNAPKLEHNGD 

50 60 70 80 90 100 

40 50 60 70 80 89 

ml 3 0 pep XAPRI - QGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 

llll I I I I I I I I I I I I I I M I I I II I I I I I ! I I 1 I I I I I I I I I I I I I I I I I I I I I I I I 
gl3 0 WAPRIAQGFDTLFQHALNGFN7^PAKGGAADLTIX)ELKRAITYMANKSGGSFPNPDEAAP 
110 120 130 140 150 160 
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90 100 110 120 130 140 

ml3 0 . pep ADN AAS GT AS A PAD S AA P AEAKAE DKGAA - PAVGVDGKKVFEATCQVCHGGSI PGI PGIG 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 

gl3 0 ADNAASGTASAPADSAAPAEAKAEDKGAAAPAVGVDGKKVFEATCQVCHGGSIPGI PGIG 

170 180 190 200 210 220 



150 160 170 180 190 200 

ml30 . pep KKDDWAPR I KKGKETLHKHALEGFNAMP AKXGNAGLSDDEVKAAVDYMANQSGAKFX 

II Ml I II! Ill II III I I I II II Hill IIMIIIIIIIIIIIIMIIIIIIII 
gl30 KKDDWAPRIKKGK3TLHKHAl;EGF^AMPAKGGNAGLSDDEVKAAVDyMANQSGAKFX 
230 240 250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 503>: 

a!30.seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG CGGCTCGTTC GGCGATGTCG ATGCCACTAC GGAAGCAGCA 

151 ACGCAGACCC GTATCCAGCC TGTCGGACAA TTGACGATGG GCGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 

251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGATT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGTAG 

401 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACTTACAT GGCGAACAAA 

451 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 

601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 

701 AAGGCAAAGA AACCTTGCAC AAACACGCCC TTGAAGGCTT TAACGCGATG 

751 CCTGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 504; ORF 130.a>: 

al30.pep 

1 MKQLRDNKAQ GSAL FTLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAVDLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

m 1 30/al 30 97.6% identity in 206 aa overlap 

10 20 30 

ml30 . pep GEQIFGKICIQCHAADSNVPNAPKLEHNGD 

I I I 1 1 1 1 II I I M I II I I! I I I i ! I I I I I I 
al30 DATTEAATQTRIQPVGQLTMCDGIPVGERQGEQIFGKICIQCHAADSNVPNAPKLEIINGD 
50 60 70 90 90 100 



40 50 60 70 80 89 

ml 30 . pep XAPRI-QGFDTLFQHALNGFNAMPAKGGAADLTDQ2LKRAITYMANKSGGSFPNPDEAAP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I i ! I I I I I I 
al30 WAPRIAQGFDTLFQHALNGFNAMPAKGGAVDLTDQELKRAITYMANKSGGSFPNPDEAAP 
110 120 130 140 150 160 



90 100 110 120 130 140 

ml30.pep ADNAASGTASAPADSAAPAEAKAE DKGAA- PAVGVDGKKVFEATCQVCHGGSI PGI PGIG 

I I I 1 I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I [ I I I | 
al30 ADNAASGTASAPADSAAPAEAKAE DKGAAAPAVGVDGKKVFEATCQVCHGGS I PGI PGIG 

170 180 190 200 210 220 



ml 30 .pep 



150 160 170 180 190 200 

KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 
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1 1 1 j I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 

al30 KKDDWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 505>: 

gl32.seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT ATTAATATTA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACCTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CCAACTTcct CAgccGCTCG AccGccGTTG CAGCAACAtt 

201 tttcttTGca acctgcAtgg gctatggTgt atattcacac CCACACGACA 

251 AAACACGGTT TGGACTtcag caacataCGA CAGACTCAGC AagcACCCAA 

301 ACCcgtAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACagtTTTT CAAATgccga caTGgtga 

This corresponds to the amino acid sequence <SEQ ID 506; ORF 132.ng>: 

gl32 .pep 

1 MEAFKTLIWI INIISALAV I VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS T AVAATFF FA TCMGYGVYSH PHDKTRFGLQ QHTTDSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 507>: 

m!32 . seq (partial) 

1 ATGGAAGCCT TCAAAACCTT AATTTGGATT GTTAATTTAA TTTCCGCTTT 

51 GGCCGTCTTC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGA. . . 

This corresponds to the amino acid sequence <SEQ ID 508; ORF 132>: 
ml32.pep (partial) 

1 MEPFKTLIWI VNLISALAV F VLVLLQHGKG ADAGATFG. . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 132 shows 89.5% identity over a 38 aa overlap with a predicted ORF (ORF 132.ng) 
from N. gonorrhoeae: 

ml32/gl32 

10 20 30 

ml32 .pep ME P FKTL I W I VNL I S ALAVFVLVLLQHG KGAD AGATFG 

ii 1 1 ii iih hi illinium M i mi it 1 1 

g 1 3 2 MEAFKTLI W I INI I SALAV I VLVLLQHGKGADAGATFGSGSGSAQGVFGS AGNANFLSRS 

10 20 30 40 50 60 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 509>: 

al32.seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT GTTAATATAA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CTAACTTCCT CAGCCGCTCG ACCGCCGTTG CAGCAACATT 

201 TTTCTTTGCA ACCTGCATGg GCTATGGTGT ATATTCACAC CCACACGACA 

251 AAACACGGTT TGGACTTCAG CAACGTACAA CAAACTCAGC AAGCACCCAA 

301 ACCCGTAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACAGTTTTT CAAATGCCGA CATGGTGA 

This corresponds to the amino acid sequence <SEQ ID 510; ORF 132. a>: 

al32 .pep 

1 MEAFKTLIWI VNII5ALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QRTTNSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 



ml32/al32 92.1% identity in 38 aa overlap 
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10 20 30 

ml 32 . pep ME P FKT LIWIVNLIS ALAV FV L VL LQHG KG A DAG AT FG 

I I I I I I I I I I I : I I I i I I : I I I I I I I I I I i II I I I I I 
a 132 MEAFKTLIWIVNIISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 51 1>: 

gl34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCCGATGCGG GTAAAACCAC GCTGACCGAA AAACTGCTGC 

101 TGTTTTCGGG CGCGATTCAA AGCGCAGGCA CGGTGAAAGG TAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

3 01 GTTTTAACCG CAGTGGACAG CGCCTTGATG GTCATCGACG CGGCAAAAGG 

3 51 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC CTGCGCGATA 
401 CGCCGATTGT TACCTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCT 

4 51 TTGGAACTCT TGGACGAAGT GGAAGACATC CTGCAAATCC GCTGCGCGCC 
501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG ACATCATCAA AGGCATAAAC AATCCCGAAT TGGAACAACG 
651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAATTTAAT CTCGacgaAT TTCTCGccgG CGAACTCACG 
751 CCAGTGTTCT TCGGCTCTGC GATTAACAAC TTCGGCATTC AGGAAAT CCT 
801 CAATTCATTG ATTGACTGGG CACCCGCACC GAAACCGCGC GACGCGACCA 
851 TGCGCATGGT CGGGCCGGAC GAGCCGAAAT TTTCCGGATT TATCTTTAAA 
901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATCG CCTTCTTGCG 
951 CGTCTGCTCC GGTAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 T CAACCGCGA AATCGCCGCC TCCAGCGTAG TAACCTTCAT GTCGCACGAC 

1051 CGCGAACTGG CGGAAGAAGC CTACGCCGGC GACAT CATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTTACCGG CATCCCATTC TTCGCGCCCG AACTGTTCCG CAGCGTCCGC 

12 01 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGTT TGCAACAACT 

1251 CGGCGAAGAA GGTGCGGTTC AAGTATTCAA ACCGATGAGC GGCGCGGATT 

1301 TGATTTTGGG TGCGGTCGGC GTGTTGCAGT TTGAAGTCGT AACCTCACGC 

1351 CTCGCCAACG AATACGGCGT GGAAGCCGTG TTCGACAGCG CATCCATCTG 

14 01 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCGGAATTTG 

1451 AAAAAGCCAA CGCAGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 

1501 TACCTCGCCC CCAACCGCGT GAATTTGGGG TTGACGCAAG AACGCTGGCC 

1551 GGACATCGTG TTCCACGAAA CGCGCGAACA TTCGGTCAAA CTCTAA 

This corresponds to the amino acid sequence <SEQ ED 512; ORF 134.ng>: 

gl34 .pep 

1 MSQEILDQVR RRRTFAI ISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVEDI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGTN NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATMRMVGPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELAEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLCOLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

4 51 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGK LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 513>: 

ml 3 4 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCGGG CGCGATTCAG AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACTTCCGA CTGGATGGAA ATCGAGAAGC AGCGCGGCAT 
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201 TTCCGTGGCA TCAAGTGTGA TGCAGTTCGA TTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTAACCG CCGTGGACAG CGCATTAATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG CAAAC CATC A AGCTCTTAAA CGTCTGCCGC CTGCGCGATA 

401 CACCGATTGT TACGTTTATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

451 CTGGAACTTT TGGACGAAGT GGAAAACATT TTAAAAATCC GCTGCGCGCC 

501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 

551 TCCTGAACGA TGAAATTTAT CTCTTTGAAG CTGGCGGCGA ACGCCTGCCG 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCTGAAT TGGAACAACG 

651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

701 CGGCTTCCAA CGAGTTTAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 

751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 

801 CAATTCATTG ATTGACTGGG CGCCCGCGCC GAAACCGCGC GACGCGACCG 

851 TACGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 

951 CGTCTGCTCC GGCAAATTCG AG CGCGGCAT GAAGATGAAA CACCTG CGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TTACCTTCAT GTCGCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACATTATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTCACCGG CATCCCATTC TTCGCACCCG AACTGTTCCG CAGCGTACGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGCT TGCAACAG CT 

1251 CGGCGAAGAA GGCGCGGTGC AGGTGTTCAA ACCGATGAGC GGCGCGGATT 

1301 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

1351 CTCGCCAACG AATACGGCGT AGAAGCCGTG TTCGACAGCG CATCCATCTG 

1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCTGAATTTG 

1451 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 

1501 TACCTCGCCC CCAACCGCGT GAATTTGGGA CTCACGCAAG AACGTTGGCC 

1551 GGACATCGTG TTCCACGAAA CACGCGAACA TTCGGTCAAA CTGTAA 

This corresponds to the amino acid sequence <SEQ ID 514; ORF 134>: 

ml34 -pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWME IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VI DAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVENI LKIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDUKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEWT SR 

451 LANEYGVEAV FDSAS1WSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N, gonorrhoeae 

ORF 1 34 shows 98.7% identity over a 531 aa overlap with a predicted ORF (ORF 134.ng) 
from N. gonorrhoeae: 

ml34/gl34 



10 20 30 40 50 60 

ml34 .pep MSQEILDQVRRRRTFAIISHPDAGKTTLTEKL1.LFSGAI0SAGTVKGKKTGKFATSDWME 

MIIIIIIIIMIIIIIIIIIimilllllMlllllllllllllllllllllllllh 

gl3 4 MSOEILDQVRRRRTFAI ISHPDAGKTTLTEKLLLFSGAIOSAGTVKGKKTGKFATSDWMD 

10 20 30 40 50 €0 



70 80 90 100 110 120 

ml3 4 . pep I EKQRGI S VAS S VMQ FD Y KDHTVNLLDTPGHQD FSEDTYRVLTAVD S ALMV I DAAKGVEA 

IMII Mill 111 II III! IIMIIIIMI1I I II llllll Mil I III III IMIII II 

g 1 3 4 I EKQRGI S VASSVMQ FDY KDHTVNLLDTPGHQD FS EDTYRVLTAVDS ALMV I DAAKGVEA 

70 80 90 100 11G 120 



130 140 150 160 17C . 180 
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ml 3 4 . pep QT I KLLNVCRLRDT P I VTFMNKYDRE VRDS LELLDE VEN I LK I RCAPVTW P I GKG KNF KG 

II II I Ml llll II III I Mil II I I II II I III II I 1 = 1 hi I II I I II II II II II II 
gl34 QT I KLLNVCRLRDT P I VTFMNKYDREVRDS LE LLDEVE D I LQ I RCAPVTW P I GMGKNF KG 

130 140 150 160 170 180 



190 200 210 220 230 . 240 

ml34 . pep V YH I LNDE 1 Y I iFE AGGERL PHE FD 1 1 KG I DNPELEQRFPLE I QQLRDE I E LVQAAS NE FN 

I I I I I I I I I II I I I I I I I I I I II I I I I I h I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl34 VYHILNDEIYLFEAGGERLPHEFDIIKGINNPELEQRFPLEIQQLRDEIELVOAASNEFN 
190 200 210 220 230 240 



250 260 270 280 290 300 

ml34 . pep IiDEFlAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFI FK 

II II II llllll II III I Mil II 1 1 1 1 WWW MINIM hill II llll MINI 

gl34 IX)EFIAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATMRMVGPDEPKFSGFIFK 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml34 . pep IQANMDPKI1RDRIAFLRVCSGKFEUIGMKMKHLRINREIAASSVVTFMSHDRELVEEAYAG 

nilllll MM II llllllllllllll III III II MM II I! II 1 1 II II M IMI II 

gl34 IQ ANMD P KHRDR I AFLRVCSGKFERGMKM KHLR I NRE I AAS S WTFMS HDRE LAE EAY AG 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml34 . pep DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
gl34 DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 
370 380 390 400 410 420 



430 440 450 460 470 480 

ml 3 4 . pep GAVQVFKPMSGADLI LGAVGVLQFE WTSRLANE YGVE AVFDSAS I WS ARWVSCDD KKKL 

II II Mil III I II IIMIMMI I III III II III I Mil II I II I III I II I llllll 

gl34 GAVQVF KPMS GADL I LGAVGVLQFE WTSRLANE YGVE AVFDSAS I WS ARWVSCDD KKKL 

430 440 450 460 470 480 



490 500 510 520 530 

ml 34 . pep AEFEKANAGNIAIDAGGNLAYTAPNRVNIjGLTOERWPDIVFHETREHSVKLX 

I ! 1 1 M 1 1 1 1 1 1 1 1 IMIIMIMIII Ml III II MIIIMI II I III II 

gl34 AEFEKANAGNIAIDAGGNLAYI^PNRVNIiGLTQERWPDIVFHETREHSVKLX 
490 500 510 520 530 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 515>: 
al34 . seq 

1 ATGTCCCAAG aaatcctcga ccaagtgcgc cgccgccgca cgtttgccat 

51 catctcccac cctgacgcag gtaaaaccac gttgactgaa aaactcttgc 

101 tgttttcagg tgcgattcaa agcgcgggta cggtaaaagg caagaaaacc 

151 ggcaaattcg ccacctccga ctggatggac atcgagaagc agcgcggcat 

201 ttccgtggca tcaagcgtga tgcagttcga ctataaagac cacaccgtca 

251 accttttgga cacgccggga caccaagact tctccgaaga cacctaccgc 

301 gttttgaccg ccgtcgatag tgccttgatg gtcatcgacg cggcaaaagg 

351 cgtggaagcg caaaccatca aactcttgaa cgtctgccgc ctgcgcaata 

4 01 cgccgattgt tacgttcatg aacaaatacg accgcgaagt gcgcgattcc 

4 51 ctggaattgc tggacgaagt ggaaaacatc ctgcaaatcc gctgcgcgcc 

501 cgtaacctgg ccgatcggca tgggcaaaaa cttcaaaggc gtgtaccaca 

551 tcctgaacga cgaaatctat ctctttgaag cgggcggcga acgcttgccg 

601 cacgagttcg acatcatcaa aggcatcgat aatcccgaat tggaacaacg 

651 ctttccgtta gaaatacagc agttgcgcga cgaaatcgaa ttggtgcagg 

701 cggcttccaa cgagttcaat ctcgacgaat tcctcgccgg cgaactcacg 

7 51 cccgtattct tcggctctgc gattaacaac ttcggtattc aggaaatcct 

801 caattcattg attgaatggg cgcccgcgcc gaaac^acgc gatgcgaccg 

851 tgcgtatggt cgagccggac gagccgaagt tttccggatt tatcttcaaa 

901 atccaagcca atatggaccc gaaacaccgc gaccgtattg ccttcttgcg 
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951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 



CGTCTGCTCC 
TCAACCGCGA 
CGCGAGCTGG 
CCACGGCAAC 
CGTTTACCGG 
ATCAAAAACC 
TGGCGAAGAA 
TGATTTTGGG 
CTTGCCAACG 
GTCGGCGCGC 
AAAAAGCCAA 
TACCTCGCCC 
GGACATCGTG 



GGCAAATTCG 
AATCGCCGCC 
TTGAAGAAGC 
ATCCAAATCG 
CATCCCATTC 
CGCTGAAAAT 
GGTGCGGTGC 
CGCGGTCGGC 
AATACGGCGT 
TGGGTATCGT 
CGCGGGCAAC 
CTAACCGCGT 
TTCCACGAAA 



AGCGCGGCAT 
TCCAGCGTGG 
CTACGCCGGC 
GCGACAGCTT 
TTCGCGCCCG 
CAAGCAACTG 
AGGTGTTCAA 
GTGTTGCAGT 
GGAAGCCGTG 
GCGACGACAA 
CTCGCCATCG 
GAATCTGGGA 
CGCGCGAGCA 



GAAAATGAAA 
TAACCTTCAT 
GACATTATCG 
CTCCGAAGGC 
AACTGTTCCG 
CAAAAAGGTT 
ACCAATGAGC 
TTGAAGTCGT 
TTCGACAACG 
GAAAAAACTG 
ACGCGGGCGG 
CTCACGCAAG 
TTCGGTCAAA 



CACCTGCGTA 
GTCCCACGAC 
GTATCCCAAA 
GAACAACTGA 
CAGCGTTCGC 
TGCAACAGCT 
GGCGCGGATT 
TACCTCGCGC 
CATC CAT CTG 
GCGGAATTTG 
CAACCTCGCC 
AACGCTGGCC 
CTTTAA 



This corresponds to the amino acid sequence <SEQ ID 516; ORF 134.a>: 



al34 -pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRNTPIVTFM 

151 LELLDEVENI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY 

201 HEFDI IKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN 

251 PVFFGSAINN FGIQEILNSL IEWAPAPKPR DATVRMVEPD 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLTFTGIPF 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG 

4 51 LANEYGVEAV FDNASIWSAR WVSCDDKKKL AEFEKANAGN 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 



SAGTVKGKKT 
HQDFSEDTYR 
NKYDREVRDS 
LFEAGGERLP 
LDEFLAGELT 
EPKFSGFIFK 
SSWTFMSHD 
FAPELFRSVR 
VLQFEVVTSR 



ml 34/al34 98.9% identity in 53 1 aa overlap 

10 20 



30 



40 



LAIDAGGNLA 



50 



60 



ml34 .pep MSQEILDQVRRRR7FAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 
I I I I I I I I I I I I I i I I I I I I I I I I I I I I I i I I I I I I I I I I i I I II II I I I I I I I I I I I I : 
al34 MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml34.pep IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

I II I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M II I I I II I I 

al34 IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 34. pep QT I KL LN VCR LR DT P I VT FMNK Y DREVR DS LE LLDEVEN ILK IRC A PVTW PIGMGKNFKG 

I I I I I I I I II I I: i I I I I I II I I I I I I II I I I I I I I I I I I I : I I I I I II I I II I I I I I I I 
al34 QTIKLLNVCRLRNTPIVTFMNKYDREVRDSLELLDEVENILQIRCAPVTWPIGMGKNFKG 

130 140 150 160 170 180 

ISO 200 210 220 230 240 

ml34.pep VYH I LN DE I YLFEAGGERLPHE FDI IKG I DN PELEQRFPLE IQQLRDEIELVQAASNE FN 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
al34 VYH I LN DEI YLFEAGGERLPHE FDI IKG I DN PELEQRFPLE IQQLRDEIELVQAASNE FN 

ISO 200 210 220 .230 240 

250 260 270 280 290 300 

ml 34 .pep LDEFLAGELTPVFFG3AINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 
I I I I II II I I I I I I I II I I I I I I II I I I I I I : I II I II I I I I I I I I I I I I I I I I I I I I I I 
al34 LDEFLAGELTPVFFGSAINNFGIQEILNSLIEWAPAPKPRDATVRMVEPDEPKFSGFIFK 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 34 .pep IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I 
al34 IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSVVTFMSHDRELVEEAYAG 

310 320 330 340 350 360 
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370 380 390 400 410 420 

ml34 . pep DI IGI PNHGNIQIGDS FSEGEQLAFTGI PFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

I ! I I I I I I : I I I M I I I I I I I I I : I 1 I i I ! M I I I I I I I I I I I I I I I I I I I I I I I I I M I 
al34 DIIGI PNHGN IOIG DS FSEGEQLT FTG I PFFAPELFRS VRIKNPLKI KQLQKGLQQLGEE 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 34 . pep GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 
1 I II I I I I i I I I 1 I I I I I I I I I I I I I I I I I ! I I I I I I M I I I : I I I I I I I I I I I I I I I I I 
al34 GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEY3VEAVFDNASIWSARWVSCDDKKKL 

430 440 450 460 470 480 

490 500 510 520 530 

ml34 . pep AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I 
al34 AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
490 500 510 520 530 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 517>: 
gl35.seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCG 

51 TTCGGAcgGC AGCCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCTG CAGGGTTCGG CGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

, 251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

301 CTGCTCAGCC GTGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCGATTCCC ATCATCAATG 

401 AAAACGACAC GGTTTCGGTT GAGGAGTTGA AAATCGGCGA CAACGACACA 

451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 

501 GACCGACATA GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 

651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 

701 CCTCACTCAA ACC CGATTCA TTGGCCGAAG CCGCCGAACA TCAGGCGGAC 

751 GGCTCGTTTT TCGTcccCcg tgCCAAAGGT TTGCGGACAC AGAAGCAATG 

801 GctggCGTTC TATTCcgaaa gcggGGgcag cgttTAtgtg gacgaaagtg 

851 cggaacacgc tTtgtccgaa caagggaaag cctgCTGA 
This corresponds to the amino acid sequence <SEQ ID 518; ORF 135.ng>: 

gl35 pep 

1 MKYKRIVFKV GTSSITRSDG SLSRGKIQTI TRQIAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGOG LLMEEYTANL SSDGIVSAQI 

101 LLS RADFADK RRYQNAGGAL SVLLQRRAIP IINENDTVSV E EXtKI GDNDT 

151 LSAQVAAMIQ ADLLVLLT DI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMIiTKIKAA TIAAESGVPV YICSSLKPDS LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESGGSVYV DESAEHALSE QGKAC* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 5 19>: 

ml35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCTGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCGCAAATC 

301 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC ATCATCAATG 

401 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 



WO 99/57280 



PCT/US99/09346 



383 



AATCCCGATG 
CATCGAAATG 
TGACCAAAAT 
TATATCTGTT 
TCAGGCGGAC 
AGAAGCAATG 
GACGAAGGTG 
GTCGGGCATT 
CCGTGTACAG 
TTCGGCTCTG 
CGTGTTCATC 
TGCTTCTGAC 



This corresponds to the amino acid sequence <SEQ ID 520; ORF 135>: 

ml35.pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TCQLAALHHA GHE LVLVSSG 

51 AVAAG FGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI 

201 AGGSGSANGT GGMLTKI KAA TIAAESGVPV YICSSLKPDA 

2 51 GSFFVPRAKG LRTQKQWLAF YSESRGSVYV DEGAEHALSE 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL 

351 HRDDWISITP EIRLLLTEF* 



501 


GACCGACATA 


GACGGTCTTT 


551 


CCGTACGGCT 


GGACAAAATC 


601 


GCGGGCGGCT 


CGGGTTCGGC 


651 


CAAAGCGGCA 


ACCATCGCCG 


701 


CCTCGCTCAA 


ACCCGATGCA 


751 


GGCTCGTTTT 


TCGTCCCCCG 


801 


GCTGGCGTTC 


TATTCCGAAA 


651 


CGGAACACGC 


TTTGTCCGAA 


901 


GCCGGAATCG 


AAGGGCATTT 


951 


CAAGGCAACC 


AAACAGCCCC 


1001 


CCGCCGCCGA 


AGACCTGCTC 


1051 


CATCGGGACG 


ACTGGATTTC 


1101 


CGAATTTTAG 





SSDGIVSAQI 
EELKIGDNDT 
EHINHEIIEM 
LAEAAEHQAD 
QGKSLLMSGI 
KSRKAKGVFI 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 135 shows 97.6% identity over a 294 aa overlap with a predicted ORF (ORF 135.ng) 
from N. gonorrhoeae: 
ml35/gl35 

10 20 30 40 50 60 

ml 3 5 . pep M KY KRI V FKVGTS S I TH SDGS LS RG K IQT I TCQLAALHHAGHE LVLVS SGAVAAGFGALG 

I M 1 1 1 Ml II III I hi II Mil Mill II lillllllllllMIIIIIIIIIIIIII 

gl35 MKYKRI VFTCVGTSS I TRSDGSLSRGKIQTITRQLAALHHAGHELVLVS SGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml35 . pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYONAGGAL 

I III I MINI I II I II IMI 111 1111)11 II I III Ml III I I II III I III III I I I 
gl35 FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 3 5 . pep SVLLQRRAVP 1 1 NENDTVSVEELKI GDNDTLS AQVAAMI Q ADLLVLLTD I DGLYTGNPNS 

I I I I I I I I « I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl 3 5 S VLLQRRAI P 1 1 NENDTVSVEELKI GDNDTLS AQVAAMI QADLLVLLTD I DGLYTGNPNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml35.pep NPDAVRLDKI EHINHE 1 1 EMAGGSGSANGTGGMLTKIKAATIAAESGVPV YICSSLKPDA 

IIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMh 

gl35 NPDAVRLDKI EHINHEI IEMAGGSGSANGTGGMLTKI KAATI AAESGVPVYI CSSLKPDS 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 3 5 . pep LAEAAEHQADGSFFVPRAKGLRTQKQWIAFYSESRGSVYTO 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII II I II MMI III IM Ih 
gl 3 5 LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESGGSVYVDESAEHALSEQGKACX 
250 260 270 280 290 



310 320 330 340 350 360 
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ral35.pep AGIEGHFSRMDTVWYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 521 >: 

al35. seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCA3GGG CTGCTGATGG 

251. AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

301 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC ATCATCAATG 

4 01 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 

501 GACCGACATA GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GG AC AAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACA GGCGGTATGC TGACTAAAAT 

651 CAAAGCGGCG ACGATTGCGA CCGAGTCCGG CGTACCGGTC TATATCTGTT 

701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CGGCAGATAA TCAGGCGGAC 

751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 

801 GCTGGCGTTC TATTCCGAAA GCAGGGGCGG CGTTTATGTG GACGAAGGTG 

851 CGGAACACGC TTTGTCCGAA CAGGGAAAAA GCCTGCTGAT GTCGGGCATT 

901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 

951 CAAGGCAACC AAACAGCCTT TGGGCAAAGG GCGAGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATTGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC GAAATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 522; ORF 135.a>: 

al35.pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TRQLAALHHA GHELVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIATESGVPV YICSSLKPDA LAEAADNQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGGVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KLRKAKGVFI 

351 HRDDWISITP EIRLLLTEF* 

ml35/al35 98.4% identity in 369 aa overlap 

10 20 30 40 50 60 

ml35 . pep MKYKRI VFKVGTS S ITH SDGS LS RGKI QT I TCQLAALHHAGHE LVLV S S GAVAAGFGALG 

1 I I I I ) I I I I i I I I I I I i I I I I I ! 1 I I I I I I I I I I I I I i I I I I I I I I 1 I I I I II I I I I I 
al35 MKYKRIVFK\ r GTSSITHSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml35 . pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
I II I I I I I I I I I I I I I I I It ! I II I 1 I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I 
a 1 3 5 FKKR PVK I ADKQAS AAVGQG LLMEE YTANLS S DG I VSAQI LLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ir.135.pep SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

I I I I ) 1 1 1 1 I 1 1 I 1 1 I I I i ! 1 1 I I I I I I I I I I 1 1 I 1 1 I I I I I I I I I I II I I I I I I 1 1 I I I 
al35 SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

irl35.pep NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIAAESGVPVYICSSLKPDA 
I I I I I I I If II I I II I I I I I I I I I I I I I I I I I I I I II I I II I I : I I I I I I I I I I I M | | | 
al35 NPDAVRLDKIEHINHEI IEMAGGSGSANGTGGMLTKIKAATIATESGVPVYICSSLKPDA 

190 200 210 220 230 240 
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250 260 270 28C 290 300 

ml35.pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEPiALSEQGKSLLMSGI 
I I I I I : : I I I I I I I I I I I I I I I II II I I I I I I I I I I : I II I II I I I I I I I I I I I I I I I I I 
al35 LAEAADNQADGSFFVPRAKGLRTQKQWLAFYSESRGGVYVDEGAEHALSEQGKSLLMSGI 

250 260 270 28C 290 300 

310 320 330 340 350 360 

ml35.pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 
I I I I I ! I I ! I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I ! I I I I I I I I I I I I I II 
al35 AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKLRKAKGVFIHRDDWISITP 

310 320 330 340 350 360 

370 

ml 3 5 . pep E IRLLLTEFX 

I I I I I I I I I I 
al35 E IRLLLTEFX 

370 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 523>: 

gl36 . seq 

1 ATGGAAATCC GGTTTCAGAC AGCATTTTTA CGTTTGGTTC AGatgaAAAC 

51 AAACGCTtca aTTCTtaccg caACACGCCT TGTATTTCCT GccgCTGCCG 

101 CACGGACAGG GATCGTTCCT GCCGgtTTTT TCCCCTTCCC TGCGGACGGT 

151 TTGCGGTTTG TTGATGACCG CCTGCCAGTA GCGGTAGATG TCtgccagcg 

201 cgTAAGGCag tTCGGAcgca agttccgcca gctcgccttc ggTGAATTGC 

251 AGgcggataa cgccgtttTC CTCTTCGTCg taaatgccgc ccaccgccat 

301 cacgGGGTAA AACAGCTCTT CAAACGCTTC ATCATCGGCG GCTTCAAACC 

351 AATCGGTCGG CACAATGTCC AAACCGTAAA GATAGGCGTT GCACCAAGTG 

4 01 TAAAAATCGC TGCCGCCCTC GCCGTCGTCG TAGAGCCACA AATCGGGCAG 

451 CTTTTTATCC GACATCGCGG CGGTTGTTTC CATCGCCATT GCCAAAACCA 

501 GCCGTTCGAT TTCGGAACGT TCGGCGGCGG TAAATTGCGA TTCGTCGCCC 

551 AACACTTCGG GCAGCCAGTC GAGCGGTGCC AATTTGTCCG GCCCGCTCAA 

601 CAGCGCCGTC ATAAAACCTT GAACCTCGTC GCAACGCATC GTGTTGCCTT 

651 GTTCGCTTTT GGCATCCAAT AA 

This corresponds to the amino acid sequence <SEQ ID 524; ORF 136.ng>: 

gl36 .pep 

1 MEIRFQTAFL RLVQMXTNAS ILTATRLVFP AAAARTGIVP AGFFPFPADG 

51 LRFVDDRLPV AVDVCQRVRQ FGRKFRQLAF GELQADNAVF LFWNAAHCH 

101 HGVKQLFKRF IIGGFXPIGR HNVQTV KIGV APSVKIAAAL AWV EPQIGQ 

151 LFIRHRGGCF HRHCONQPFD FGTFGGGKLR FVAQHFGQPV ERCQFVRPAQ 

201 QRRHKTLNLV ATHRVALFAF GIQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 525>; 
ml 36 .seq 

1 ATGGAAACAA ACG CTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGTTTGTT GATGACTGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGCG CCGTTTTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

451 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 

651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CAAATGGGTT 

701 TTGCGCCCTA TTATCGCCGC AATGCCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 526; ORF 136>: 
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ml36.pep 

1 METNASILTA TRLVF3AAAA RTGIVPACFF AFPADGLRFV DDCLPVAVDI 

51 RQCIROLGFQ FRQLAFCELQ TDSAVFLFW NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TV KISIAPCV KIAAAVFV FI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAOPPFGCFG KFSGIHHFPF QMGFAPYYRR NAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 1 36 shows 85.6% identity over a 209 aa overlap with a predicted ORF (ORF 136.ng) 
from N. gonorrhoeae: 

ml36/gl36 



10 20 30 40 

ml36 . pep METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRE^VDDCLPV 

hlllllllllllll Mllillllll II lllllllllll III 

gl36 MEIRFQTAFLRLVQMKTNASILTATRLVFPAAAARTGIVPAGFFPFPADGLRFVDDRLPV 
10 20 30 40 50 60 



50 60 70 80 90 100 

ml36 .pep AVDIRQCIROLGFQFRQIiAFCELQTDSAVFLFWNTAQCHDGIKQLFKRFIIDGFKPIGR 

III: I MM -111111 I lh hi II III II 'hi I hi III II II I 1 1 1 1 i 1 1 

g 1 3 6 AVDVCQRVRQFGRKFRQLAFGELQADNAVFLFWNAAHCHHGVKQLFKRFI I GGFKPIGR 

70 80 90 100 110 120 



110 120 130 140 150 160 

m!36 . pep HNIQTVKISIAPCVKIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLR* 

||:|||||::|| lllllh I = = I I I I I = I I II I I I I I I I I II t I I I I I I I I I I I I I I 
gl36 HNVQTVKI GVAPS VKI AAALAVWE PQI GQLFI RHRGGCFHRHCQNQP FDFGTFGGGKLR 

130 140 150 160 170 180 



170 180 190 200 210 220 

ml36 . pep FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIH 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
gl36 FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQX 
190 200 210 220 



230 240 
ml36 , pep HFPFQMGFAPYYRRNAVX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 527>: 

al36.seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGCTTGTT GATGACCGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGTG CCGTTGTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

4 51 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 

651 ATGCTTCGGT AAATT7TCGG GAATCCATCA TTTTCCTTTT CCAATGGGTT 

701 TTGCGCCCTA TTATAGTGGA TTAAATTTAA ATCAGGACAA GGCGACGAAG 

7 51 CCGCAGACAG TACAAATAGT ACGGCAAGGC GAGGCAACGC CGTACTGGTT 

801 TAAATTTAAT CCACTATATC GCCGCAATGC CGTCTGA 



This corresponds to the amino acid sequence <SEQ ID 528; ORF 136.a>: 

a!36.pep 
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1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRLV DDRLPVAVDI 

51 RQCIRQLGFO FRO LAFCELQ TDSAWLFW NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVK ISIAPCV KIAAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRII KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF PMGFAPYYSG LNLNQDKATK 

251 PQTVQIVRQG EATPYWFKFN PLYRRNAV* 



ml36/al36 98.3% identity in 238 aa overlap 

10 20 30 40 50 60 

ml 3 6 . pep METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPVAVDIRQCIRQLGFQ 
I I I I I I I M I I I I I I I I II I II I I I I I I I I I M I I I I I : I I I I I I I I I I I I I I I I I I I I 
al36 METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRLVDDRLPVAVDIRQCIRQLGFQ 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml36.pep FRQLAFCELQTD5AVFLFWNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 
I I I 1 I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
al36 FRQLAFCELQTDSAWLFWNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml36.pep KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 
I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I If I I I 
a 13 6 KTAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNOPFDFGTFGGGKLRFVAQHFGQPVERCQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml36 . pep FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFQMGFAPYYRR 

I I I I I I 1 I I II I I I I I I I I I I I I II I I I I I Mi I I I I II I I I I I I I I I I I ! I I I I I I 
al36 FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFPMGFAPYYSG 

190 200 210 220 230 240 



ml36.pep NAVX 

al36 LNLN Q DKATK PQT VQ I VRQGE AT P YW FK FN P L Y RRN AVX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 529>: 

gl37 . seq 

1 ATGATTATCC ATCACcaaTT CGATCCCGTC CTCATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCT TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TGATTTTGGG 

2 01 CGGACGCTTG GGCTATGTCC TGTTTTACAA ATTCTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCAGCC GCAAGCACGG 

351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCGCCGCTC GTTCCGCTGG 

4 01 GTCTCGCTTC GGGACGTATC GG CAACTTT A TCAACGGCGA ACTT TGGGGA 

• 451 CGCATTACCG ACATTAACGC ATTTTGGGCA ATGGGCTTCC CGCAAGCGCA 

501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCCCTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TTTCCAAAAA 

651 ACCGCGCCCG ACCGGGCAGA CTGCCGCGCT TTTTCTCGGC GGCTACGGCG 

701 TGTTCCGCTT TATTGCCGAA TTTGCGCGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGG CATG AAA AAACAGCACT 

851 GA 

This corresponds to the amino acid sequence <SEQ ID 530; ORF 137.ng>: 

gl37 .pep 
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1 MIIHHQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MSFHG 

101 GFLGWIAIW LF SRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 R1TDINAFWA MGFPQAHYED AEAAAHNPLW AEWLQQYGML PRHPSQ LYQF 

201 ALEGICLFAV VWL FSKKPRP TGQTAALFLG GYGVFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 53 1>: 

ml37 . seq 

1 ATGATTACCC ATCCCCAATT CGATCCCGTC CTTATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCC TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TAATTTTGGG 

201 CGGGCGTTTG GGTTACGTCC TGTTTTACAA GTTTTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATACGG TTGTTCGGCC GCAAACACGG 

351 CATCGGCTTC CTCAAACTGA TGGATACGGT CGCACCGCTC GTTCCGCTGG 

4 01 GTCTCGCTTC GGGACGTATC GGCAACTTCA TCAACGGCGA ACTTTGGGGA 

4 51 CGCGTTACCG ACATCAACGC ATTTTGGGCA ATGGGCTTCC CGCAGGCGCG 

501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCACTTGAAG GCATCTGCCT GTTCACCGTC ATTTGGCTGT TCTCTAAAAA 

651 ACAGCGGTCG ACCGGACAAG TCGCCTCGCT CTTCCTCGGC GGCTACGGCA 

701 TATTCCGCTT CATTGCCGAA TTCGCACGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 

851 GA 

This corresponds to the amino acid sequence <SEQ ID 532; ORF 137>: 
ml37.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MSFHG 

101 GFLGWIAIR LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTD INAFWA MGFPQARYED AEAAAHNPLW AEWLQQYGML PRHPS QLYQF 

201 ALEGICLFTV IWL FSKKQRS TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFGM K KQH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 137 shows 95.4% identity over a 283 aa overlap with a predicted ORF (ORF 137.ng) 
from N. gonorrhoeae: 
ml37/gl37 

10 20 30 40 50 60 

ml37 . pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

II I II I I I t I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g 1 3 7 MI IHHQFDPVL I S IGPLAVRWYALS YI LGFI LFTFLGRRRI AQGLS VFTKESLDDFLTWG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml37.pep I LGVI LGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIRLFGRKHGIGF 

MM lit III III MM Mil III II I II I II Ml I Nil II Mil III IMIIIMII 

g 13 7 I LGVI LGGRLGYVLF YKFSDYLAHPLD I FKVWEGGMS FHGG FLG W I A I WL FSRKHG IGF 

70 80 90 " 100 110 120 

130 140 150 160 170 180 

ml3 7 . pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 

I MIIIMM III Ml M lllllllllll IIMIMIIIMI IIIMII III II Mill I 

gl37 LKLMDTVAPLVPLGLASGRIGNFINGELWGRITDINAFWAMGFPQAHYEDAEAAAHNPLW 
130 140 150 160 170 180 



ml37 . pep 



190 200 210 220 230 240 

AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
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gl37 



I I I I I I I I I I I I I II I I I I I I I I I I I I I : I * I I I I I I I ll|:hlllMlhllll|| 
AEWLQQYGMLPRHPSQLYQFALEGICLFAVVWLFSKKPRPTGQTAALFLGGYGVFRFIAE 
190 200 210 220 230 240 



250 260 270 280 

ml 3 7 . pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
gl3 7 FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
250 260 270 280 



The following partial DNA sequence was 

al37.seq 

1 ATGATTACCC ATCCCCAATT 

51 TGCCGTCCGC TGGTATGCCC 

101 CCTTTCTCGG CAGAAGGCGC 

151 GAATCGCTCG ACGACTT CCT 

201 CGGGCGTTTG GGTTACGTCC 

251 ATCCGCTTGA TATTTTCAAG 

301 GGCTTTTTGG GTGTAGTTAT 

351 CATCGGCTTC CTCAAACTGA 

401 GTCTCGCTTC GGGACGTATC 

4 51 CGCGTTACCG ACATCAACGC 

501 TTACGAAGAC CTCGAAGCCG 

551 TGCAACAATA CGGTATGCTG 

601 GCACTTGAAG GCATCTGCCT 

651 ACAGCGGCCG ACCGGACAAG 

701 TATTCCGCTT CATTGCCGAA 

751 CTGCTGACCT TGGGGCTGTC 

801 TGTTTTGGGT ATCGTCGGCT 

851 GA 



identified in N. meningitidis <SEQ ID 533>: 



CGACCCCGTC 
TAAGCTACAT 
ATCGCGCAAG 
GACATGGGGC 
TGTTTTACAA 
GTATGGGAAG 
TGCCATATGG 
TGGACACGGT 
GGC AACTTCA 
ATTTTGGGCA 
CCGCGCACAA 
CCGCGTCATC 
GTTCGCCGTC 
TCGCCTCACT 
TTTGCACGCC 
GATGGGGCAA 
TTGTCCGGTT 



CTTATCAGTA 
CCTCGGATTT 
GCTTGTCCGT 
ATTTTGGGCG 
GTTTTCCGAC 
GCGGAATGTC 
TTGTTCGGTC 
CGCACCGCTC 
TCAACGGCGA 
ATGGGCTTCC 
TCCGCTTTGG 
CCTCGCAGCT 
GTTTGGCTGT 
CTTCCTCGGC 
AACCCGACGA 
TGGTTGAGCG 
CGGCATGAAA 



TCGGCCCGCT 
ATTCTTTTTA 
TTTTACCAAA 
TAATTTTGGG 
TACCTCGCCC 
GTTCCACGGC 
GCAAACACGG 
GTTCCACTGG 
ACTTTGGGGA 
CGCAGGCGCG 
GCAGAATGGC 
TTATCAGTTT 
TCTCTAAAAA 
GGCTACGGCA 
CTATCTCGGG 
TCCCGATGAT 
AAACAGCACT 



This corresponds to the amino acid sequence <SEQ ID 534; ORF 137.a>: 

al37.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEG GMSFHG 

101 GFLGWIAIW LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED LEAAAHNPLW AEWLQQYGML PRHPS QLYQF 

201 alegiclfav VWL FSKKQRP tgqvaslflg GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 



ml 37/al 37 98.2% identity in 283 aa overlap 

10 20 



30 



40 



50 



60 



ml37 .pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II i I I I I I I I I I I I I I I I I I I I I I I I 
al37 MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml37.pep ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIRLFGRKHGIGF 
I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
al37 ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIWLFGRKHGIGF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 37 . pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 
I I M I I I I I I I 1 I I II I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I II I I I I I 
a!37 IJCI^DTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDLEAAAHNPLW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 37 . pep AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
I I I I I 1 I I I I I I I I i I I I I I I I I I I I 1 I : I : I I I I I I I I I I I I I I I I I I I I I I ! II I I I 
al37 AEWLQQYGML PRHPSQLYQFALEGICLFAWWLFSKKQRPTGQVASLFLGGYGIFRFIAE 
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190 200 210 220 230 240 

250 260 270 280 

ml37 . pep FAROPDDYLGLLTLGLSMGQWLSVPMIVLGrVGFVRFGMKKQHX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I It I I 
al37 FARQ P DD Y LG L LT LG LS MGQW LS V PM I VLG I VG FVR FGMKKQHX 

250 260 270 280 



The following partial DNA sequence was identified inN. gonoi-rhoeae <SEQ ID 535>: 
g!38.seq 

1 ATGGAGTTTG AAAACATTAT TTCCGCCGCc gaCAAGGCGC GTATCCTTGC 

51 CGAAGCACTG CCTTACAtCC gccgGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AGTATGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GCGAATTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGAC GATGGATATT 

301 GTCGAAATGG TATTGGGCGG GCACGTCAAC AAGGAAATCG TGT CGATGAT 

351 TAACACATAT GGAGGGCACG CGGTCGGCGT GAGCGGGCGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGGTCGATA CGCCCGAACA GAATAGCGTG 

451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGCGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT GGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAAtatcgc 

651 cgGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC acgCCGAAAC 

701 GGATTGATGG GCTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCcgccgtc aACGGTGTGA AAG CCACGCA 

801 CATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGGTCGATG ATTTTAGGCA GAGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 536; ORF 138.ng>: 

gl38 .pep 

1 MEFENIISAA DKARILAEAL PYIRRFSGSV AVI KYGGNAM TE PAL KEG FA 

51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKETMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LVDTPEQNSV 

151 DIGQVGTVES IDTGLVKGLI BRGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDGLIA DGTLYGGMLP 

251 KIASAVEAAV NGVKATHIID GRLP NALLLE IFTDAGIGSM I L GRGEDA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 537>: 

ml38 .seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGTGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 538; ORF 138>: 
ml38 .pep 

1 MESENI ISAA DKARILAEAL PYIRRFSGSV AVI KYGGNAM TE PAL KEG FA 
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51 R DWLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRI DELIA DGTLYGGMLP 

251 KIASAVEAAV NGVKATHIID GRLP NALLLE IFTDAGIGSM IL GGGEDA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 138 shows 98.0% identity over a 298 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 

ml38/gI38 

10 20 30 40 50 60 

MESENI I S AADKAR I LAEALPYIRRFSGSVAVIKYGGNAMTE PALKEGFARDWLLKLVG 

II IMIMMIIIMMMMIIMIIMMIMMIIMMIIMIIMIMMMII 

ME FEN I IS AADKAR I LAEALPY I RRFSGSVAV I KYGGNAMTE PALKEGFARDWLLKLVG 
10 20 30 40 50 60 



ral38 .pep 
gl38 



70 80 90 100 110 120 

ml38 .pep IHPVIVHGGGPQINAMLEKVGK^GEFVQQMRVTDKEAMDIVEMVI^HVNKSIVSMINTY 

II II III! IN II IMNIIMIIill III II! Mhlillllill Mil M I II IN II 

gl38 IHPVIVl^GGGPQINAMLEKVGKKGEFVQGMRVTDKETl^IVEMVLGGHW KEIVSMINTY 

70 80 90 100 110 120 



130 140 150 160 170 180 

ral3 8 . pep GGHAVGVSGRDDH FI KAKKLL I DTP EQNGVD I GQVGTVES I DTGLVKGLI ERGC I P WAP 

INI III lillll llll lllhlllll hllll II III III Mill llllll II llllll 

gl3 8 GGHAVGVSGRDDH FI KAKKLLVDTPEQNSVDI GQVGTVES IDTGLVKGLI ERGCIPWAP 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml3 8 . pep VGVGE KGE AFN I NADLVAG KLAE ELNAE KLLMMTN I AGVMD KTGNLLTKLT P KRI DE L I A 

MM II M Mill IMIM I Mill III III IMMM MMM IIMIMM III 

gl38 VGVGE KGEAFNI NADLVAGKLAEELNAE KLLMMTN I AGVMDKTGNLLTKLT P KR I DGL I A 

190 200 210 220 230 240 



250 260 270 280 290 299 

ml3 8 . pep DGTLYGGMLPKI ASAVEAAVNGVKATHI I DGRLPN ALLL E I FTDAGI GSM I LGGGEDAX 

1 1 1 M 1 1 1 1 II M 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M II I II 1 1 1 1 Mill 

g 13 8 DGTLYGGMLPKI ASAVEAAVNGVKATHI I DGRLPNALLLE I FTDAGI GSMI LGRGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 539>: 

al38.seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATT CATC CCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGCGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGGTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 
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This corresponds ro the amino acid sequence <SEQ ID 540; ORF 138. a>: 

a!38 .pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGN LLTKL TPKRIDELIA DG TLYGGMLP 

251 K I AS AVE AAV NGVKATHIID GRVPNALLLE IFTDAGIGSM ILGGGEDA* 

m 1 38/al 3 8 99.7% identity in 298 aa overlap 

10 20 30 40 50 60 

ml 3 8 . pep MESENI I SAADKARILAEALPYIRRFSGS VAVIKYGGNAMTEPALKEGFARDVVLLKLVG 

I I I II I I II I I 1 1 II I I I I M I I I I 1 1 I I I I I I I I I I I I I I 1 1 I | | | | | | | | | | | | | | | | 
al38 MESENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 3 8. pep IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKE IVSMINTY 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I || | 
a!38 IHPVIVHGGG PQINAMLEKVGKKGEEVQGMRVTDKEAMUI VEMVLGGHVN KEIVSMINTY 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 3 8. pep GGHAVGVSGR DDHFIKAKKLLIDTPEQNGVDIGQVGTVES I DTGLVKGLIERGC I PWAP 

I I 1 1 1 1 1 1 1 1 i M I IN 1 1 I I I 1 1 I 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 I 1 1 I 1 1 I 1 1 1 1 1 | 1 1 1 1 

a 1 3 8 GGHAVGVSGRDDHFIKAKKLLI DTPEQNGV DIGQVGTVES I DTGLVKGLIERGC I PWAP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml38 .pep VG VGEKGEAFN I N ADL VAGKLAEELNAE KLLMMTN I AG VM DKTGNLLTKLT PKR I DEL IA 

I I I I I 1 1 I I I I I I I 1 1 I 1 1 M M II I I 1 1 1 I I II I I I I I I I I I I M 1 1 II I I I 1 1 I I II I 
al38 VG VGE KGE AFN I N ADL VAG KLAEE LN AE KL LMMTN I AG VMDKTGNLLT KLTPKRIDELIA 

190 200 210 220 230 240 



250 260 270 280 290 299 

ml 3 8. pep DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGGGEDAX 
I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I II I : I I I I I I I I I I I I M I I I I I I I I I I I I 
al38 DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRVPNALLLEIFTDAGIGSMILGGGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in Kgonorrhoeae <SEQ ID 541>: 

gl3 9.seq 

1 ATGCGAACCA CCTCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAggc ggcggcggag 

101 gcGGCACTTC TGCTCCCGAC TTTAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACGATAGC GGAATCAGCA GCAGTATCTT ACGCCGGTAT 

201 AAAAAACGAA ATGTGCAAAG ACAGAAG CAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAAAGCCCC CCGAATCTGC 

301 ATACCGGAGA CTTTTCAAAC CCAAATGACC AATATTAAGA ATATGATCAA 

3 51 CCTCAAACCT GCAATTGAAG CAGGCTATAC AGGACGCGGG GTAGAGGTAG 

401 GTATCGTCGA TACAGGCGAA TCCGTCGGCA GCATATCCTT TCCCGAACTG 

451 TATGGCAGAA AAGAACACGG CTATAACGAA AATTACAAAA ACAAATTACA 

501 AAAACTATAC GGCGTATATG CGGAAGGAAG CGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 542; ORF 138.ng>: 

gl39 .pep 

1 MRTTSTFPTK TFKPAAMAI A VATTLSACLG GGGGGTSAPD FNAGGTG I GS 

51 NSRATIAESA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKIKAPRIC 

101 IPETFQTOMT NIKNMINLKP AI EAG YTGRG VEVGIVDTGE SVGSISFPEL 

151 YGRKEHGYNE NYKNKLQKLY GVYAEGSA* 
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The following partial DNA sequence was identified in N.meningiiidis <SEQ ID 543>: 
ml3 9. seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGACTGC CAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGTACCGG TATCGGCAGC 

151 AACAGCAGAG CAACAACAGC GAAATCAGCA GCAGTATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTCCA AACCCAAATG ACGCATtACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTACG AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 544; ORF 138>: 

ml3 9.pep 

1 MRTTPTFPTK TFKPTAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETFQTQM THYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYEKLYGVY AEGSA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 92.2% identity over a 179 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 
ml39/gl39 

10 20 30 40 50 60 

MRTT PT FPT KTFKP TAMALAVATTLS ACLGGGGGGT S A P D FNAGGT G I GSNSRATTAKS A 

1 1 1 1 I II MM Ihll M IMM II 1 1 Ml III Ml III! M Mill II I INI Ml 

MRTTST F PTKT FKP AAMAIiAVATTLS ACLGGGGGGTS APD FNAGGTG I GSNSRAT I AE S A 
10 20 30 40 50 60 

7C 80 90 100 110 120 

AVSYAGI KNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRI CI PETFQTQMTHYKNLINLK 

I I I I I I 1 1 1 1 1 1 1 1 I 1 1 1 1 I I 1 1 I 1 1 1 I 1 1 1 I I h I I Illlllllllllh Ihllll 

AVSYAG I KNEMCKDRSMLCAGRDDVAVTDRDAKIKAP - RI CI PETFQTQMTNI KNMI NLK 
70 80 90 100 110 

130 140 150 160 170 
PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENY EKLYGVYAEGSAX 

IIIIIIIIIIIIIIII1IIIIIIIMIIIIIIIMIIIIIIII HIIIIIIIIIII 
PAI EAG YTGRGVEVGI VDTGESVGS I SFPELYGRKEHGYNENYKNKIiQKLYGVYAEGSAX 
120 130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 545>: 

al39.seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACAACAGC GAAATCAGCA GCAATATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTACA AACCCAAATG ACGCAT . ACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTAC. AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 546; ORF 139.a>: 

al39.pep 

1 MRTTPTFPTK TFKPAANALA VATTLSA CLG GGGGGTSAPD FNAGGTGIGS 



m!39.pep 
gl39 

ml39.pep 
gl39 

ml39.pep 
gl39 
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51 NSRATTAKSA AISYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 
101 CIPETLQTQM THXKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 
151 LYGRKEHGYN ENYXKLYGVY AEGSA* 

ml39/al39 97.1% identity in 175 aa overlap 

10 20 30 40 50 60 

IXL39 . pep MRTT PT F PT KT FK PT AMALAVAT T LS ACLGGGGGG T S APD FN AGG T G I G SN S RAT T AK S A 

I I ! I I 1 1 I 1 1 I I I I : M I 1 1 1 I i I I I I I I I I I 1 1 I I 1 1 I II 1 1 1 1 I I i I I I I I I I I I I 1 1 
al39 MRTTPTFPTXTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

10 20 30 40 50 60 

70 80 90 100 110 120 

nl39 . pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 
I : I I I I I I I 1 I I I I I ! I I II I I I I I i I I I I I I I I I I I I I I ! I I I I: I II I I I I I I I I I I 
al39 AISYAGIKN2MCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETLQTQMTHXKNLINLK 

70 80 90 100 110 120 

130 140 150 160 170 

ml39 . pep PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYEKLYGVYAEGSAX 
M I I I I I I I I I I I II I I I I I I I I II II I I I I I I II I I I IN I I I I I I I I I I I I I I 
al39 PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYXKLYGVYAEGSAX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 547>: 
gl40 . seq 

1 Atgtcggcac gCGGCAAGGG GGCAGgctat ctcAACAGTA CCGGACGACa 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 AAAATATCAA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCCG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAGCTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGTCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCTACGG CGCACAACTT 

401 TCCGCACAGC GGCAGCCGTA CAGCATGCGA ATACCGCCGA CGGCGTACGC 

451 aTCTTcaaCA GTCTCGCCGC TAccgTCTAt GccgACAGTG CCGCCGCCCA 

501 TGccgATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

5 51 ACAACGGTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT CGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 TATCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATAGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGTGG GCGATATCGG 

8 01 CTATCTCAAA GGCCTGTTCT CctaCGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGATGAA TATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG TGGTGTCAAC GTTCCGTTTG CCGCAACGGG 

951 AGATTTGACG GTTGAAGGCG GTCTGCGCCA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCagt GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAACTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTCTG CGACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CGCGGTAACG GGCGGCTTTA CCGGCGCGGC TGCAGCAACC 

12 01 GGCAAGACGG GTGCACGCAA TATGCCGCAC ACCCGCCGGG TTGCCGGTCT 

1251 GGGGGTGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACACCGG TTCCAAACAG TACGGCAACC ACAGCGGACA AATCGGCGTA 

1351 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 548; ORF 140.ng>: 

gl40.pep 

1 MSARGKGAGY LNSTGRHVPF LSAAKIGQDY SFFKNIKTDG GLLASLDSVE 

51 KTAGSEGDTP SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

101 ELDASESSAT PETVETAVAD RTDMPGIRLR RTTFRTAAAV QHANTADGVR 

151 IFNSLAATVy ADSAAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 

201 TWEQGGVEGK MRGSTQTIGI AAKTGENTTA AATLGIGRST WSENSANAKT 
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251 DSISLFAGIR HDVGDIGYLK GLFSYGRYKN SISRSTGADE YAEGSVNGTL 
301 MQLGALGGVN VP FAATGDLT VEGGLRHDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLSATAGVER DLNGRDYAVT GGFTGAAAAT 
401 GKTGARNMPH TRRVAGLGVD VEFGNGWNGL ARYSYTGSKQ YGNHSGOIGV 
451 GYRF* 

The following partial DNA sequence was identified in ^meningitidis <SEQ ID 549>: 

ml40 .seq 

1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT CTCAACAGTA CCGGACGACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGCCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGCAGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

401 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

451 ATCTTCAACA GTCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGGCAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATGGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 

801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAG CTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 

951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACGC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGTCTGG TTGCCGGCCT 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

13 01 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGT CGGCGTA 

13 51 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 550; ORF 140>: 

ml4 0 .pep 

1 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE 
51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
101 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR 
151 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 
201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGRST WSENSANAKT 
251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 
301 MQLGALGGVN VP FAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 
4 01 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHS GRVGV 
451 GYRF * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 140 shows 94.5% identity over a 454 aa overlap with a predicted ORF (ORF 140.ng) 
from N. gonorrhoeae: 

ml40/gl40 

10 20 30 40 50 60 

ml4 0 . pep MSARGKGAGY LNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVE KTAGSEGDTL 

II III Ml II II III hi I Mill MM II I! I IhlllMIIIIIIIIIIIIIIMI 
gl4 0 MSARGKGAGYLNSTGRHVPFLSAAKIGQDYSFFKNIKTDGGLLASLDSVBKTAGSEGDTP 

10 20 30 40 50 60 - 

70 80 90 100 110 120 
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ml 4 0 . pep S YYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLBNLMVELDASESSATPETVETAAAD 

II Ml 1 1 II I II IIMII : II Ml III II III III III III III II III! III! [II :|| 

gl4 0 S YYVWiGNAARTASAAAHSAPAGLKHAVEQGGSNLBNlJWELDASESSATPETVETAVAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml40 . pep RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNSLAATVYADSTA^HADMQGRRLKAVS^ 

M M II I I : I I I : I I I I II I I : I I I I I E I I I I I I I I I I I I : I I I I I I I i I | I M I I I 
g 1 4 0 RTDMPGI RLRRTTFRTAAAVQHANTADGVRI FNS LAATVY ADS AAAHADMQGRRLKAVS D 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVI AQTQQDGGTWEQGGVEGKMRGSTQ TVGI AAKTGENTTAAATLGMGRST 

II MMI Ml II II MM III IIMII II Mill I Ih II I III IMIIMI II hi III 

gl4 0 GLDHNGTGLR V I ACTQQDGGTW EQGGVEG KMRGS TQTIGI AAKTGSNTTAAATLG IGRST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 4 0 . pep WSENSANAKTDS I SLFAGI RHDAGDIGYLKGLFS YGRYKNS I SRS TGADEHAEGS VNGTL 

I M f 1 1 1 f 1 1 1 1 1 1 1 ! I M I M : 1 1 1 ! 1 1 1 1 II I I I ! I I I II I : I M I I I I II 

g 1 4 0 WSENSANAKTDS I SLFAGI RHDVGD I GYLKGLFS YGRYKNS I SRSTGADEYAEGS VNGTL 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 0 . pep WLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 

1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 M I M I M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g 1 4 0 MQU5ALGGVNVPFAATGDLTVEGGLmn5LLKODAFAEKGSALGWSGNSLTEGTLVGIiAGL 
310 320 330 340 350 360 



370 380 390 400 410 420 

ml 40 . pep KLSQP LSDKAVL F ATAGVERDLNGRD YTVTGGFTGAT AATGKTGARNM PHTRLVAGLGAD 

1 1 II I II MMI Mill III II IMhl Mil 1 1 MM I III III I II II I I II I hi 

g 1 4 0 KLSQP LSDKAVL SATAGVERDLNGRDYAVTGG FTGAAAATGKTGARNMPHTRRVAGLGVD 

370 380 390 400 410 420 



430 440 450 

ml40 . pep VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 

i III III III IIMhll II MM IhM MM II 

gl40 VE FGNGWNGLARYSYTGS KQYGNHSGQI GVGYRFX 

430 440 450 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 55 1>: 

al40.seq 

1 ATGTCGGCAG GCGGTAAGGG GGCAGGCTAT CTCAACCGTA CCGGACAACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCGGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GTAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGCCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

401 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

4 51 ATCTTCAACA ATCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGCTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATGGG ACACAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 

801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
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951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAG CATC ACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGCCTGG TTGCCGGTCT 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

1351 GGCTACCGGT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 552; ORF 140.a>: 
al40 .pep 

1 MSAGGKGAGY 
51 KTAGSEGDTL 
101 ELDASESSAT 
151 IFNNLAATVY 
201 TWEQGGVEGK 
251 DSISLFAGIR 
301 MQLGALGGVN 
351 EGTLVGLAGL 
401 GKTGARNMPH 
451 GYRF* 

ml40/al40 98.2% identity in 454 aa overlap 

10 20 « 30 40 50 60 

ml 40 . pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 
lit II I I I I I I I I : I I I I I I 1 II I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al40 MSAGGKGAGYLNRTGQRVPFLSAAKIGRDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

10 20 30 40 50 60 



LNRTGORVPF LSAAKIGRDY 
SYYVRRGNAA RTASAAAHSA 
PETVETAAAD • RTDMPGIRPY 
ADSTAAHADM QGRRLKAVSD 
MRGSTQTVGI AAKTGENTTA 
HDAGDIGYLK GLFSYGRYKN 
VPFAATGDLT VEGGLRYDLL 
KLSQPLSDKA VLFATAGVER 
TRLVAGLGAD VEFGNGWNGL 



SFFTNIETDG GLLASLDSVE 
PAGLKHAVEQ GGSNLENLMV 
GATFRAAAAV QHANAADGVR 
GLDHNATGLR VIAQTQQDGG 
AATLGMGHST WSENSANAKT 
SISRSTGADE HAEGSVNGTL 
KQDAFAEKGS ALGWSGNSIT 
DLNGRDYTVT GGFTGATAAT 
ARYSYAGSKQ YGNHSGRVGV 



70, . 80 90 100 110 120 

ml 4 0 . pep SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 II I I I I I I I I I I II I ! I I I I 
al 4 0 SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml40 . pep RTDMPGIRPYGATFRAAAAVQIiANAADGVRIFNSIiAATVYADSTAAHADMQGRRLKAVSD 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I M I I II I I I I I I I I I I M I M I I 
a 1 4 0 RT DMPGI RPYGAT FRAAAAVQHANAADGVR I FNNLAAT VYADSTAAHADMQGRRLKAVS D 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVI AQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRST 

I I I M : II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I F I I I : I I 
a 1 4 0 GLDHNATGLRVI AQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGHST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml40 . pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
I M I I II I I I I i I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I 
al40 WSENSANAKTDSISLFAG1RHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 0 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 
I I I I I II I I I I II I I I I I I I I li I I I I I I I I I I I I I II I I I III M I I: I I I I I I I II I I 
al40 MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSITEGTLVGLAGL 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 0 . pep KLS QP LS DKAVL FAT AG VE RD LN GR D YT VTGG FTG AT AATGKTG ARNM PHTR LV AG LG AD 

I IIIIIIIMIIIIIIIIIlilllllMIIIIMIIIIII IMIII IIIIIIIIIIMM 
al40 KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
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370 380 390 400 410 420 

430 440 450 

ml 4 0 . pep VE FGNGWNG LAR Y S Y AGSKQYGNH SGRVG VG YR FX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 1 4 0 VE FGNGWNGLARY S YAGSKQ YGNHSGRVGVGYRFX 

430 440 450 



The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 553>: 
gl41.seq 

1 atgagcttca aAAccgATGC CGAAACCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAGCTG 

151 CCGCAAAAAC AAGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC 

251 GCCATATCGG CAAAGACTCT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ACGCGCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGCGAC TTCCACGCCA 

401 TCGGTGCGGC GAATAACCTC CTCGCCGCCA TGCTCGACAA CCATATCTAC 

451 CAAGGTAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGTATGGGCA 

551 AGCCTGTtga cggCGTGATG CGtCCCGACG GCTTCGACAT CACCGTCGCC 

601 TCCGAAGTGa tggcgGTATT CTGCCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTtt gGCAATATTC TCGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

701 1TTACG CCAA AGATTTGAAG GCACACGGCG CGATGGCGGC ATTGCTAAAA 

751 GATGCGATTA AGCCCAATTT GGTGCAAACC ATCGAAGGCA CTCCGGCCTT 

801 TGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTTA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG TGCGGAAAAA TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCAGT CGTCGTGGCG ACTGTCCGCG 

1001 CCCTGAAATA CAACGGCGGC GTGGAACGCG CCAACCTTGG TGAAGAAAAC 

1051 CTCGAAGCCT TGGCAAAAGG TTTGCCCAAC CTGTTGAAAC AGATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GCGGCGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA TGCCATCGAC AACCAACCTA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTCG ATTTCAGCGC 

1401 GGAAGCGTCT GCCGAAATCG CCTCGCTGGA AAAACTGGGC TTGGACAAAA 

1451 TGCCGATCTG CATGGCGAAA ACCCAATATT CATTGAGCGA CAACGCCAAA 

1501 CTCTTGGGCT GCCCCGAAGG CTTCCGCATC GCCGTACGCG GTATCACTGT 

1551 TTCCGCCGGC GCGGGCTTCA TCGTTGCGTT GTGCGGCAAT ATGATGAAAA 

1601 TGCCGGGCCT GCCGAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGAA 

1651 CACGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 554; ORF I41.ng>: 

gl41.pep 

1 MSFKTDAETA QSSTMRPIGE IAAKL3LNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRULV TAINPTPAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRKIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVPCLA KDISDbKERF GNILVAYAKD GSPVYAKDLK AHGAMAALLK 

251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LEALAKGLPN LLKHISNLKN VFGLPVWAL NRFVSDSDAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAID NQPNNFGFAY DVELGIKDK1 

451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEGF RI AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDE 

551 HGVIHGLF* 

The following partial DNA sequence was identified in N.meningitidis <SEQ ID 555>: 
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ml41 . seq 

1 ATGAGCTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTTGGTCT GAATGCCGAC AACATTGAGC 

101 CTTACGGTCA TTACAAGGCG AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCGTTGC 

251 GCCACATCGG CAAAGATGCC GTGATTGCCC TGCGCGAACC TTCTCTGGGG 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

4 01 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 

451 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGTA 

551 AACCCGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 

601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

701 TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 

751 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 

801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCOGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGrCCGACGC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

1401 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 

1451 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 

1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 

1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 

1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 

1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ED 556; ORF 141>: 

ml41 .pep 

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNAD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDA VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAANLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK ANGAMAALLK 

251 DAI KPWLVQT I EGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAWVA TVRALKYNGG VERANLGEEN 

351 LDALEKGLPN LLKHISNLKN VFGLPVWAL NRFVSDADAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 

451 RAIAQKVYGA EDVDFSAEAS AEIA5LEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKZDVDA 

551 EGVIHGLF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 141 shows 97.5% identity over a 558 aa overlap with a predicted ORF (ORF 141. ng) 
from N. gonorrhoeae: 

ml4l/gl41 

10 20 30 40 50 60 

ml41 .pep MS F KTDAE I AQS S TMRPI GE I AAKLGLNADN I E P YGH Y KAKIN P AE AFKLPQKQGRLI LV 

M 1 1 1 1 1 1 IIIIIIIIIIIIIIIIIIMIIIilllll IIMI I II I! 1 1 M II I II II 

9141 MSFKTDAETAQSSTNIRPIGEIAAKLGI^JVDNIEPYGHYKAKINPAEAFKLFQKQGRIjILV 
10 20 30 40 50 60 
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80 



90 



100 



110 



120 



ml41 .pep TAINPTPAGEGKTTVT I GLADALRH IGKDAVI ALRE PSLGPVFGVKGGAAGGGYAQVLPM 

Ml I II M III I II III I II II! II III MINN lllllllll II II I II Mil II 1 1 

gl4 1 TA I NPT PAG EGKTTVT I GLAD ALRH I GKD SV I ALRE PS LGPV FGVKGGAAGGGY AQVL PM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml41 .pep E D I NLHFTGDFHAI GAANNLLAAMLDNH I YQGNE LN I DP KRVLWRRVVDMND RQLRN 1 1 D 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMMIIIIIII 

gl41 ED I NLH FTGDFHAI GAANNLLAAMLDNH I YQGNE LN I DPKRVLWRRWDMNDRQLRN I ID 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml41 . pep GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKER 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I 
gl41 GMGKPVDGVMRPDGFD I TVASEVMAV FCLAKD I SDLKER FGN I LVAYAKDGS PVYAKDLK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ra!4 1 . pep ANGAMAALLKDAI KPNLVQTI EGTPAFVHGGPFANI AHGCNSWATRLAKHLADYAVTEA 

MIIIMIIIMMIIM IIIIIMIIIIIIIIIIIIIIIIIIIMMIIIIIMIII 

gl4 1 AHGAMAALLKDAI KPNLVQTI EGTPAFVHGGPFANI AHGCNSVTATRLAKHLADYAVTEA 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml4 1 . pep GFGADLGAEKFCDI KCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 

1 1 ! 1 1 1 1 1 1 1 1 J 1 1 1 » f 1 1 1 1 1 E 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 [ f 1 1 » 1 1 r | i inn 

g 1 4 1 GFGADLGAEKFCDI KCRLAGLKPDAAVWATVPJUjKYNGGVERANLGEENLEALAKGLPN 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 41 . pep LLKHI SNLKNVFGLPVVVALNRFVSDADAELAMI EKACAEHGVEVS LTEVWGKGGAGGAD 

II I M III I III I III III III ||||:| II II I II I III II III MIMI III II III I! 
gl41 LLKHISNLKWFGLPWVALNRFVSDSDAELAMIEKACAEHGVEVSLTOVWGKGGAGGAD 

370 380 390 400 410 420 

430 440 450 460 470 480 

tn!41 .pep IJVRKVVNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

ii i ii 111 1- 1 iiiiiiiii ii i in iii inn in it ii Mini in ii in ii 

g 1 4 1 LARKWNAI DNQPNNFGFAYDVELGI KDKI RAI AQKVYGAEDVDFSAEASAEI ASLEKLG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml4 1 . pep LDKMP I CMAKTQYS LSDNAKLLGCPEDFRI AVRG I TVSAGAGFI VALCGNMMKMPGLPKV 

1 1 1 1 1 M 1 [ 1 1 II 1 M i 1 1 i M i 1 1 1 llllllllllllllllllllllllinilllll 

gl41 LDKMP I CMAKTQYSLSDNAKLLGCPEGFRI AVRG ITVSAGAGFI VALCGNMMKMPGLPKV 

490 500 510 520 530 540 

550 559 
ml4 1 . pep PAAEKI DVDAEGVIHGLFX 

lllllllll HI Ml II I 
gl4 1 PAABKI DVDEHGVI HGLFX 

550 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 557>: 

alll.seq 

1 ATGAGTTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 
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201 GGCGGGCGAA GGTAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC 

251 GCCATATCGG CAAAGACTCT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

4 01 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 

451 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGCA 

551 AGCCTGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 

601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

"701 TTTACGCCAA AGATTTGAAA GCGAATG GCG CGATGGCGGC ATTGCTTAAA 

751 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 

801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA - TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1C01 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1C51 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

1401 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 

1451 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 

1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 

1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 

1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 

1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 558; ORF 14i.a>: 

al41 .pep 

1 MSFKTDAEIA 

51 PQKQGRLILV 

101 PVFGVKGGAA 

151 QGNELNIDPK 

201 SEVMAVFCLA 

251 DAIKPNLVQT 

301 GPGADLGAEK 

351 LDALEKGLPN 

401 HGVEVSLTEV 

451 RAIAQKVYGA 

501 LLGCPEDFRI^ 

551 EGVIHGLF* 

ml41/a!41 99.5% identity in 558 aa overlap 

10 20 30 40 50 60 

ml41 . pep MS FKTDAE I AQS STMRPIGEIAAKLGLNADN IEP YGH YKAK INPAEAFKLPQKQGRLI LV 

Ml I I I I I I I I I ! Ml I I 1 1 I I i I I I I I: I I It III I I I I I! I II I I I I I I I i I I I I I I I 
al41 MSFKTDAEIAQSSTMRPIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKQGRLILV 
10 20 30 40 50 60 

70 80 90 100 110 120 

TAINPTPAGEGKTT\'TIGLADALRHIGKDAVIALREPSLGPVFGVKGGAAGGGYAQVLPM 
II II I I I M I M I I I I I I I M I I M II M : I I II I I S I II 1 1 I I I I II I I I II I I II M I 
TAINPTPAGEGKTTVTIGLADALRHIGKDSVIALREPSLGPVFGVKGGAAGGGYAQVLPM 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 1 . pep EDI NLH FTG DFHAIG AANNLLAAMLDNH I YQGNE LN I DPKRVLW RRVVDMN DRQLRN I ID 

I I I I I I I I M I I I Ml I 1 1 M 1 1 I 1 1 1 1 1 I I I I II I I 1 1 I II I 1 1 M I I I I I I I I M 1 1 I 
al41 EDINLHFTGDFHAI GAANNLLAAMLDNFil YQGNE LNIDPKRVLWRRVVDMNDROLRNI ID 

130 140 150 160 170 180 



QSSTMRPIGE IAAKLGLNVD 
TAINPTPAGE GKTTVTIGLA 
GGGYAQVLPM EDINLHFTGD 
RVLWRRWDM NDRQLRNIID 
KDISDLKERL GNILVAYAKD 
IEGTPAFVHG GPFANIAHGC 
FCDIKCRLAG LKPDAAVWA 
LLKHISNLKN VFGLPWVAL 
WGKGGAGGAD LARKWNAIE 
EDVDFSAEAS AEIASLEKLG 
AVRGITVSAG AGFIVALCGN 



NIEPYGHYKA KINPAEAFKL 
DALRHIGKDS VIALREPSLG 
FHAIGAANNL LAAMLDNHIY 
GMGKPVDGVM RPDGFDITVA 
GSPVYAKDLK ANGAMAALLK 
NSVTATRLAK HLADYAVTEA 
TVRALKYNGG VERANLGEEN 
NRFVSDSDAE LAMIEKACAE 
SQTNNFGFAY DVELGIKDKI 
LDKMPICMAK TQYSLSDNAK 
MMKMPGLPKV PAAEKI DVDA 



ml 41 .pep 
al41 
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190 200 210 220 230 240 

ml41.pep GMGKPVDGVMRPDGFDITVASEVMAVFCIAKDISDLKERLGNILVAYAKOGSPVYAKDLK 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I I I M I || I I | 
al41 GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml4 1 . pep ANGAMAALLKDAI KPNL VQT I EGT PAFVHGG P FAN I AHGCN S VTAT RLAKHLADYAVTEA 

I II I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I i I I I I I I I 1 I I I I I I I I II 
al4 1 AN GAMAALLK DAI K PNLVQT I EGT PAFVHGG P FAN I AHG CN S VT AT RLAKH LAD Y AVTEA 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml41.pep GFGADLGAEKFCD1KCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 
I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II II I I I I I I I 
a 141 GFGADLGAEKFCDIKCRLAGLKPDAAWVATVRALKYKGGVERANLGEENLDALEKGLPN 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml41.pep LLKH I SN LKN V FGL FVWALNRFVS DADAE LAM I EKACAEHGVE VS LTE VWGKGG AGGAD 

I I I I I i I I I I I I I I I I I I I I I I I I I I : I i I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I 
al41 L LKH I SN LKN V FGL PVW ALN RFV S DS DAE LAM I EKACAEHGVE VS LTE VWG KGG AGGA D 

370 380 390 400 410 420 

430 440 450 460 47C 480 

ml4 1 .pep LARKVVNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 ! 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al41 LARK WN A I E S QTNN FG FA YD VE LG I KDK I RAI AQKV YG AE D V D FS AEAS AE IAS LE KLG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml4 1 . pep LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 
I I I 1 1 1 1 I I I 1 1 I I I I 1 1 I I M I I I I I I I I I I I 1 1 I II I II 1 1 I 1 1 I I I II M I I I I I 1 1 
al41 LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 

490 500 510 520 530 540 

550 559 
m!4 1 . pep PAAEKIDVDAEGVIHGLFX 
I I I I I I I I I I I I I I I I I I I 
al4 1 PAAEKIDVDAEGVIHGLFX 

550 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 559>: 
gl42 .seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCTTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAAATATGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCAT ATTGATGCAG AGgCTGCCGT 

201 TTTCCGACAG GATcggaATG AttcgCGCAC TCCGGTTTAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

3 01 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

3 51 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

4 01 GCCATTTTTC CCCTTTAAAC CGTCCCCTAT ATAAGAATGC TGCACACAAG 
451 GCATCCCCCC ATGTGCAGCA GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 560; ORF 142.ng>: 

gl4 2.pep 

1 MRADFMFADN MPVQVRQRAF YFKLSRFAAM PNMVGKPLFG RQAGQPGKMF 

51 GNILMFVRQH IDAEAAVFRQ DRNDS RTPVY AQHHGRRLVG NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN RPLYKNAAHK 

151 ASPHVQOF* 
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The following partial DNA sequence was identified in N.meningitidis <SEQ ID 561>: 

ml4 2 .seq 

1 ATGCGTG CCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGC CGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

.151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCG ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCGC 

351 AAGATGC CAT CGCATCACGG AACGAAGTTT GAAAATTTTT CTGCAAATCC 

4 01 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

4 51 GCATCCCCcC ATGTGCAGCA GTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 562; ORF 142>: 

ml42.pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 
51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVG NRRDRRHCKA 
101 VTPCRTVCRD DMNACRARCH RITERSLKIF LQIRHFSPLN CPLYKNAAHK 
151 ASPHVQQF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 142 shows 93.7% identity over a 158 aa overlap with a predicted ORF (ORF 142.ng) 
from N. gonorrhoeae: 
nl42/gl42 

10 20 30 40 50 60 

MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 h • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h . 

MRADFMFADNMPVQVRQRAFYFKLSRFAAMPNMVGKPLFGRQAGQPGKMFGN I LMFVRQH 
10 20 30 40 50 60 

70 80 90 100 110 120 

I DAE AAV FRQDRNDS RT P VDAQHHGRRLVGNRRDRRHCNAVT PCRTVCRDDMNACRARCH 

lllllllllllllllllll I MINI II IIIUIII I MM III Ml IIMIIh II 

I DAE AAV FRQDRNDS RTPVY AQHHGRRLVGNRRNRRHCNAVTPCRTVCRDDMNACRTGCH 
70 80 90 100 110 120 

130 140 150 159 

RITERSLKI FLQIRHFSPLNCPLYKNAAHKASPHVQQFX 

iiiiiiii milium iimiiiiiiiiiiiiii 

RI TERSLKS FLQ I RH FSPLNRPLYKNAAHKAS PHVQQFX 
130 140 150 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 563>: 

al42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCCGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCACCCCCCA TGTGCAGCAG TTCTGATTCA AAAAGCCGTC GGTCGGACAT 

501 TTCCGCGCGT TACGGCGTAT TACGAGTTCA ACGCATCCTC GATTTTGGCA 

551 AGTTCTGCCA ACAGGTCTTT AAGCAGCAGC ATTTTCTCGC GGCCCAGCAC 

501 TTCCTCGATA GCGTCGTAAC GCTCGTCCAC TTCTTCGCCG ATTTCCTCAT 

651 ACAGCTTCTC GCCCTCGGCA GTCAGCTTCA GAAAAACACG TCGTTGGTCG 

701 TTGGAAGGTT TCAGGCGGAC AACCAAACCC GCTTTTTCAA GGCGGGTCAG 

751 GATACCGGTC AGGCTGGGGC GCAAAATGCA CGCCTGATTC GCCAAATCTT 



ml42 -pep 
gl42 

ml42 -pep 
gl42 

ml42 .pep 
9142 
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801 GAAAGTCCAG CGTGCCGTTT TCCGCCAAAA GACGGATAAT CCGCCATTGC 
851 TGATCGGTAA TATTCGCCTG ATTCAGAATA GGCCTGAATT GGGTCATCAG 
901 GGCTTCCCTT GCCTGTATCA GACCGATATT GATAGACGCA TGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 564; ORF 142.a>: 

al42.pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 

51 GNILMFVRQR I DAEAAVFRQ DRNDSRTPVD AQHHGRRLVR NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN CPLYKNAAHK 

151 APPMCSSSDS KSRRSDISAR YGVLRVQRIL DFGKFCQQVF KQQHFLAAQH 

201 FLDS WTLVH FFADFLIQLL A LGSOLOKNT SLWGRFQAD NQTRFFKAGQ 

251 DTGQAGAQNA RLIRQILKVQ RAVFRQKTDN PPLLIGNIRL IQNRPELGHQ 

301 GFPCLYQTDI DRRMF* 

ml42/al42 96.1% identity in 153 aa overlap 

10 20 30 40 50 60 

MRADFMFADNNPVQVRQRALYFKLSRFAAMPDWGKPLBX5RQAGQPGKMFGNILMFVRQR 
I I N 1 1 I I I I 1 1 1 1 1 1 I 1 1 M M | | | | | | 1 1 | | j | | 1 1 | 1 1| | | | | | | | 1 1 | | | , | , , , | 

MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 
10 20 30 40 50 60 

70 80 90 100 110 120 

IDAEAAVFRQDRNDSRTPVDAQHHGRRLVGNRRDRRHCNAVTPCRTVCRDDMNACRARCH 
I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I II I || | | | | | | | | | | |: || 
IDAEAAVFRQDRNDSRTPVDAQHHGRRLVRNRRNRRHCNAVTPCRTVCRDDMNACRTGCH 
70 80 90 100 110 120 

130 140 150 159 

R ITERS LK I FLQI RH FS PLN C PL YKNAAHKAS PHVQQFX 
I I M M I I 1 M I I I I I I I I I I I I I I I I | II I 

RITERS LKS FLQI RHFS PLNCPL YKNA AHKAP PMCS SS DSKSRRS DI SAR YGVLRVQR I L 
130 140 150 160 170 180 

DFGKFCQQVFKQQHFLAAQHFLDSVVTLVHFFADFLIQLLALGSQLQKNTSLWGRFQAD 
190 200 210 220 230 240 



ml42 .pep 
al42 

ml 4 2 . pep 
al42 

ml 4 2 -pep 
al42 

a!42 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 565>: 

gl43.seq 

1 ATGTTGAGCT TCGGCTATCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGAT3CT GGTTCAGCCG 

151 ATAGTGgGCT ACT AC TC AG A CCGCACTTGG AAGCCGCGCT TGGGCGGCCG 

201 CCGCCTGCCG TATCTGCTTT ACGGCACGCT GATTGCGGTC ATCGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCCTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTGGACG TGTCGTCGAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGATATG GTCAACGAGG 

401 AGCAGAAAAG CTACGCCTAC GGGATTCAAA' GT7TCTTAGC GAATACGGAC 

451 GCGGTTGTGG CAGCGATTCT GCCGTTTGTG TTcgcgtata TCGGTTTGGC 

501 GAACACTGCC GAGAAAGGCG TTGTGCCACA AACCGTGGTC GTAGCATTCT 

551 ATGTGGGTGC GGCGTTACTG ATTATTACCA GTGCGTTCAC AATCTCCAAA 

601 GTCAAAGAAT ACGACCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGTT CGAACTCTTA AAAACCGCGC 

701 CTAAAGTGTT TTGGACGGTT ACTCCGGTAC AGTTTTTCTG CTGGTTCGCC 

751 TTCCGGTATA TGTGGACTTA CTCGGCAGGC GCGATTGCAG AAAACGTCTG 

801 GCACACTACC GATGCGTCTT CCGTAGGCCA TCAGGAGGCG GGCAACCGGT 

851 ACGGCGTTTT GGCGGCGGTG TAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 ATTCTGGCAA AAGTACCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GTTTCTTCTC TATCTTCTTC ATCTACAATC 

10 Gl AATACGCACT CATCCTGTCT TATATCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGGCC AACGCTTTGT CGGGCAAACA 

1101 CATGGATACT TATTTGGGCC TGTttaacgg ctctgtCTGT ATGCcgcaaa 

1151 tcgTcgctTC GctgttgAGT TTCGTGCTTT TCCCGATGCT GGGCGGCCAT 
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1201 CAGGCAACCA TGTTCTTGGT TGCAGGCGCA GTCTTGCTGC TGGGAGCCTT 

1251 CTCAGTCTGT CTGATTAAAG AGATCCACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 566; ORF 143.ng>: 

gl43.pep 

1 MLSFGYLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGYYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKSYAY GIQSFLANTD 

151 AWAAILPFV FAYIGLAN TA EKGVVPQTW VAFYVGAALL IITSA FTISK 

201 VKEYDPETYA RYHGIDVAAN QEKANWFELL KTAPKVFWTV TPVQFFCWFA 

251 FRYMWTYSAG AIAENVWHTT DASSVGHQEA GNRY GVLAAV *SVAAVICSF 

301 ILAKVPNKYH KAG YFGCLAL GALGFFSIFF IYN QYALILS YILIGIAWAG 

351 IITYPLTIVA NALSGKHMDT YLGLF NGSVC MPQIVASLLS FVL FPMLGGH 

401 QATM FLVAGA VLLLGAFSVC L IKEIHGGV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 567>: 

ml43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAAATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC AAATACGGGC 

4 51 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACGATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

7 01 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 

751 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

B51 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTG AT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1301 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1351 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCT TGTTTAACGG CTCTATCTGT ATGCCTCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 568; ORF 143>: 

ml 4 3 .pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHNLG WFFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYDPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAG YFGCLAL GALGFFSVFF IGN QYALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFN GSIC MPQIVASLLS FVL FPMLGGL 

401 QATM FLVGGV VLLLGAFSVF LIKETHGGV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml43 / gl43 93.9% identity in 429 aa overlap 

10 20 30 40 50 60 
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ml 4 3 . pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
I 1 1 I I : I I M I II I 1 1 1 1 1 1 I 1 1 I 1 1 I 1 1 I I I I 1 1 1 1 1 1 I II I 1 1 I 1 1 I II I I : II I 1 1 I 
g!43 MLSFGYLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGYYSDRTW 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 43. pep kprix;grrlpyllygtliavivmilmpnsgsfgfgyasiaalsfgalmialldvssnmam 
I I I I I 1 1 I I M It I 1 1 I I I I I 1 1 M II 1 1 I 1 1 I I I I I i I I 1 1 II I i 1 1 I 1 1 1 I 1 1 I I 1 1 I 
gl43 kprlggrrlpyllygtliavivmilmpnsgsfgfgyaslaalsfgalmialldvssnmam 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml43.pep qpfkmmvgdiwneeqkgyaygiqsfiantgavvaailpfvfayiglantaekgvvpqtvv 
I i I I I M I I ! I I i If I : I I II I I II I I I I I 1 I I 1 I t I I I I I I I I I ! I I I 1 I I f I I I t I i 

gl4 3 qpfkmmvgdmvneeqksyaygiqsflantdawaailpfvfayiglantaekgwpqtw 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 3 . pep vafyvgaallvitsaftifkvkeydpetyaryhgidvaanqekanwiellktapkafwtv 

I I I I 1 I I I II : I I I i I I I I I I I I 1 I II 1 I I 1 I I I I I I I I I I I I I ! : I I I II I I I : II I I 

gl43 vafyvgaalliitsaftiskvkeydpetyaryhgidvaanqekanwfellktapkvfwtv 

190 200 - 210 220 230 240 

250 260 270 280 290 300 

ml 4 3 . pep tlvqffcwfafqymwtysagaiaenvwhttdassvgyqeagnwygvlaavqsvaavicsf 
I I M I I I I I I : I I I 1 N I I I I I I I ! I I I I I I II I I : I I I N lllllil I 1 I t I I I t 1 

gl43 tpvqffcwfafrymwtysagaiaenvwhttdassvghqeagnrygvlaavxsvaavicsf 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml43 . pep vlakvpnkyhkagyfgclalgalgffsvffignqyalvlsytligiawagiitypltivt 

: I I I I I I I I I I I ! I I I I I I I I I I I I I I : I I I 11111:111 I I I I I II I I M I I I II I : 

gl43 ilakvpnkyhkagyfgclalgalgffsiffiynqyalilsyiligiawagiitypltiva 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml43 . pep nalsgkhmgtylglfngsicmpqivasllsfvlfpmlgglqatmflvggwlllgafsvf 

I I I 11 I I I I I I I I I I I I : I I I II I I I I I I I I I I I I I I I I I I I I I I : I : I I I I II M I 
g 1 4 3 NALSGKHMDTYLGLFNGSVCMPQI vasllsfvlfpmlgghqatmflvagavlllgafs VC 

370 380 390 400 410 420 

430 

ml 4 3. pep LIKETHGGVX 
I I I I I I I I I 
gl43 LIKEIHGGVX 
430 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 569>: 

a 143 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATCT TCCAGACGCT CGGTGCCGAT CCGCACAGCC 

101 TCGGCTGGTT CTTTATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

401 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGGC 

451 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACAATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 
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751 TTCCAATATA TGTGGACTTA 

801 GCACACCACC GATGCGTCTT 

851 ACGGCGTTTT GGCGGCGGTG 

901 GTATTGGCGA AAGTGCCGAA 

951 TTTGGCTTTG GGCGCGCTCG 

1001 AATACGCGCT GGTGTTGTCT 

1051 ATTATCACTT ATCCGCTGAC 

1101 TATGGGCACT TACTTGGGCC 

1151 TCGTCGCTTC GCTGTTGAGT 

1201 CAGGCCACTA TGTTCTTGGT 

1251 TTCCGTGTTC CTGATTAAAG 



CTCGGCAGGC GCGATTGCGG AAAACGTCTG 
CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 
CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 
TAAATACCAT AAGGCGGGTT ATTTCGGCTG 
GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 
TATACCTTAA TCGGCATCGC TTGGGCGGGC 
GATTGTGACC AACGCCTTGT CGGGCAAGCA 
TGTTTAACGG CTCTATCTGT ATGCCGCAAA 
TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 
AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 
AAACACACGG CGGGGTTTGA 



This corresponds to the amino acid sequence <SEQ ID 570; ORF 143.a>: 

al43 .pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHSLG WFFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGR RLP YLLYGTLIAV IVMIL MFNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYNPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAGY FGCLAL GALGFFSVFF I GNQY ALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFNGSIC MPQ IVASLLS FVLFPMLGGL 

401 QATM FLVGGV VLLLGAFSVF LI KETHGGV* 



ml43/al43 99.5% identity in 429 aa overlap 



10 20 30 40 50 60 

ml43.pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
I I I I I I I 1 I I I I I I I I I I I I I I I I I I 1 I I I I I : I I I I I I I I I I I I I I I I II I I I I I I I II 
al43 MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHSLGWFFILPPLAGMLVQPIVGHYSDRTW. 

10 20 30 40 50^ 60 

70 80 90 100 110 120 

ml 43 .pep KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al43 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 

TO 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 3 . pep QPFKMMVGDMVNEEQKGYAYGIQSFIJ^TGAWAAILPFVFAYIGLANTAEKGVVPQTVV 
I I i I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 
al43 QPFT<MMVGDMV^EEQKGYAYGIQSFLANTGAWAAILPFVFAYIGLANTAEKGWPOTW 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml43 . pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
I I I I I I I I I I I I I I II I I I I I I I I : I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I 
al43 VAFYVG AALLV I T S AFT I FKVKE YN PET Y AR YH G I DV AANQE KANW I E L LKT APKA EV?T V 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml43 . pep TL VQFFCW FAFQ YMWT Y SAG AI AEN VWHTT DAS SV3YQEAGNWYG VLAAVQS V AAV I C S F 

I I I I I I I I II I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al 4 3 T LVQ FFCW FAFQ YMWT Y SAG AIAENVWHTT DAS SVGY QEAGNWYG VLAAVQS VAAV ICS F 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml43 . pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLS YTLIGIAWAG IITYPLTIVT 

I I I I I I I M II I 1 1 I I.I I I 1 1 1 I 1 1 I I M 1 1 I I II I I II II I I I I 1 1 I I II 1 1 I II II I I 
al43 VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLS YTLIGIAWAG IITYPLTIVT 

310 320 330 340 350 360 



ml 4 3. pep 



370 380 390 400 410 420 

MALSGKHMG T Y LG L FNG S I CM PQ I V AS LL S FVL FPM LGG LQATM FL VGG WLLLG A FS V F 
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MIMIIIIMMl IIIMMMIEIIIII MINIMUM M I t I I t I M I 

al43 N ALS GKHMGT YLGL FN G SI CMPQI V AS LL S FV L FPM LGG LQATM FL VG G V V L L LGAFS V F 

370 380 390 400 410 420 

430 

ml 43 .pep LIKETHGGVX 

if m 1 1 1 1 1 i 

al43 LIKETHGGVX 
430 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 57 1>: 

gl44 .seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGGGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGTGC GTCTTCGT3C 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC CCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGTAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCcgtTT 

351 CAACGCGGTG GCGGCAGACG GccgacggTt atCCCAACGA TTTGGatatT 

401 TCctaccgCT TGGACGAGGA CGGCCGGCTT ACCGTtaccT ATCGCGCCAC 

451 CGCgctCGGC GACACGGTGT TCGACCCGAC GCTGCACATT TACTGGCGGC 

501 TGGACGCGGG CCTGCACGAT GCGGTTCTGC ATATTCCGCA GGGCGGACAT 

551 ATTCCGGCCG ATGCCGAAAA ACTGCCCGTC TTAACGGTTT CAGACGGCCT 

601 CGAAGTATTT GA 

This corresponds to the amino acid sequence <SEQ ID 572; ORF 144 .ng>: 

gl44 .pep 

1 MSDTPATRDF GLI DGRAVTG YVLSNRRGT C VFVLDLGGIV QEFSVL ADGV 

51 RENPWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRRLSQR FGYFLPLGRG RPAYRYLSRH 

151 RARRHGVRPD AAHLLAAGRG PARCGSAYSA GRTYSGRCRK TARLNGFRRP 

201 RSI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 573>: 

ml 4 4 . seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGTCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGATGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAAC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTGg 

401 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

4 51 CCGCTTGGAC GAGGACGACC GGCTTACCGT TAcCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATGCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 574; ORF 144>: 

ml4 4.pep 

1 MSDTPATRDF GLI DGRAVTG YVLSNRRGT R VCVLDLGGIV QEFSVL ADGV 

51 RENLWS FDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLATVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYA 

201 GRCRKTARLN GFRRPRSI* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ml44 / gl44 91.3% identity in 218 aa overlap 

10 20 30 40 50 60 

ml 4 4 .pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSV1ADGVRENLVVSFDD 
I I I I I I i I I I II I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I 
g 1 4 4 MS DT PATRDFG LI DGRAVTG YVLSNRRGTC V FVLDLGG I VQEFS VLADGVREN P WS FDD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 4 . pep AA S YA DN P FQ I NKQ I GRVAGR I RG AAFD I N GRT YRVEANE GRN ALH GG S HG LAVT R FN AV 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gl4 4 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSKGLAVTRFNAV 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 4 . pep AADGRSWLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 
III I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M | | 

gl4 4 AAD GRRLSQRFG — YFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 



190 200 210 219 

ml4 4 . pep AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
I I I I I I I I I I I I I II I I I I : I I I I I I I I I I I I I I I I I I I 
gl4 4 AAGRG P ARCG SAY S AGRT Y SGRC RKT ARLN G FRRPR SIX 

170 180 190 200 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 575>: 

al44 .seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTG . 

4 01 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

4 51 CCGCTTGGAC GAGGACGACC GGCTTACCGT TACCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATTCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 576; ORF 144.a>: 
al4 4 .pep 

1 MSDTPATRDF GL I DGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLA DGV 

51 RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLXTVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRG PARC G SAYS AGRT YS 

201 GRCRKTARLN GFRRPRSI* 



ml44/al44 99.1% identity in 218 aa overlap 

10 20 30 40 5C 60 

ml44 .pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 
II IIIIIIIIIIMHIIIIIIIIIIIIIIMIII IIMIIIIIIIIIIIIMIMMII 
a 14 4 MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGV RENLVVSFDD 

10 20 30 40 50 60 



ml4 4 .pep 



70 80 90 100 110 120 

AASYADNPFQ I NKOIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTR FN AV 



WO 99/57280 



PCT/US99/09346 



410 



I I M I II I I 1 1 I I I ! I I It I I I I I 1 1 I I I 1 1 I 1 1 1 1 I I I I 1 1 I I I I M 1 1 i I I I I I I I M 
al44 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 4 . pep AADGRS WLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

I I M M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I! I I I I I I II I I I I I I I I I M 
al 4 4 AADGRS WLRSRLXTVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 170 130 



190 200 210 219 

AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
I I M I I I I I I I I I I I I I I I: I I I II I I I I I I I I I I I I I I 
AAGRGPARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
190 200 210 

The foflowing partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 577>: 

gl46.seq 

1 ATGAAGCAAA TCCCCCTCCG CCTTCTCCAG GTCGTCATTG ACCACGACAA 

51 AGTCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAa ctTCCCGACT GTCCGTCCCG CGCcctTTGA GGCGCGCGGC 

151 AAGCACGTCG AAAGAAGGCG GCAGGATAAA GATACCGACA GCTTCCGGCA 

201 GCGCGTTGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251. TCATAGCCTG CCGCCGCCAA CGCATTCACG CCCTCCGTGC TTGTGCCGTA 

301 ATAGTTGCCG AATACGTCTG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGATTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGGCG CGTCGTATGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAC CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAA CCGATTGTAT CACAACGGAC ACCCTATTTC 

601 ATATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 



ml 4 4 . pep 
al44 



This corresponds to the amino acid sequence <SEQ ID 578; ORF 146.ng>: 
g!46.pep 

1 MKQIPLRLLQ WIDHDKVEQ YGLFDFMPCL RQPPLDNFPT VRPAPFEARG 

51 KHVERRRQDK DTDSFRQRVA NLRRALNVDF QNHVIACRRQ RIHALRACAV 

101 IVAEYVCVFQ KSLLRDKRFK LFFGNKVIMY AVCFAFTRRA RRMRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPYF 

201 IFADAHILPL LF+ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 579>: 

ml 4 6 . seq 

1 ATGGCGCAAA TCCTCCTCCG CTCGCGCCAA GTCGTCATTG ACCACGACAA 

51 AGTCAAACAA TACGGACTGC TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GGCGCGCGGC 

151 AAGTACGTCG AAAGAAGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCGTCGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC CTGTGCCGTA 

301 ATAGTTGCCA AATACGTCGG CGTATTCCAA AAAAGCTTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGGCG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAATGGAC ACCCAGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 580; ORF 146>: 

ml 4 6. pep 

1 MAQI LLRSRQ WIDHDKVKQ YGLLDFMPCL RQPPLDNFPT VRPASVEARG 

51 KYVERRRQDK DADGFGQRVA NLRRALNVDF QNHVIACRRQ RIHTLRACAV 

101 I VAKYVGV FQ KSFLRDKRLK LFFGNKVIMY AVCFAFTRRA RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQWTPSF 

201 LFADAHILPL LF* 



WO 99/57280 



411 



PCT/US99/09346 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml46 / gl46 90.1% identity in 212 aa overlap 

10 20 30 40 50 60 

ml4 6 .pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 

I II II I I I II II I I : I I I I : I I I I I I II I I I I I I I I I I I I II i I I : I I i I I I I I 
g 1 4 6 MKQI PLRLLQWI DHDKVEQYGLFDFM PCLRQP PLDN FPTVR PA PFE ARGKH VERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 6 . pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 
1:1:1 I I I I I I I I I I I II I I II I I I I I I I I I I : I I I I I I I I I : I I I I I I I : 1 I I I I : I 
gl4 6 DTDSFRQRVANLRRALNVDFQNHVIACRRQRIHALRACAVIVAEYVCVFQKSLLRDKRFK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 6 . pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 
M i I i i I I I 1 I I I I I I M I M I : I I I I I I i ! ! I I i I I! I I I I I I I I I I ! I I II It I I I I t 
gl4 6 LFFGNKVIMYAVCFAFTRRARRMRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 17C 180 

190 200 210 

ml 4 6. pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 

II I I I I I I I M I I II II I: I 11 I I I I I I I i I 
gl4 6 GHIFYLYIFQPIVSQRTPYFIFADAHILPLLFX 

190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 581>: 

al46.seq 

1 ATGGCGCAAA TCCTCCTCCG CCCGCGCCAA GTCATCATTG ACCACGACAA 

51 AATCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GACGCGCAGC 

151 AAGCACATCG AAAGACGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCATCTCG AACCTGAGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAACCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC TTGTGCCGTA 

301 ATAGTTGCCG AACACGTCCG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

4 01 TCGCCTTCAC GCGGCGGACG CGTCGTGTGC GAC AC GGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAACGGAC ACCCGGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 582; ORF 146.a>: 

al4 6.pep 

1 MAQILLRPRQ VIIDHEKIEQ YGLFDFMPCL RQPPLDNFPT VRPASVETRS 

51 KHIERRRQDK DADGFGQRIS NLSRALNVDF QNHVITCRRQ RIHTLRACAV 

101 IVAEHVRVFQ KSLLRDKRLK LFFGNKVIMY AVC FAFTRRT RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPGF 

201 LFADAHILPL LF* 

ml46/al46 90.6% identity in 212 aa overlap 

10 20 30 40 50 60 

ml4 6 . pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
I! Mill IM:||lil::||||:||||||||MIIIIMIIIIIII:|:|::IMIill 
al46 MAQILLRPRQVIIDHDKIEQYGLFDFMPCLRQPPLDNFPTVRPASVETRSKHIERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml4 6 . pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 
I I I I I I I I : : I I I I I I I I I I I I II : I i I I I I I I I I I I I I I I I : : I I I I I I : I I I I I I I 
al4 6 DADGFGQRISNLSRALNVDFQNHVITCRRQRIHTLRACAVIVAEHVRVFQKSLLRDKRLK 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 6 . pep L FFGN KV I M Y A VC FAFT R RARR VRHGN AQT VM V CQQ PRHQRG FARAG SG RN DK D V AF S I S 

I I I I II I I I I 1 I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
al4 6 LFFGNKVIMYAVCFAFTRRTRRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 170 180 

190 200 210 

ml46.pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
I I I I I I I I I I I I I I I I I : 1 I I I I I I I I I I I I I 
al4 6 GHIFYLYIFQPIVSQRTPGFLFADAHILPLLFX 

190 200 210 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 583>: 

gl47.seq (partial) 

1 . .ATGCGACGAG AAGCCAAAAT GGCACAAATC ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CGCCCAAGCG CATGAAACTG 

101 AGCAATCGGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCGACTT CGGGGCTGCT GCACACTTCG ACCGCCTCCG ACAAAA7CAT 

201 CTCCGGCGAT ACTTTGCGCC AAAAAGCCGT CAACTTGGGC GACGCTTTGG 

251 ACGGCGTACC GGGCATCCAC GCTTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC GGGCAGACGG ATTAAAGTAT TGAACCATCA 

351 CGGCGAAACG GGCGATATGG CGGACTTTTC TCCCGATCAC GCCATTATGG 

401 TAGATACCGC CTTGTCGCAA CAGGTTGAAA TCCTGCGCGG GCCGGTTACG 

451 CTCTTGTACA GCTCGGgcaa tgtggccgGG GCTGGtcaat gttgccgatg 

501 gAAAAAtccc ccaaaaAAtg cc. . 

This corresponds to the amino acid sequence <SEQ ID 584; ORF 147.ng>: 

gl47.pep (partial) 

1 . -MRREAKMAQI TLKPIVLSIL LINTPLLAQA HETEQSVGLE TVSWGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDTALSQ QVEILRGPVT 

151 LLYSSGNVAG AGQCCRWKNP PKNA. . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 585>: 

ml47.seq (partial) 

1 . . CCGCATAAAA CTGAGCAATC GGTGGATTTG GAAACGGTCA GCGTCGTCGG 

51 CAAAAGCCGT CCGCGCGCCA CGTCGGGGCT GTTGCACACT TCGACCGCCT 

101 CCGACAAAAT CATCTCCGGC GATACCTTGC GCCAAAAAGC CGTCAACTTG 

151 GGCGACGCTT TAGACGGCGT ACCGGGCATC CACGCTTCGC AATACGGCGG 

201 CGGCGCGTCT GCTCCCGTCA TTCGCGGTCA AACAGGCAGG CGGATTAAAG 

251 TGTT GAACCA TCACGGCGAA ACAGGCGATA TGGCGGATTT TTCGCCCGAT 

301 CACGCCATTA TGGTAGATAC CGCCTTGTCG CAACAGGTCG AAATCCTGCG 

351 CGGGCCGGTT ACGCTCTTGT ACAGCTCGGG CAATGTGGCG GGGCTGGTCG 

401 ATGTTGCCGA TGGCAAAATC CCCGAAAAAA TGCCTGAAAA CGGCGTATCG 

451 GGCGAACTCG GATTGCGTTT GAGCAGCGGC AATCTGGAAA AACTCACGTC 

501 CGGCGGCATC AATATCGGTT TGGGCAAAAA CTTTGTATTG CACACGGAAG 

551 GGCTGTACCG CAAATCGGGG GATTACGCCG TACCGCGTTA CCGCAATCTG 

601 AAACGCCTGC CCGACAGCCA CGCCGATTCG CAAACGGGCA GCATCGGGCT 

651 GTCTTGGGTT GGCGAAAAAG GTTTTATCGG CGTAGCGTAC AGCGACCGTC 

701 GCGACCAATA TGGTCTGCCT GCCCACAGCC ACGAATACGA TGATTGCCAC 

751 GCCGACATCA TCTGGCAAAA GAGCTTGATT AACAAACGCT ATT T ACAGCT 

801 TTATCCGCAC CTGTTGACCG AAGAAGACAT CGATTACGAC AATCCGGGCT 

851 TGAGCTGCGG CTTCCACGAC GACGATAATG CACACGCACA CACCCACAGC 

901 GGCAGACCGT GGATAGACCT GCGCAACAAA CGCTACGAAC TCCGTGCCGA 

951 ATGGAAGCAA CCGTTCCCCG GTTTTGAAGC CCTGCGCGTA CACCTGAACC 

1001 GCAACGACTA CCGCCACGAC GAAAAAGCAG GCGATGCAGT CGAAAACTTT 

1051 TTTAACAACC AAACGCAAAA CGCCCGCATC GAGTTGCGCC ACCAACCCAT 
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1101 AGGTCGTCTG AAAGGCAGCT GGGGCGTGCA ATATTTACAA CAAAAATCCA 

1151 GTGCTTTATC TGCCATATCC GAAGCGGTTA AACAACCGAT GCTGCTTGAC 

1201 AACAAAGTGC AACATTACAG CTTTTTCGGT GTAGAACAGG CAAACTGGGA 

1251 CAACTTCACG CTTGAAGGAG GCGTACGCGT GGAAAAACAA AAAGCCTCCA 

1301 TTCAGTACGA CAAAGCATTG ATTGATCGGG AAAACTACTA CAACCACCCC 

1351 CTGCCCGACC TCGGCGCGCA CCGCCAAACC GCCCGCTCAT TCGCACTTTC 

1401 GGGCAACTGG TATTTCACGC CACAACACAA ACTCAGCCTG ACCGCCTCCC 

1451 ATCAGGAACG CCTGCCGTCA ACGCAAGAGC TGTACGCACA CGGCAAACAC 

1501 GTCGCCACCA ACACCTTTGA AGTCGGCAAC AAACACCTCA ACAAAGAGCG 

1551 TTCCAACAAT ATCGAACTCG CGCTGGGCTA CGAAGGCGAC CGCTGGCAAT 

1601 ACAATCTGGC ACTCTACCGC AACCGCTTCG GTAACTACA? TTACGCCCAA 

1651 ACCTTAAACG ACGGACGCGG CCCCAAATCC ATCGAAGACG ACAGCGAAAT 

1701 GAAGCTCGTG CGCTACAACC AATCCGGCGC CGACTTCTAC GGCGCGGAAG 

1751 GCGAAATCTA CTTCAAACCG ACACCGCGCT ACCGCATCGG CGTTTCCGGC 

1801 GACTATGTAC GAGGCCGTCT GAAAAACCTG CCTTCCCTAC CCGGCAGAGA 

1851 AGATGCCTAC GGCAACCGTC CTTTCATCGC ACAGGACGAC CAAAATGCCC 

1901 CCCGTGTTCC GGCTGCGCGC CTCGGCTTCC ACCTGAAAGC CTCGCTGACC 

1951 GACCGTATCG ATGCCAATTT GGACTACTAC CGCGTGTTCG CCCAAAACAA 

2001 ACTCGCCCGC TACGAAACGC GCACGCCCGG ACACCATATG CTCAACCTCG 

2051 GCGCAAACTA CCGCCGCAAT ACGCGCTATG GCGAGTGGAA TTGGTACGTC 

2101 AAAGCCGACA ACCTGCTCAA CCAATCCGTT TACGCCCACA GCAGCTTTCT 

2151 CTCTGATACG CCGCAAATGG GCCGCAGCTT TACCGGCGGC GTGAACGTGA 

2201 AGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 586; ORF 147>: 

ml 4 7. pep (partial) 

1 . . PHKTEQSVDL ETVSVVGKSR PRATSGLLHT STASDKIISG DTLRQKAVNL 

51 GDALDGVPGI HASQYGGGAS APVIRGQTGR RIKVLNHHGE TGDMADFSPD 

101 HAIMVDTALS QQVEILRGPV TLLYSSGNVA GLVDVADGKI PEKMPENGVS 

151 GELGLRLSSG NLEKLTSGGI NIGLGKNFVL HTEGLYRKSG DYAVPRYRNL 

201 KRLPDSHADS QTGSIGLSWV GEKGFIGVAY SDRRDQYGLP AHSHEYDDCH 

251 ADIIWQKSLI NKRYLQLYPH LLTEEDIDYD NPGLSCGFHD DDNAHAHTHS 

301 GRPWIDLRNK RYELRAEWKQ PFPGFEALRV HLNRNDYRHD EKAGDAVENF 

351 FNNQTQNARI ELRHQPIGRL KGSWGVQYLQ QKSSALSAIS EAVKQPMLLD 

401 NKVQHYSFFG YEQANWDNHT LEGGVRVEKQ KASIQYDKAL 1URENYYNHP 

4 51 LPDLGAHRQT ARSFALSGNW YFTPQHKLSL TASHQERLPS TQELYAHGKH 

501 VATNTFEVGN KHLNKERSNN IELALGYEGD RWQYNLALYR NRFGNYIYAQ 

551 TLNDGRGPKS 3EDDSEMKLV RYNQSGADFY GAEGEIYFKP TPRYRIGVSG 

601 DYVRGRLKNL PSLPGREDAY GNRPFIAQDD QNAPRVPAAR LGFHLKASLT 

651 DRIDANLDYY RVFAQNKLAR YETRTPGHHM LNLGANYRRN TRYGEWNWYV 

701 KADNLLNQSV YAHSSFLSDT PQMGRSFTGG VNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from Af. gonorrhoeae 

ml47 / gl47 92.3% identity in 142 aa overlap 

10 20 30 

ml4 7 . pep PHKTEQSVDLETVSWGKSRPRATSGLLHTS 

I: I I I I I I I I I I II II I I I I I I I ! M I I I 
914 7 MRREAKMAQITLKPIVLS ILL INT PLLAQAHETEQSVGLETVSWGKSR PRATSGLLHT 5 

10 20 30 40 50 60 

40 50 60 70 80 90 

ml47 . pep TASDKIISGDTLRQKAVNLGDALDGVPGIR7VSQYGGGASAPVIRGQTGRRIKVLNHHGET 
MIIIIIIIIIIIII MUM IHI Mi! IN Ml III III II III Mill mm II! 
gl47 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
70 80 90 100 110 120 

100 110 120 130 140 150 

ml47 . pep GDMADFS PDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVOVADGKIPEKMPENGVSG 

I I I ' I M M I M I II I M I I I I I M 1 M I II M I I M M I : ill 
gl47 GDMADFS PDHAIMVDTALSQQVE ILRGPVTLLY SSGNVAGAGQCCRWKN PPKNA 
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The following partial DNA sequence was identified in K meningitidis <SEQ ID 587>: 

al47 . seq 

1 ATGCGACGAG AAGCCAAAAT GGCACAAACT ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CTCCCAAGCG CATGGAACTG 

101 AGCAATCAGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCCACTT CGGGGCTGCT GCACACTTCT ACCGCCTCCG ACAAAATCAT 

201 CAGCGGCGAC ACCTTGCGAC AAAAAGCCGT CAACTTGGGT GATGCTTTAG 

251 ACGGCGTACC GGGCATTCAT GCCTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC AGGCAGACGG ATTAAAGTGT TGAACCATCA 

251 CGGCGAAACG GGCGACATGG CGGACTTCTC TCCAGACCAT GCAATCATGG 

401 TGGACAGCGC CTTGTCGCAA CAGGTCGAAA TCCTGCGCGG TCCGGTTACG 

451 CTCTTGTACA GCTCGGGCAA TGTGGCGGGG CTGGTCGATG TTGCCGATGG 

501 CAAAATCCCC GAAAAAATGC CTGAAAACGG CGTATCGGGC GAACTCGGAT 

551 TGCGTTTGAG CAGCGGCAAT CTGGAAAAAC TCACGTCCGG CGGCATCAAT 

601 ATCGGTTTGG GCAAAAACTT TGTATTGCAC ACGGAAGGGC TGTACCGCAA 

651 ATCGGGGGAT TACGCCGTAC CGCGTTACCG CAATCTGAAA CGCCTGCCCG 

701 ACAGCCACGC CGATTCGCAA ACGGGCAGCA TCGGGCTGTC TTGGGTTGGC 

751 GAAAAAGGCT TTATCGGCGC AGCATACAGC GACCGTCGCG ACCAATATGG 

801 TCTGCCTGCC CACAGCCACG AATACGATGA TTGCCACGCC GACATCATCT 

851 GGCAAAAGAG TTTGATTAAC AAACGCTATT TGCAGCTTTA TCCGCACCTG 

901 TTGACCGAAG AAGACATCGA TTACGACAAT CCGGGCTTGA GCTGCGGCTT 

951 TCACGACGAC GATGATGCAC ACGCCCATGC CCACAACGGC AAACCTTGGA 

1001 TAGACCTGCG CAACAAACGC TACGAACTCC GCGCCGAATG GAAGCAACCG 

1051 TTCCCCGGTT TTGAAGCCCT GCGCGTACAC CTGAACCGCA ACGACTACCG 

1101 CCACGACGAA AAAGCAGGCG ATGCAGTAGA AAACTTTTTT AACAACCAAA 

1151 CGCAAAACGC CCGTATCGAG TTGCGCCACC AACCCATAGG CCGTCTGAAA 

1201 GGCAGCTGGG GCGTGCAATA TTTGGGACAA AAATCCAGTG CTTTATCTGC 

1251 CACATCCGAA GCGGTCAAAC AACCGATGCT GCTTGACAAT AAAGTGCAAC 

1301 ATTACAGCTT TTTCGGTGTA GAACAGGCAA ACTGGGACAA CTTCACGCTT 

1351 GAAGGCGGCG TACGCGTGGA AAAACAAAAA GCCTCCATCC GCTACGACAA 

1401 AGCATTGATT GATCGGGAAA ACTACTACAA CCATCCCCTG CCCGACCTCG 

1451 GCGCGCACCG CCAAACCGCC CGCTCATTCG CACTTTCGGG CAACTGGTAT 

1501 TTCACGCCAC AACACAAACT CAGCCTGACC GCCTCCCATC AGGAACGCCT 

1551 GCCGTCAACG CAAGAGCTGT ACGCACACGG CAAACACGTC GCCACCAACA 

1601 CCTTTGAAGT CGGCAACAAA CACCTCAACA AAGAGCGTTC CAACAATATC 

1651 GAACTCGCGC TGGGCTACGA AGGCGACCGC TGGCAATACA ATCTGGCACT 

1701 CTACCGCAAC CGCTTCGGCA ACT AC ATT T A CGCCCAAACC TTAAACGACG 

1751 GACGCGGCCC CAAATCCATC GAAGACGACA GCGAAATGAA GCTCGTGCGC 

1801 TACAACCAAT CCGGTGCGGA CTTCTACGGC GCGGAAGGCG AAATCTACTT 

1851 CAAACCGACA CCGCGCTACC GCATCGGCGT TTCCGGCGAC TATGTACGAG 

1901 GCCGTCTGAA AAACCTGCCT TCCCTACCCG GCAGGGAAGA CGCCTACGGC 

1951 AACCGCCCAC TCATTGCCCA AGCCGACCAA AACGCCCCTC GCGTTCCGGC 

2001 TGCGCGCCTC GGCGTCCACC TGAAAGCCTC GCTGACCGAC CGCATCGATG 

2051 CCAATTTGGA CTACTACCGC GTGTTCGCCC AAAACAAACT CGCCCGCTAC 

2101 GAAACGCGCA CGCCCGGACA CCATATGCTC AACCTCGGCG CAAACTACCG 

2151 CCGCAATACG CGCTATGGCG AGTGGAATTG GTACGTCAAA GCCGACAACC 

2201 TGCTCAACCA ATCCGTTTAC GCCCACAGCA GCTTCCTCTC TGATACGCCG 

2251 CAAATGGGCC GCAGCTTTAC CGGCGGCGTG AACGTGAAGT TTTAA 

This corresponds to the amino acid sequence <SEQ ID 588; ORF 147.a>: 

a!47 . pep 

1 MRREAKMAQT TLKPIVLSIL LINTPLLSQA HG TEQSVGLE TVSVVGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDSALSQ QVEILRGPVT 

151 LLYSSGNVAG LVDVADGKIP EKMPENGVSG ELGLRLSSGN LEKLTSGGIN 

201 IGLGKNFVLH TEGLYRKSGD YAVPRYRNLK RLPDSHADSQ TGSIGLSWVG 

251 EKGFIGAAYS DRRDQYGLPA HSHEYDDCHA DIIWQKSLIN KRYLQLYPHL 

301 LTEEDIDYDN PGLSCGFHDD DDAHAHAHNG KPWIDLRNKR YELRAEWKQP 

351 FPGFEALRVH LNRNDYRHDE KAGDAVENFF NNQTQNARIE IjRHQPIGRLK 

401 GSWGVQYLGQ KSSALSATSE AVKQPMLLDN KVQHYSFFGV EQANWDNFTL 

451 F.GGVRVEKQK ASTRYDKALI DRENYYNHPL PDLGAHRQTA RSFALSGNWY 

501 FTPQHKLSLT ASHQERLPST QELYAHGKHV ATNTFEVGNK HLNKERSNNI 

551 ELALGYEGDR WQYNLALYRN RFGNYIYAQT LNDGRGPKSI EDDSEMKLVR 
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601 YNQSGADFYG AEGEIYFKPT PRYRIGVSGD YVRGRLKNLP SLPGREDAYG 

651 NRPLIAQADQ NAPRVPAARL GVHLKASLTD RIDANLDYYR VFAQNKLARY 

701 ETRTPGHHML NLGANYRRNT RYGEWNWYVK ADNLLNQSVY AHSSFLSDTP 

751 QMGRSFTGGV NVKF* 

ml47/al47 98.1% identity in 734 aa overlap 

10 20 30 

ml 4 7 . pep PHKTEQSV DLE TVS WGKSR PRAT SGLLHTS 

I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
al47 MRREAKMAQTTLKPTVLSILLINTPLLSQAHGTEQSVGLF.TVSWGKSRPRATSGLLHTS 
10 20 30 40 50 60 

40 50 60 70 80 90 

ml47 . pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
I I I I I I I I II I I I I I I I I fl I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I (I I 
al4 7 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
70 80 90 100 110 120 

100 110 120 130 140 150 

ml4 7 . pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
ll!lll!lll!llll:ltl till llllllll MllllilMI Ml lllll! Ilillilll 

al47 GDMADFSPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
130 140 150 160 170 180 

160 170 180 190 200 210 

ml4 7 . pep ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 
I I I I I I I i I I I I I II I I I I II I I I I 1 I I II II 1 I I II i I I I I I M I I I I I I I I i I I ! I I I 
al4 7 ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 
190 200 210 220 230 240 

220 230 240 250 260 270 

ml 4 7. pep TGSIGLSWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 
I I I I I II I II I I I I I I : I M II I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I II I I I I 
al47 TGSIGLSWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 
250 260 270 280 290 300 

280 290 300 310 320 330 

ml4 7 . pep LTEEDIDYUNPGLSCGB^HUDDNAHAHTHSGRPWIULRNKRYELRAEWKQPB'PGFEALRVH 
I I I I I If I I I I I I 1 1 I I I I I I : I 1 1 I : I : I: I 1 1 I I I I I I I I 1 1 I 1 1 I 1 1 1 1 1 I I I M I I 
al47 LTEEDIDYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVH 
310 320 330 340 350 360 

340 350 360 370 380 390 

ml4 7 . pep LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISE 
I I I I I II I I I I I I I I I I I I I I I I I I 1 I I I I M II I I I I I I I I I I I I II I I I I I I I I II 
al47 LNRNDYRHDEKAGDAVENFTNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSE 
370 380 390 400 410 420 

400 410 420 430 440 450 

ml47.pep AVKQPMLLDN KVQH YS FFGVEQANWDN FT LEGGVRVEKQKAS I QYDKAL I DREN Y YNHP L 

I I II I I i I I I I I I! M I M I I I I I I I I I II I I I I I I I I I I I I i : I I I I I I I I I I I I I 1 I I 
al4 7 AVKQPMLLDN KVQH YS FFGVEQANWDN FT LEGGVRVEKQKAS I RYDKAL I DREN YYN HP L 

430 440 450 460 470 480 

460 470 480 490 500 510 

ml4 7 . pep PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 

llllll IIIIIMII III llllilllill IMIMIIIMIIIIIIIIIII Ilillilll 
al4 7 PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 
490 500 510 520 530 540 

520 530 540 550 560 570 

ml47.pep HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 
I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
al4 7 HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 
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550 560 570 580 590 600 

580 590 600 610 620 630 

YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQ 
I I I I I I I I I I I I ! I I I I I I I I i I M I I I I I I I I I I I I I II I I I I I I I I I I I I I : I I I II 
YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQ 
610 620 630 640 650 660 

640 650 660 670 680 690 

NAPRVPAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRT PGHHMLNLGANYRRNT 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I II I I I I I I I I I I 
NAPRVPAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 
670 680 690 700 710 720 

700 710 720 730 

RYGEWNWYVKADNLLNQS V YAH SS FLSDT PQMGRS FTGGVNVKFX 
II II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
R YGEWNW YVKADNLLNQS VYAHS S FLSDT PQMGRS FTGGVNVKFX 
730 740 750 760 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 589>: 

gl48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGctgg ttcaTCCCGA 

51 AgctATgagt gtcggcgCGC TTGccgAcaa AATCCGCAAA AtcgaAAact 

101 gGCCGCAAAA AGgcaTCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGT 

151 GCGGAATACT TCCGCCTTTT GGTCGATTTG CTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGCTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAaCGtcg gctTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTAcg cgcTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGa tgccgTCAAA CCCGGTTCGC 

401 GCGTCCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC AATGCTTGCC 

4 51 GGGCTGGAAC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAgccgccgC 

501 CAT TT TGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGCGCAAGTG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGCAT GAAAGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 590; ORF 148.ng>: 

g!48.pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK PGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 591 >: 

ml 4 8 .seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGCGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTTCAAAGC 

151 GCGGAATACT TCCGCCTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGATTGCCA CGGGCGGCAC GATGCTTGCC 

4 51 GGACTGGAAC TGATCCGCAA ACTCGGCGGA GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 592; ORF 148>: 

ml 4 8 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 



ml 47 .pep 
al47 

ml 47 .pep 
al47 

ml 4 7 .pep 
al47 
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101 KGKLPFETVS QS YALEYGEA AVEIHTDAVK LGSRVLLVDD LIATGGTMLA 
151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAFLFTL LQNEGCMKG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted QRF from N. gonorrhoeae 

ml48 / gl48 99.0% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8 . pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 
I ! I I I I II I I ! I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I f 
gl48 MALKTSNLEHAMLVllPEAMSVGALADKIRKIENWPCKGILrHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 8 . pep LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II II I I I II I I I I I I I I II I I II I I I I I I 
gl48 LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml48 . pep AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAI LEFTDLQGGKNI 

I I I I I I I I I I I I I I II II I I : I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
gl48 AVE I HTDAVKPGSRVLLVDDLVATGGTMLAGLE LI RKLGGE I VEAAAI LEFTDLQGGKNI 

130 140 150 160 170 180 

190 . 200 

ml 4 8 . pep RASGAPLFTLLQNEGCMKGX 
I I 1 I 1 I I I I II I I I I 1 I I I I 
g 1 4 8 RASGAPLFTLLQNEGCMKGX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 593>: 

al48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGTGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGC 

151 GCGGAATACT TCCGACTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

401 GCGTGCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC GATGCTTGCC 

451 GGACTGGAGC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 594; ORF 148.a>: 

al48 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

ml48/a!48 99.5% identity in 199 aa overlap 

10 20 30 40 50 60 

m!4 8 . pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 
I I 1 1 I I I I I I I 1 1 I I I I I I 1 1 I 1 1 I I 1 1 I I I I I I I I I II I I 1 1 I 1 1 I I I I I I 1 1 I I I M I 
al48 MALKTSNLEIHAMLVHPFAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 
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LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
I i M I I I I I I I I I I I I I I II I M i j | | | H t | | II I III I I I I I I I I I! I I f I I I I M | I 
LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
70 80 90 1.00 110 120 

130 140 150 160 170 180 

AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 
1 M I II I I I ! I M I I i I I I I I : II I M I I I II I | f | I I | M I M | | | | | M f | | | | | | | [ 
AVEI HTDAVKLGSRVLLVDDLVATGGTMLAGLEL I RKLGGEI VEAAAI LEFT DLQGGKN I 

130 140 150 160 170 180 

190 200 
RAS G A P L FT LLQN E GCMKGX 
I I I I I I I I I I I I I I I I I I II 
RASGAPLFTLLQNEGCMKGX 

190 200 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 595>: 
gl49.seq 

1 ATGTTGATTG ACAACAATGT CCGCCATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAATTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGGTAC GACAAAGCAT TGATTGATCG AGAAAACTAC 

151 TACAACCAGC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 GTTCGCACTT TCGGGCAACT GGTATTTCAC GCCACACCAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAa cgCCTGCCGT CAACGCaagA actGtACgca 

301 cacggcAAGC ACGtcgccac CAACACCTTT GAagtcggca acaaACACCT 

351 CAACAAAGaG CgttccaacA atatcgaACT CGCGCTGGgc tAcaaaggcg 

401 accGCTGGCA ATACAATCTG GCAGCCTACC GCAACCGAtT CGGCAACTAC 

451 ATTTACGCCC AAACCTTAaa cgacggacgC GGCCCCAAAT CCATCgaaga 

501 cgacagcgaA ATGaagcTCG TGCGCTACAA CCAATCCGGT GCCGACTTCT 

551 ACGgcgcggA aggcgaaatc tACTTcaaaC CGAcACCGCG CTACCGCATC 

601 GGTGTTTCCG GCGACTatgt acgaggccgT CTGAAAAACC TGCCGTCCCT 

651 ACCCGGCAGG gaagatCCCT AcggcAAACG TCccttcaTC GCACAAGCCG 

701 ACCAAAACGC CCCCCGGATT ccggctGCGC GCCTCGGCTT CCACCTGAAA 

751 ACCTCGCTAA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGTACGCCC GGACACCATA 

851 TGCTCAACCT CGGTGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCtc aACcaatCcg tTTACGCCCa 

951 cAGCAGCTTC CTCTCTGATA CGCCGCAAAt gGGCCGCAGC TTtgccgGCg 

1001 gcgtaAACGT GaAGTTttaA 

This corresponds to the amino acid sequence <SEQ ID 596; ORF 149jig>: 

gl49.pep w 

1 MLIDNNVRHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNQPLPDLGA HRQTARSFAL SGNWYFTPHH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YKGDRWQYNL AAYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDPYGKRPFI AQADQNAPRI PAARLGFHLK 

251 TSLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FAGGVNVKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 597>: 

ml49.seq 

1 ATGCTGCTTG ACAACAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG AGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATTCAGTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCACC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

3 51 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

4 01 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGTAACTAC 



ml4 8 .pep 
al48 

ml 4 6 . pep 
al46 

ml4 8.pep 
al48 
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451 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGC GCCGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGA GAAGATGCCT ACGGCAACCG TCCTTTCATC GCACAGGACG 

701 ACCAAAATGC CCCCCGTGTT CCGGCTGCGC GCCTCGGCTT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTT CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 598; ORF 149>: 

ml49.pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIQY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATOTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPFI AQDDQNAPRV PAARLGFHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N ^gonorrhoeae 

ORF 149 shows 95.9% identity over a 339 aa overlap with a predicted ORF (ORF 149.ng) 
from K gonorrhoeae: 

ml49/gl49 

10 20 30 40 50 60 

MLLDN KVQH Y S FFGVEQANWDN FTLEGG VRVE KQKAS I Q YD KAL I DREN Y YNHP LPDLGA 

IhllMMM Mill Mill Ml III MUM MIMMI II II I MM II Ml I Mil 
MLIDNNVRHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNQPLPDLGA 
10 20 30 40 50 60 



ml49 .pep 
gl49 



70 80 90 100 110 120 

ml4 9 . pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 
II I II II I II I II llllhl I II I III III III II Mil I I II I II III I II I II II III 
gl4 9 HRQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml49 . pep RSNNI ELAIiGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKS I EDDSEMKLVRYNQSG 

• II I II M M II M Mi III II 1 1 1 1 1 1 1 1 1 M 1 1 r ! I ! 1 1 1 1 M M 1 1 1 i 1 1 1 1 f ! M E 

g!4 9 RSNNI ELALGYKGDRWQYNLAAYRNRFGNYI YAQTLNDGRGPKS I EDDSEMKLVRYNQSG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml4 9 . pep • ADFYGAEGE I YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREEAYGNRP FI AQDDQNAPRV 

1 1 1 1 i 1 1 1 1 1 1 1 1 1 ! 1 1 1 i 1 1 1 M 1 1 1 1 1 1 1 ] 1 1 1 1 i 1 1 1 1 1 Ihllllll Mlllh 

gl49 AD FYGAEGE I Y FKPT PRYR IGVSGD YVRGRIiKNL P SLPGRED PYGKRP FI AQADQNAP RI 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml4 9 . pep PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHML 

MM IMMh II Ml III MM IIMMI Ml IIIIIIM IMM MM II I MIIMI 

gl4 9 P AARLG FHLKTSLTDR IDANLD Y YRV FAQNKLAR YETRT PGHHMtiNLGANYRRNTRYGEW 

250 260 270 280 290 300 



310 320 330 340 

ml4 9 . pep NWYVKADNLLNQSVYAHSS FLSDTPQMGRSFTGGVNVKFX 

II I IMMIMM III MMMM II I II I hi IIMII I 
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gl4 9 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFAGGVNVKFX 
310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 599>: 
al49.seq 

1 ATGCTGCTTG ACAATAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGCTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCATC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA AT AT CGAACT CGCGCTGGGC TACGAAGGCG 

4 01 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGCAACTAC 

4 51 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CC AT CGAAG A 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGT GCGGACTTC? 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGG GAAGACGCCT ACGGCAACCG CCCACTCATT GCCCAAGCCG 

701 ACCAAAACGC CCCTCGCGTT CCGGCTGCGC GCCTCGGCGT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGCAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 600; ORF 149.a>: 

al 4 9 . pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR 3DAYGNRPLI AQADONAPRV PAARLGVHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

m!49/al49 98.8% identity in 339 aa overlap 

10 20 30 40 50 60 

ml 4 9 . pep MLL DNKVQHY S FFGVEQANW DN FTLEGGVRVEKQKAS I QYDKALI DREN YYNH PLPDLGA 

M I I 1 I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I : I I I ! M I I II I 1 I I I i I I I I I 
al4 9 MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 9 . pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNECHLNECE 
M I I I I I I 1 1 I I I I I I I I I I I 1 1 I I ! I I II I 1 1 I 1 1 I I I I I I I I M I Ml I I I I I I I 1 1 I 
al 4 9 HRQT ARSF^ALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml49.pep RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 
I I I I I I I I I I M I I I 1 1 I M I I 1 1 I I I I I I M 1 1 I 1 1 I I I I I I I I 1 1 I 1 1 I I M I I I I M 
al 4 9 RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 9 . pep A D F YG AE GE I Y FKPT PR Y R I G V S G D YVRG RLKN L P S L PGRE DA YGNR P F I AQD DQNAP R V 

I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I 
al49 ADFYG AEGE I Y FK PT PR YR I GV S GD YVRGRLKN L P S L PGRE DA YG NR P L I AQ A DQN A PR V 

190 200 210 220 230 240 



ml 4 9 . pep 



250 260 270 280 290 300 

PAARLGFHLKASLTDRID AN LDYYRVFAQNKLARYETRTPGHHMLNLGAN YRRNTRYGEW 
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linn 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 ii 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 ii 1 1 1 

al4 9 PAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 
250 260 270 280 290 300 

310 320 330 340 

ml 4 9 . pep NW YVKADNLLNQSVYAHS SFLSDT PQMGRS FTGG VN VKFX 

I I I I I I I I I ! I I I I I I I I II I II ! I I I I I i I I i I I I I I I I 
a 1 4 9 NW YVKADNLLNQ S V YAHS S FL S DT PQMGRS FTGG VN VKFX 

. 310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 601>: 

gl49-l.seq 

1 ATGGCACAAA TCACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCGAC TTCGGGGCTG 

151 CTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACTTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT GGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACGGGCAGAC GGATTAAAGT ATTGAACCAT CACGGCGAAA CGGGCGATAT 

351 GGCGGACTTT TCTCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTTGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGAAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGaagccgG ATTGCGTTTG AGCAGCGGCA 

551 ATTTAGAAAA ACTGACATCC GCAGGCATCA ATATCGGACT GGGCAAAAAC 

601 TTCGTGCTGC ATACCGAAGG CTTGTACCGC AAATCGGGCG ATTACGCCGT 

651 AC C GC GT T AC CGCAATCTGA AACGCCTGCC CGACAGCCAT GCCGATTCGC 

7 01 AAACGGGCAG CATCGGGCTG TCTTGGGTGG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCGCTAC GGCCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATCA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACT TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG GCGACGGTGC 

951 ACACGCACAC ACCCACAACG GCAAACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAGC CATTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ATCTGAACCG CAATGACTAC CACCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTCT TCAACAACAA AACACACAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG CGCGCTTTCC GCCATTCCCG AAACCGTCCA 

1251 ACAACCGATG TTGATTGACA ACAATGTCCG CCATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAATTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGGTACGAC AAAGCATTGA TTGATCGAGA 

1401 AAACTACTAC AACCAGCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCGTT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACACCACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAACT 

1551 GTACGCACAC GGCAAGCACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA GCCTACCGCA ACCGATTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGT GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CGTCCCTACC CGGCAGGGAA GATCCCTACG GCAAACGTCC CTTCATCGCA 

1951 CAAGCCGACC AAAACGCCCC CCGCATTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAACC TCGCTAACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG TACGCCCGGA 

2101 CACCATATGC TCAACCTCGG TGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TAAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 602; ORF 149-1 .ng>: 

gl49-l.pep 

1 MAQITLKPIV LSILLINTPL I.AQA HETEQS VGLETVSWG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGEAGLRL SSGNLEKLTS AGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDRY GLPAHSHEYD DC HAD I IWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDGDGAHAH THNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY HHDEKAGDAV ENFFNNKTHN ARIELRHQPI GRLKGSWGVQ 

401 YLGQKSSALS AIPETVQQPM LIDNNVRHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NQPLPDLGAH RQTARSFALS GNWYFTPHHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA AYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DPYGKRPFIA 

651 QADQNAPRIP AARLGFHLKT SLTDRI DANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 603>: 

Bl49-l.oeq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGATTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC GTCGGGGCTG 

151 TTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACCTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT AGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCGTCTG CTCCCGTCAT TCGCGGTCAA 

301 ACAGGCAGGC GGATTAAAGT GTTGAACCAT CACGGCGAAA CAGGCGATAT 

351 GGCGGATTTT TCGCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTCGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG TTTTATCGGC 

751 GTAGCGTACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGCTTGATTA 

851 ACAAACGCTA TTTACAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

90 : GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG ACGATAATGC 

951 ACACGCACAC ACCCACAGCG GCAGACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGTGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTC GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGCATCG 

1151 AGTTGCGCCA CCAACCCATA GGTCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTACAAC AAAAATCCAG TGCTTTATCT GCCATATCCG AAGCGGTTAA 

1251 ACAACCGATG CTGCTTGACA ACAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGAGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT TCAGTACGAC AAAGCATTGA TTGATCGGGA 

1401 AAACTACTAC AACCACCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 TAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGCGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGAGAA GATGCCTACG GCAACCGTCC TTTCATCGCA 

1951 CAGGACGACC AAAATGCCCC CCGTGTTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTTCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 604; ORF 149-1 >: 

ml 4 9-1. pop 

1 MAQTTLKPIV LSILLINTPL LAQA HETEQS VDLETVSWG KSRPRATSGL 

51 LHTSTASDKI I SGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVI RGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVE1LR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 VAYSDRRDQY GL PAH SHE YD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDNAHAH THSGRPWIDL RNKRYELRAE WKQPFPGFEA 

351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

401 YLQQKSSALS AISEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIQYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 
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551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPFIA 

651 QDDQNAPRVP AARLGFHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL 5DTPQMGRSF 

751 TGGVKVKF* 

m!49-l/gl49-l 96.2% identity in 758 aa overlap 

10 20 30 40 50 60 

ml 4 9-1 . pep MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 
III I I I I I I I I II I I I I I I ! I I I I I II I I I I I I I I I I I I I I ! I I I I 1 t I I I I I I I I I ! 
gl49-l MAQI TLKPIVLSILLINT PLLAQAHETEQS VGLETVS WGKS RP RATS GLLH T STAS DK I 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 9-1 . pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I E I I I I I I I I I I I I I I I I I I I! I I I II I 
g 1 4 9-1 I SGDTLRQKAVNLGDALDGVPGI HASQYGGGASAPVIRGQTGRRIKVXNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9-1 . pep S PDHAIMVDTALSQQVEI LRGPVTLLYSSGNVAGLVDVADGKI PEKMPENGVSGELGLRL 

1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 n 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 

gl49-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGEAGLRL 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 9-1 . pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I I I I II I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I 
gl4 9-1 SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml49-l . pep S WVGEKG FI G VA Y SDRRDQYG L P AHS HE Y DDCH ADI IWQKSLINKRYLQLYPHLLTEEDI 
I I I I I I I I I I : I I I I I I I : I I I : I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
gl49-l SWVGEKGFIGAAYSDRRDRYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 9-1 . pep DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

I I I I I I I I I I I I I I : I I I I I I : I : I I I II I I I I 1 1 I I I I I I I I I \ I I I I I I I I I I I I I I 
g 14 9-1 DYDNPGLSCGFHDGDGAHAHTHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml49-l . pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
:! II I I I I I I I I I I I I :l : M I I 1 I I I I I I I I I 1 I I 1 I I I I I lllllllll Chill 
gl49-l HHDEKAGDAVENFFNNKTHNARIELRHQPIGRLKGSWGVQYLGQKSSALSAIPETVQQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml49-l.pep LLDNKVOHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGAH 
I: 11 : I M I I I I I I I I I I II I I I I I I I I I I I I I I II I: I I I I I I I I I I I I I :i I I I I I I I 
gl4 9-1 L I DNNVRH Y S FFG VEQANWDN FT LEGGVRVEKQKAS I RYDKAL I DREN Y YNQP L P DLG AH 

430 440 450 460 470 480 

490 500 510 520 530 540 

m!4 9-1 . pep RQTARS F Al S GNW YFT PQHKL S LT ASH QERLP S TQEL YAHG KH V ATN T FE VGN KHLNKER 

II I I I I t I I I I I I I I I I: I I I I I 1 I I I | I I I I I I | | | | | I | || I | | | 1 | | | | | | | | I || I 
g 14 9-1 RQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGWKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 4 9-1 . pep SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
I I I I 1 1 I I I I I t I I t I I I 1 1 I I I I M I I I I I I M I I I I M I I I f M I I I I I I I I I I I t 
g 14 9-1 SNNIELALGYEGDRWQYNLAAYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml4 9-1 . pep DFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I ihllllll 11111:1 
gl49-l DFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRIP 

610 520 630 640 650 660 
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670 680 690 700 710 720 

ml49-l .pep AARLGFHLKASLTDRIDRNLDYYRVFAQNKXARYETRTPGHHMLNLGANYRRNTRyGEWN 
I i I I I I I I I :| I I I I I I I I I I II I I I I I I (I I I I I I I I I I I II i | | | | | | | || | | | | || | 
gl 4 9 - 1 AARLGFHLKTSLTDRI DANLD Y YRVFAQNK LARYET RT PGHHMLNLGANYRRNTRYGEWN 

670 680 690 700 710 720 

730 740 750 759 

ml 4 9-1 . pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

III! Illlllllllllllllllilimillllllllll 

g 1 4 9- 1 W YVKADNLLNQSV YAHSS FLS DTPQMGRSFTGGVNVKFX 

730 740 750 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 605>: 

a!49-1.8oq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCTCCCAAG CGCATGGAAC TGAGCAATCA GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC TTCGGGGCTG 

151 CTGCACACTT CTACCGCCTC CGACAAAATC ATCAGCGGCG ACACCTTGCG 

201 ACAAAAAGCC GTCAACTTGG GTGATGCTTT AGACGGCGTA CCGGGCATTC 

251 ATGCCTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACAGGCAGAC GGATTAAAGT GTTGAACCAT CACGGCGAAA CGGGCGACAT 

351 GGCGGACTTC TCTCCAGACC ATGCAATCAT GGTGGACAGC GCCTTGTCGC 

401 AACAGGTCGA AATCCTGCGC GGTCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATTA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTTCACGACG ACGATGATGC 

951 ACACGCCCAT GCCCACAACG GCAAACCTTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG TGCTTTATCT GCCACATCCG AAGCGGTCAA 

1251 ACAACCGATG CTGCTTGACA ATAAAGTGCA ACATTACAGC TTTTTCGGTG 

13 Dl TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGCTACGAC AAAGCATTGA TTGATCGGGA 

1401 AAACTACTAC AACCATCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

14 51 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCG 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGGGAA GACGCCTACG GCAACCGCCC ACTCATTGCC 

1951 CAAGCCGACC AAAACGCCCC TCGCGTTCCG GCTGCGCGCC TCGGCGTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGCATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

22 51 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 606; ORF 149-l.a>: 

al 49-1 .pep 

1 MAQTTLKPIV LSILLINTPL LSQA HGTEQS VGLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPV1RGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDS ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

2 51 AAYSDRRDQY GLPAHSHEYD DCHADIIWQK SUNKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDDAHAH AHNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

401 YLGQKSSALS ATSEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIBLALGY 

551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPLIA 

651 QADQNAPRVP AARLGVHLKA SLTDRIDA^L DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

al49-l/ml49-l 98.0% identity in 758 aa overlap 

10 20 30 40 50 60 

al4 9-1 . pep MAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVSVVGKSRPRATSGLLHTSTASDKI 

ininiiiniiiii inn 1 1 1 1 1 1 1 { 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 

m!4 9-1 MAQTTLK PI VL S I LL INT PLL AQAH ETEQS VDLET VS VVGKS RPRAT S GLLHT STA S DK I 

10 20 . 30 40 50 60 

70 80 90 100 110 120 

a 14 9-1 .pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
I M I I M I I I I I I I I I I H I I I I ! I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I t I I 
ml4 9-1 I SGDTLRQKAVNLGDALDGVPGI HASQYGGGASAPVI RGQTGRRIKVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

al 4 9-1. pep SPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKNPENGVSGELGLRL 
!fHMIII:MMII II Ml IMIMIMIIIMI IMIII till INMIIINIill 
ml 4 9-1 S P DH A I MVDT AL S QQVE I LRG PVTLL YS S GN VAGL VDV A DG K I PEKM PENG VS GELG LRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

al4 9-2 .pep SSGNLEKLTSGGIKIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I I I I I II I I I I I I I I I I I I I I M I I 1 I i f I I I } I I I I I I I I I I I I I I I I I I I I I I I I I II 
ml49-l SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

al4 9-1 . pep SWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
MIIIIIIIMIIIIMIMMIIIIIimiMMIIISINIIIIIIIhl IMIUI 
ml 4 9-1 SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
250 260 270 280 290 300 

310 320 330 340 350 360 

al4 9-1 . pep D Y DN PGLSCG FHDDDDAH AHAHNGKPW I DLRNKRYEL RAEWKQP FP G FE ALRVHLN RND Y 
IMIIIIIIIIMII:|lll:|:t:|||IIIIIIIIIIIMMIIIIItllll lllllll 
ml 4 9-1 DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
310 320 330 340 350 360 

370 380 390 400 410 420 

al4 9-1 . pep RHDEJ^AGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKS SALS ATSEAVKQPM 

I I I I I I I M I I I I I I It 11 I I I I I I I ! II I I I I I I I I I 1 I I | I I I I I I I I I I I I I I I I 
ml 4 9-1 RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

al49-l . pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGAH 

I I I 1 I t I I I i I I r 1 1 I I t I I I I I I I 1 1 1 1 I I 1 1 I 1 1 1 r | 1 1 1 1 | | | 1 1 | | | f | | | 1 1 1 1 | 
ml 4 9-1 LLDNKVQHYS FFGVEQANWDNFTLEGGVRVEKQKAS I QYDKALIDRENYYNHPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

a 14 9-1 . pep RQT ARS FAL SGNW Y FT PQHKL S LT AS HQERL P STQELYAHGKHVATNT FE VGNKHLNKER 

MMIIlii (Ml I I I I I I I I M II II 1 1 I I I I I I I I I I II I I I I II I 

ml 4 9-1 RQT ARS FAL S GNW Y FT PQHK L S LT ASH QERLP STQELYAHGKHVATNT FE VGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

al49-l.pep SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

ii ii ii ii ii mm 1 1 ii mi m ii mm iii ii i mi inn i ii i iii ii mi i 

ml 4 9-1 SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
550 560 570 580 590 600 



610 520 630 640 650 660 
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a 14 9-1 . pep DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRVP 
I I I I I I I I I I I I I I I I I t I I I I I I I I i I I 1 1 t I 1 I I I I I I i I I I I i I : i I I i I I I I I I I 
ml49-l DFYGAEGEIYFKPTPRYRIGVSGDYVRGR1KNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
610 620 630 640 650 €60 

670 680 690 700 710 720 

a 14 9-1 .pep AARLiGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRHTRYGEWN 
lilfi I I I I I I I I I I I I I ! I I I I I I I I I I I I I I ! I 1 I I I 1 I i I I I I I I I I I I I I I I II I 
ml 4 9- 1 AARLGFHLKASLTDRIDANLDYYRVFACKKLARYETRTPGHHMLNLGANYRRKTRYGEWN 

670 580 690 700 710 720 

730 740 750 759 

al4 9-1 . pep W YVKADN LLNQS V YAH S S FLS DT PQMGRS FTGG VNVKFX 

Itll lilllllllitiii iiii iittltiiiiti iiiii 

ml4 9- 1 WYVKADN LLNQ S VY AH S S FLS DT PQMGRS FTGGVNVK FX 

730 740 750 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 607>: 

glSO.seq (partial) 

1 . . TACTGCAAGG CAGACCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT 

51 CACCGCCCGC CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA 

101 GCGGTTCGGA TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT 

151 GACAACGATC CGGCACTGGT CGGGGAAATC CTAGACCTGC TCGGCATCAA 

201 TCCGGCAACG GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG 

251 CACTGTTATC CCATTTCGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA 

301 GGCTATGCCA CGTTCGCCGA TAATGACGAA CTCGACCGTA TTGCTGCCGA 

351 CAACGCCGTT TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGGTGTGC 

401 TGCACCGCTT CCCGGCAAAA CTGACGGCGG AACAATTCGC CGGCCTGCTG 

451 CGCCCGCTTG CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGC 

501 GGGGGACGAA GTGCACCTGA CCGTCGGCGC AGTGCGTTTC GAACACGAAG 

551 GGCGCGCCAG GGCGGGCGGC GCATCGGGTT TCTTTGCCGA CCGGCTGGAA 

601 GAGGACGGCA CGGTGCGCGT GTTTGCGGAA CGCAACGACG GCTTCAGGCT 

651 GCCCGAAGAC AGCCG CAAGC CGATTGTGAT GATCGGCTCC GGTACCGGCG 

701 TCGCACCGTT CCGCGCCTTC GTCCAACAAC GTG CCGCAG A AAATGCGGAA 

751 GGCAGAAACT GGCTGATTTT CGGCAATCCG CATTTTGCCG CCGACTTCCT 

801 CTATCAGACC GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT 

851 ATGACTTCGC CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC 

901 AAAATCCGCG AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC 

951 GCATATCTAT GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GAAGTGGAAG 

1001 CCGCCTTGCT GGATGTGATT ATCGGGGCAG GGCATTCGGA CGAAGACGGC 

1051 GCAGAAGGAT ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA 

1101 TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 608; ORF 150.ng>: 

gl50.pep (partial) 

1 . . YCKADPFPAA LLANQKITAR QSDKDVRHIE IDLSGSDLHY L.PGDALGVWF 

51 DNDPALVGEI LDLLGINPAT EIQAGGKTLP VASALLSHFE LTQNTPAFVK 

101 GYATFADNDE LDRIAADNAV LQGFVOSTPI AGVLHRFPAK LTAEQFAGLL 

151 RPLAPRLYSI SSSQAEAGDE VHLTVGAVRF EHEGRARAGG ASGFFADRLE 

201 EDGTVRVFAE RNDGFRLPED SRKPIVMIGS GTGVAPFRAF VQQRAAENAE 

251 GRNWLIFGNP HFAADFLYQT EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD 

301 KIREQAEGLW QWLQEGAHIY VCGDAAKMAK EVEAALLDVI IGAGHSDEDG 

351 AEGYLDMLRE EKRYQRDVY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 609>: 
ml50.seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT CTGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AG CAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGGCGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACATCG CCGGCGAACG CCGCCTGCTG CTGGTTACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGAAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 
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451 GGCGACAGTT CCTATCCGAA TTTCTGTCAG GCAGGTAAAG ATTTCGACCG 

501 GCGTTTTGAA GAATTGGGCG CAAAACGGCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TACCGCCTCC GCAAACGCCT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGCCCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCCGGCAACG 

901 GAAATACAGG CGGGCGGAAA GATGATGCCG GTTGCGCGCG CACTTTCATC 

951 TCATTTCGAA CTCACGCAAA ACACTCCGGC TTTCGTCAAA GGCTATGCCG 

1001 CGTTCGCCCA TTATGAAGAA CTCGATAAAA TCATTGCCGA TAACGCCGTT 

1051 TTGCAGGATT TCGTGCAAAA CACGCCTATT GTCGATGTGC TGCACCGCTT 

1101 CCCGGCAAGC CTGACGGCAG AACAATTCAT CCGTTTACTG CGTCCGCTTG 

1151 CACCCCGTTT GTATTCGATT TCTTCAGCAC AGGCGGAAGT GGGCGATGAA 

1201 GTGCATTTAA CTGTCGGCGT GGTTCGTTTT GAACACGAAG GCCGCGCCAG 

1251 AACGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

13 01 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCOGAAGAC 

13 51 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 
1451 GGCTGATTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 
1501 GAATGGCAGC AGTTTGCCAA AGACGG CTTC CTGCACAGGT ACGATTTCGC 

15 51 CTGGTCCCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 
1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 
1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTG CGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 610; ORF 150>: 



ml5 0 .pep 




1 


MQNTNPPLPP 


51 


TALPAAEPFS 


101 


KNIAGERRLL 


151 


GDSSYPNFCQ 


201 


LLKEEAAKNR 


251 


QSDKDVRHIE 


301 


EIQAGGKMMP 


351 


LODFVONTPI 


401 


VHLTVGWRF 


451 


SRKPIVMIGS 


501 


EWQQFAKDGF 


551 


VCGDAAKMAK 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 150 shows 91.3% identity over a 369 aa overlap with a predicted ORF (ORF 150.ng) 
from N. gonorrhoeae: 

ml50/gl50 

210 220 230 240 250 260 

ml 50 . pep LLKEEAAKNRAT PAPQTTPPAGLQTAPDGRYCKAAPFPAALLANQKITARQS DKDVRHIE 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
9 1 50 YCKADPFPAALLANQKITARQS DKDVRHIE 

10 20 30 

270 280 290 300 310 320 

ml50.pep IDLSGSDT..HYLPGDALGVWFDNDPALVREILDLLGIDPATEIQAGGKMMPVARALSSHFE 
I MM I I M I I I I I I I I I I I IMI | M | | | | | | | | : || | | M M || :||| || |||| 
g!50 IDLSGSDLKYLPGDALGVWFDNDPALVGEILDLLGINPATEIQAGGKTLPVASALLSHFE 
40 50 60 70 80 90 



330 340 350 360 370 380 

ml 50. pep LTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPIVDVLHRFPASLTAEQFIRLL 
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I 1 t I I I I I l I I I I : I I : I I I : I II I I I I I I I I : I I I : I ! i I I I I : I I I I I I II 
g 1 50 LTQNTPAF\ r KGYATFADNDELDRIAADNAVLQGFVQSTPIAGVLHRFPAKLTAEQFAGLL 
100 110 120 130, 140 150 

390 400 410 420 430 440 

mlSO.pep RPLAPRLYSISSAQAEVGDEVHLTVGWRFEHEGRARTGGASGFLADRLEEDGTVRVFVE 
M MMMMIMI IM1 IMMI I M MM II M I MMMMMI III MUM I M! 
gl50 RPLAPRLYSISSSQAEAGDEVHLTVGAVRFEHEGRARAGGASGFFADRLEEDGTVRVFAE 

160 170 180 190 200 210 

450 460 470 480 490 500 

ml 50 . pep RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGKNWLIFGNPHFARDFLYQT 

i i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i r i j 1 1 f 1 1 f i f i inn 

g!50 RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGRNWLIFGNPHFAADFLYQT 
220 230 240 250 260 270 

510 520 530 540 550 560 

ml50 . pep EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 
I I I I I II II I I 1 1 I I I I I I 1 1 1 1 I II I 1 1 I 1 1 II I I I I I I I I I I 1 1 I il I IN 1 1 I I | M 
gl50 EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 

280 290 300 310 320 330 

570 580 590 600 

ml50 .pep DVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
: I I I I I I I I I I I I I I 11:111 II I I II I I I I I I I M ! I 
g 1 50 EVEAALLDVI IGAGHSDE DG AE G Y L DML RE E KR YQRDVYX 

340 350 360 370 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 61 1>: 

al50 . seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT ATGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGACGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AAT GCG AAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACATCG CCGGCGAACG CCGCCTGCTG CTGGT7ACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGGAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 

451 GGCGACAGCT CCTATCCGAA TTTCTGCCGG GCGGGCAAAG ATTTCGACAA 

501 ACGTTTTGAA GAATTGGGCG CAAAACGCCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TGCCGCCGCC GCAGACGGAT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGACCCCTT TGCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGA7TTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCAGGCAACG 

901 GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG CACTGTTATC 

951 CCATTTTGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA GGCTATGCCC 

1001 CGTTCGCCGA TGATGACGAA CTCGACCGTA TTGCTGCCGA CAACGCCGTT 

1051 TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGATGTGC TGCACCGCTT 

1101 CCCGGCAAAA CTGACAGCGG AACAATTCGC CGGCCTACTG CGCCCGCTTG 

1151 CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGT GGGGGACGAA 

1201 GTGCACCTGA CCGTCGGCGC GGTGCGTTTC GAACACGAAG GGCGCGCCAG 

1251 GGCGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

1301 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

1451 GGCTGTTTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 

1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT ACGATTTCGC 

1551 CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 

1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 

1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
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1701 GGATGTGATT ATCGGGGCAG GACATTTGCA CGAAGAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 



This corresponds to the amino acid sequence <SEQ ID 612; ORF 150.a> 

al50 . pep 

MQNTNPPLPP MPPEITQLLS GLDAAQWAWL SGYAWAKAGN 
TALPTAEPFS VTVLSASQTG NAKSVADKAA 
KNIAGERRLL LVTSTQGEGE 
GDSSYPNFCR AGKDFDKRFE 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



DSLEAAGIQV 
PPEEAWLHK LLNGKKAPKL 
ELGAKRLLER VDADLDFAAA 



LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKADPFPAA 
LPGDALGVWF DNDPALVREI 
LTQNTPAFVK GYAPFADDDE 



QSDKDVRHIE I DLSGSDLHY 
EIQAGGKTLP VASAL LSHFE 



LQGFVQSTPI ADVLHRFPAK LTAEQFAGLL RPLAPRLYSI 
VHLTVGAVRF EHEGRARAGG ASGFLADRLE EDGTVRVFVE 
SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLFFGNP 
EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW 
VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE 



GASAGLPALQ 
SRAELKDYKA 
DKLQFAVLGL 
ADGWTDNIAA 
LLANQKITAR 
LDLLGIDQAT 
LDRIAADNAV 
SSSQAEVGDE 
RNDGFRLPED 
HFARDFLYQT 
QWLQEGAHIY 
EKRYQRDVY* 



ml 50/al 50 94.8% identity in 599 aa overlap 

10 

ml50.pep 



20 



30 



40 



al50 



50 



60 



MQN TN P PLP P L P PEI TQLLS GLDAAQWAWLS GY AWAKAGNG AS AGL P A1QT AL P AAE PFS 
I I I I I I I I I I :H I I I f I I I I I I I I I I I I I 111111111111111111 IMIhMIM 
MQNTNPPLPPMPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPTAEPFS 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml50 . pep VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYBCAKNIAGERRLLLVTSTQGEGE 

MII MIMIMIIMIMMMMMIMMMMIM llillllll IIII1IIIMM 

a 150 VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGEF.RLLI.VTSTQGEGE 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 50 . pep PPKEAWLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCQAGKDFDRRFEELGAKRLLER 

I I : M 1 1 1 1 I 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 | ! | 1 1 : 1 1 1 1 | [: | 1 1 1 1 1 1 1 1 1 i 1 1 

al50 PPEEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCRAGKDFDKRFEELGAKRLLER 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml 50 . pep VDADLD FT AS AN AWT DN I AALLK EE AAKN RAT PA PQTT P PAGLQT A P DGRY C KAA P FP AA 

I I I I I I I : I : I :: I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I i I I I 1 I I I I I I I 
al50 VDADLD FAAAADGWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKADPFPAA 

190 200 210 220 230 240 

250 260 270 280 290 300 

mlSO.pep LLANQKrTARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPAT 

HiMIMMHimilMllllllllllllllilimm Ill MM II II 

al50 LLANQKITARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDPALVREI LDLLGIDQAT 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 50 . pep EIQAGGKMMPVARALSSHFELTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPI 
NIMH II MMIMM MMMM II :|||:| I 1 I 1 1 I I IIMIII 

a 150 EIQAGGKTLPVASALLSHFELTQNTPAFVKGYAPFADDDELDRIAADNAVLQGFVQSTPI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 50 . pep VDVLHRFPASLTAEQFIRLLRP^APRLYSI SSAQAEVGDEVHLTVGWRFEHSGRARTGG 

= I t I I t I 1 I = I 1 1 1 1 I II M I I i I I M I 1 I : I II I I I I I 1 I I I I : I II I ! I I I i I : I I 
al50 ADVLHRFPAKLTAEQFAGLLRP1APRLYS I SS SQAEVG DEVH LT VG AVRFEHEGRARAGG 

370 380 390 400 4.10 420 

430 440 450 460 470 480 

ml 50 . pep ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 

linn ii i iiniiii nun ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 u in 

al50 ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 
430 440 450 460 470 480 

490 500 510 520 530 540 

ml 50 . pep GKNWLIFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
I I M I •■ ! HI I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
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al50 GKNWLFFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
490 500 510 520 530 540 

550 560 570 580 590 600 

ml 50 . pep QWLQEGAHI YVCGDAAKMAKDVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 

I t t It I I I ! I I I I I I I I I I I I I I I I I I I I I 1 I I I I I t I I I I I I I I I I I | I I I I | | | I | | | 
a 150 QWLQEGAHIYVCGDAAKMAKDVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 

550 560 570 580 590 600 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 613>; 
gl51 .seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCACACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTG 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TGGAGCGCGT TTTGGGGATG GTGGATTGCG TCGTCTTGTT GGTGGACGCA 

301 CAGGAAGGTC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAACCGTCCG 

401 CCCGTCCGAG CTGGGTTATC GACCAGACTT TCGAGTTGTT CGACAACTTG 

451 GGTGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGTTT 

501 GAGCGGCTTT GCCAAGCTGG AAGAAAccga CGAGAGCAGC GATATGCGCC 

551 CGCtgttcgA CACCATCCTA AAATACAcgc CtgCACCGAG CGGCAGCGCG 

601 GACGAGCCGC TGCAACTGCA AATT7CCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 

701 AAACCGTTGC CGTGATGAAC CACGAGCAGC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGC7TGGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATTGTGA TTAT7TCCGG TATCGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTAAACA CCAGCCCGCT 

951 CGCAGGTACA GAAGGCAAAT TCGTGACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACG7TGCCC TGCGCGTGGA AGACACCGCC 

1051 GatgCCGACG TGTTCCGCGT ATCcgGGCGC GGCGAACTGC ACCTGACGAT 

1101 TTTGCTGGAA AATATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAGC 

1151 CGCGCGTCGT GTACCGAGAC ATCGACGGTC AAAAATGCGA ACCTTATGAA 

1201 AACCTGACTG TGGACGTACC cgacgacAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACCAA TATGGAAAGC GACGGCAACG 

1301 GacgCACCCG CCTCGAATAC CATA7TCCAG CGCGCQGCTT GATCGGTTTC 

1351 CAAGGCGAAT TCATGACCCT GACGCGCGGC GTCGGGCTGA TGAgccacGT 

1401 GTTcgacgac tacgcgcccg tcaAACCCGA TATGCCCGGC CGCCACAACG 

1451 GCGTactggt GtcccaAGAG CAGGGCGAGG CGGTTGCTTA CGCCTTGTG3 

1501 AATCTTGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGCCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTCACCAATA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCGGTGC GCCTGACCAC GCCGATCAAA CTGAcgcTGG AAGGCGCGGT 

1701 CGAGTTTATC GACGATGACG AGCTGGTGGA AATCACGCCG CAAtCCatCC 

1751 gcctgcgcat gcgttacctG AGCGaattgg aacgccgccg t caTTTTAAA 

1801 AagctgGATT AA 

This corresponds to the amino acid sequence <SEQ ID 614; ORF 151.ng>: 

gl51.pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGCHIN IVDTPGHADF GGEVERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESS DMRPLFDTIL KYTPAPSGSA 

201 DEPLQLQISO LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HEQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG DIVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HI PARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
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501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRMRYL SELERRRHFK 

601 KLD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 615>: 

mlSl.seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCAC AT CAAT ATCGTCGACA CGCGGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGCGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAGCCGTCCG 

4 01 CTCGTCCGAG CTGGGTTATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGGTT 

501 GAGCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 

701 AAACCGTTGC CGTCATGAAC CACGATCAGC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTGGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATCGAAGAC ATCGGTATCG 

851 GCGTAACCAT C AC CGACAAA GACAATCCCA AAGGCCTACC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGCT 

951 GGCGGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGATGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

1401 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCCGGC CGCCACAACG 

1451 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGCATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCAATCAAG CTGACGCTGG AAGGTGCGGT 

1701 TGAGTTTATC GACGATGACG AACTCGTTGA AATCACGCCG CAATCCATCC 

1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCACTTTAAA 

1801 AAGCTGGATT GA 

This corresponds to the amino acid sequence <SEQ ID 616; ORF 151>: 

ml51 .pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGEVERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTITD K DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 

601 KLD* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted QRF from N. gonorrhoeae 

ORF 151 shows 99.2% identity over a 603 aa overlap with a predicted ORF (ORF lSl.ng) 
from N. gonorrhoeae: 

ml51/g!51 

10 20 30 40 50 60 

ml 51. pep MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEK5RGITILAKNT 

r 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 i r 1 1 1 1 1 1 1 1 

gl51 MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANCXJVDERVMDSNDLEKSRGITILAKNT 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml51 . pep AIDYEGYHINIVDTPGEADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 

I HI I I I INI II IIIUIII III III III I || I I I E M f I i I I I I I I 1 I I I I 

gl51 AIDYEGCHINIVDT PGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRBVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml51 . pep KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 
t M i I I I I I I M I I I I I I i I I II I I [ I I I I ( I I | I 1 | I | | | | | | | | | j | | | | I I I I I I t : 
gl51 KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSG5-AKLEETDESS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 51. pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 
M I I I I I I I I I I I II I I I I II I f 1 I 1 t I I t t I I I 1 t I I I I I I I ! I I I I I 1 I I I I I I I I t 
gl51 DMRPLFDTILKYTPAPSGSADEPLQLQISQLDYDNYTGRLGIGRlIiNGRTKPGQTVAVMN 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml51.pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 
l:IMIMMMII IIIMIMIII III il MIIIIIIIIIIIIIIMfMIIIIIIII! 
gl51 HEQQIAQGRINQLLGFKGLERVPLEEAEAGDIVII5GIEDIGIGVTITDKDNPKGLPMLS 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 51 . pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

1 1 1 1 1 1 1 < 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 f 1 1 1 f in 

gl51 VDEPTLTMDET4VNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 
310 320 330 340 350 360 

370 380 390 400 410 420 

ml51.pep GELHLTILLENMRREGYELAVGKPRWYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 

MIIMI I I I I M 1 I I I I I I I I I M : I M I I M ! I I II M I I I I I II I ! M I I I I 

gl51 GELHLTILLENMRREGYELAVGKPRWYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 51 .pep RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl51 RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml51 .pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIKSRDNDLVVNPLKGKK 

I I M I 1 1 I I I I I I I I 1 1 I I I I 1 1 I I | | | 1 1 | 1 1 | 1 1 I I 1 1 I I M 1 1 1 1 I I 1 1 I I I M I I I 
g!51 RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIKSRDNDLWNPLKGKK 
490 500 510 520 530 540 

550 560 570 580 590 600 

ml51.pep LTNIRASGTDEAVRIiTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 
M I I II I I I I I I I I I I II I I I I I I I I I I I M | M | M | | | | | | | | I | | ! | | | | | | | | | | 
3151 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRMRYLSELERRRHFK 

550 560 570 580 590 600 
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ml 51. pep KLDX 
(III 

gl51 KLDX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 617>: 

al51. seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGAGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA TAAAATCGAC AAACCGTCCG 

401 CCCGTCCGAG CTGGGTCATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

451 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTATG CTTCCGGTCT 

501 GTCCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGTATC AAGCCCGGTC 

701 AAGTTGTTGC CGTCATGAAC CACGATCAAC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTAGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATTGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGTT 

951 GGCAGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGACGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGG CTT GATCGGCTTC 

1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

1401 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCTGGC CGCCACAACG 

1451 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCGATTAAG CTGACGCTGG AAGGTGCGGT 

1701 CGAGTTTATC GACGATGATG AGCTGGTAGA AATCACGCCG CAATCCATCC 

1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCATTTCAAA 

1801 AAGCTAGATT GA 

This corresponds to the amino acid sequence <SEQ ED 618; ORF 15La>: 
alSl.pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANCQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQWAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 

601 KLD* 

ml 5 1/al 5 1 99.8% identity in 603 aa overlap 

10 20 30 40 50 60 
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ml 51 .pep MKQTRNTAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 

1 1 m i n 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 n ! 1 1 1 1 1 1 1 1 n i i 1 1 1 1 1 1 1 1 1 

al51 MKQIRNIAIIAHVDHGKTTLVDOLLRQSGTFRANQQVDERVMDSNDLEKERGITTLAKNT 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 51 . pep AIDYEGYHIN2 VDTPGHADFGGEVERVLGMVDCVVLLVDAQEGPMPQTRFVTKKALALGL 

I I I I I I I I I I I I I I I I t I I I I I I I I I I ! I II II I I I I I I I I I I I I I I J I I I I I I I I I | I I 
a!51 AIDYEGYHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 51 .pep KPIVVINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 

1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 i 1 1 1 1 n i n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 

al51 KPIWINKIDKPSARPSWVIDCTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml 51 .pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRJLGIGRILNGRIKPGQTVAVMN 
I I I I I I M I I I I I I I I I I t I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I 
a 1 5 1 DMRPLFDTI LK YTPAPSGS ADETLQLQI SQLDYDNYTGRLGIGRILNGRI KPGQVVAVMN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 51 . pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 
I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I II I II | ! I I I | K 
al51 HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml 5 1 . pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 
I I I I I I I I I I i I I I I I II I I I I ! I I I I I I I I I I I I I I I I I I I II I I I I I I I I t I i I I I I I 
alSl VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVBDTADADVFRVSGR 

310 320 330 340 350 360 

370 380 390 400 410 420 

mi 5 1 . pep gelhltillenmrregyelavgkprvvyrdidgqkcepyenltvdvpddnqgavmeei.gr 

I I I I I I I I M M I I I I I I I 1 I I I I M 1 I I I I 1 ! I I t I I t I I I I I I 1 I I I I I I I I N I I I I 

a 15 1 gelhlti llenmrregyelavgkprvvyrdi dgqkcepyenltvdvpddnqgavmeelgr 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml5 1 . pep RRGELTNMESDGNGRTFLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 : i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al51 RRGELTNMESDGNGRTRL.EYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 
430 440 450 460 470 480 

490 500 510 520 530 540 

ral51 .pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLVVNPLKGKK 

i 1 1 1 M I [ 1 1 1 1 1 1 1 1 li 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 i 1 ii i ii 1 1 1 1 1 1 

a 15 1 RHNGVLVSQEQGEA VA YALWNLE DRGRMFVS PNDKI Y EGMI ~ G I HSRDN DLWN PLKGKK 

490 500 510 520 530 540 

550 560 570 580 590 600 

mlSl.pep LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 
I I I I i I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! t I I I I I I I I I I I I I 1 I I i I I 
a 151 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 

550 560 570 580 590 600 



ml 51. pep KLDX 
I I I I 

al51 KLDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 619>: 

gl52 . seq 

l ATGAAAAa c a aAACCaaagt ctgGGacttc cCcacccgcc ttTTCCactG 

51 GctgcttgCC gCATCCctgc CCTTTATGTG gtatagCGCA AAAGCCGGCG 

101 GcgataTGCT GcaatgGCAC ACGCGCGTCG GGCTGCTCGT CCTTTTCCTG 
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151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAgcgATA CCGCCCGTTT 

201 CTCCCgTtTC GTCCGAGGTT GGGCAGGTAT ACGCGGCTAT CTGAAAAAcg 

251 gCATTCCCGA ACAtatcCAG CCCGGACACA ACCCCTTGGG CGCACTgatg 

301 gtcGTTGCGC TTTTGgccgc cgtcTCATTT CAagtcggcA CGGGGCTTTT 

3 51 Tgccgccaat gaaaacacct tcagcaCCAa cggctacctc aaccatttgg 

401 tttccgaaca tacgGGCAGC CTTATACGGA AAATCCACCT CAACTTTTTC 

451 AAGCTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGCCG TCGCCGCATA 

501 CCGCATATTC AAAAAGAAAA ACCTCGTCCG CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 620; ORF 152.ng>: 

gi52 .pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLLVLFL 

51 LVFRLCWGI W GSDTARFSRF VRGWAGIRGY LKNGIPEHIQ PGHNPLGALM 

101 WALLAAVSF QV GTGLFAAN ENTFSTNGYL NHLVSEHTGS LIRKIHLNFF 

151 KLLAVFS AVH IAAVAAYRIF KKKNLVRPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVAALA AAAILLL S* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 62 1>: 

ml52 . seq 

1 ATGAAAAACA AAACCAAAGT CTGGGACCTC CCCACCCGCC TTTTCCACTG 

51 GCTGCTTGCC GCGTCCCTGC CCTTTATGTG GTATAGCGCG AAAGCCGGCG 

101 GCGATATGCT GCAATGGCAC ACGCGCGTCG GGCTGTTCGT CCTTTTCCTG 

151 CTCGTATTTC GCCTCTGCTG GGGCATTTGG GGCAGCGATA CCGCCCGTTT 

201 TTCCCGTTTC GTC CAAGGCT GGGCAGGCAT ACGCGGCTAT CTGAAAAACG 

251 GTATTCCCGA ACACATCCAG CCCGGACACA ACCCCTTGGG CGCACTGATG 

301 GTCGTTGCGC TTTTGGCCGC CGTGTCCTTC CAAGTCGGCA CCGGGCTTTT 

351 TGCCGCCGAT GAAAACACCT TCAGCACCAA CGGCTACCTC AACCATTTGG 

401 TTTCCGAACA TACGGGCAGC CTTATGCGGA AAATCCACCT CAACTTTTTC 

451 AAGCTGCTCG CCGTTTTTTC TGCAATCCAC ATCGCCGCCG TCGCCGCATA 

501 CCGCGTATTC AAAAAGAAAA ACCTCATCCT CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCAGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CTCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 622; ORF 152>: 

ml52 .pep 

1 MKNKTKVWDL PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLFVLFL 

51 LVFRLCWGI W GSDTARFSRF VQGWAGIRGY LKNGIPEHIQ PGHNP LGALM 

101 WALLAAVSF QV GTGLFAAD ENTFSTNGYL NHLVSEHTGS LMRKIHLNFF 

151 KLLAVFSAIH IAAVAA YRVF KKKNLILPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVASLA AAAILLL S* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N.gonorrhoeae 

ORF 152 shows 95.4% identity over a 218 aa overlap with a predicted ORF (ORF 152.ng) 
from N. gonorrhoeae: 

ml52/gl52 



10 20 30 40 50 60 

ml5 2 . pep MKNKTKVWDLPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWGIW 

mmiihiiimiiiiiiiiiiiiiMiiiiiiiiiiiiihiiiiiiiMiiiii 

g!5 2 MKNKTKVWDFPTRLFHWLLAAS LP FMWYSAKAGQDMLQWHTRVGLLVLFLLVFRLCWG I W 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml5 2 . pep G SDTARF S RFVQGWAG I RGY L KNG I PEH 1 QPGHN P LGALMWALLAAVS FQ VGTGL FAAD 

I I I I I I I I I I I : I ! I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I : 
gl52 G SDTARFS RFVRGWAGI RG Y L KNG I PE H I QPGHNPLGALMWALLAAVS FQVGTGL FAAN 

70 80 90 100 110 120 
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130 140 150 160 17C 180 

ml52 . pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAIHIAAVAAYRVFKKKNLILPMI 

IMIIIIIII!IIIIIII!||:||||||||||||||||:MI||||||:|||||!: Ill 
gl52 ENTFSTNGYLNHLVSEHTGSLIRKIHLNFFKLLAVFSAVHrAAVAAYRIFKKKNLVRPMI 
130 140 ISO 160 170 180 



190 200 210 219 

ml52 . pep TGFKYI EGKTS IRFAGKAALAAALSVASLAAAAILLLSX 

II III IIIIIIIIIIMMIIIMIIhll Mill MM 

gl52 TGFKYIEGKTSIRFAGKAALAAALSVAALAAAAILIiLSX 
190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 623>: 

al52.seq 

1 ATGAAAAACA AAACCAAAGT CTGGGACTTC CCCACCCGCC TTTTCCACTG 

51 GCTGCTTGCC GCATCCCTAC CCTTTATGTG GTATAGCGCG AAAACCGGCG 

101 GCGATATGCT GCAATGGCAC ACGCGCGTCG GGCTGTTTAT CCTTTTCCTG 

151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAGCGATA CCGCCCGTTT 

201 CTCCCGTTTC GTCCGCGGAT GGTCGGGTAT CAGAGAGTAT ATGAAAAACG 

251 GTATTCCCGA ACACGTCCAA CCCGGACACA ACCCCTTGGG CGCACTGATG 

301 GTCGTTGCGC TTTTGGCCGC CGTGTCGTTC CAAGTCGGCA CAGGGCTTTT 

351 TGCCGCCGAT GTAAACACCT TCAGCACCAA CGGCTACCTC AACCATTTGG 

401 TTTCCGAACA TACGGGCAGC CTTATGCGGA AAATCCATCT CAACTTTTTC 

451 AAACTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGNCG TCGCCGCATA 

501 CCGCGTGTTC AAAAAGAAAA ACCTCGTCCT CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 624; ORF 152.a>: 

al52 .pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KTGGDMLQWH TR VGLFILFL 

51 LVFRLCWGI W GSDTARFSRF VRGWSGIREY MKNGIPEHVQ PGHNP LGALM 

101 VVALLAAVSF QV GTGLFAAD VNTFSTNGYL NHLVSEHTGS LMRKIHLNFF 

151 KLLAVFSAVH IAXVA AYRVF KKKNLVLPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVAALA AAAILL LS* 

ml 52/al 52 94.0% identity in 2 1 8 aa overlap 

10 20 30 40 50 60 

ml 52 . pep MKNKTKVWDLPTRLFHWLIJASLPFMWYSAKAGGDMI^WHTRVGLFVLFLLVFRLCWGIW 
I I I I I II I I: I I I I 1 1 1 I II I I I I I 1 1 I 1 1 I : I 1 1 I 1 1 I I I 1 1 1 1 1 : I I I I M I I I I I 1 1 
al52 MKNKTKWDFPTRLFHWLIJVASLPFMWYSAKTGGDMLQWHTRVGLFILFLLVFRLCWGIW 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml52.pep GS DTARFS RFVQGWAG I RG YLKNG I PEH I QPGHNP LGALMWALLAAVS FQVGTGLFAAD 

I I I I I I I I I I I : I I : I ! I I : I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al52 GSDTARFSRFVRGWSGIREYMKNGIPEHVQPGHNPLGALMWALLAAVS FQVGTGLFAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 52. pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAIHIAAVAAYRVFKKKNLILPMI 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I : I I I I II I I I I I I I I I: I II I 
al 52 VNTFSTNGYLNHLVSEHTGS1MRKIHLNFFKLLAVFSAVHIAXVAAYRVFKKKNLVLPMI 

130 140 150 160 170 180 



190 200 210 219 

m!52 .pep TGFKYIEGKTSIRFAGKAALAAALSVASLAAAAILLLSX 
lllllllili 111! Ill lliilll III: II I IMII1II 
al 52 TGFKYIEGKTSIRFAGKAALAAALSVAALAAAAILLLSX 

190 200 210 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 625>: 



gl53 .seq 



1 


atggggtttg 


cttaCAgtat 


gacgtatatc 


gaggtCGGGa 


taccggaggc 


51 


ggcatccgtc 


ctttCgctGC 


CCGAGATgat 


gcgcctgatG 


GTGTTtCagg 


131 


attATGGTTT 


TttggcCGAA 


GTGATGTTTG 


TGctgaCTTT 


cGGCGcgcCG 


151 


GTTCTGTTtC 


TGCTGCTGTG 


CCTGTATGTC 


TATGCCGCGC 


TGATACGGAA 


201 


ACAGGCGTAT 


CCTGCGCTGC 


GTTTGGCAAC 


GCGTGTGATG 


GTGCGCTTGA 


251 


GGCAGGCGAT 


GATGGTGGAT 


GTGTTTTTTG 


TTTCCACTCT 


GGTGGCGTAT 


301 


ATCAAGCTCT 


CGTCTGTGGC 


AAAGGTTCGC 


TTCGGGCCGG 


CGTTTTATCT 


351 


GATGTTCGCG 


CTGTCGGTTA 


TGCTGATTCG 


GACTTCGGTA 


TCGGTTCCCC 


401 


AGCATTGGGT 


G TATTTC CAA 


ATCGGGCGGC 


TGACGGGGAA 


TAATGCGGTT 


451 


CAGACGGCAT 


CGGAAGGCAA 


AACCTGTTGC 


AGCCGCTGCC 


TGTATTTccg 


501 


cgacAGTgcc 


gaatccCCCT 


GCGGGGTGTg 


cgGCGcggaA 


CTgtacggcg 


551 


gacggccgaa 


aagtCTGAGt 


atttCgtCGG 


CGTTTCTgac 


ggcggcggTT 


601 


GTTTTGTATT 


TCCctgCcaa 


TATCctgccg 


attaTGAttt 


cgtccAATCc 


651 


tgccgccacg 


GAGGcCAACA 


CCATCTTTAG 


CGGCATCGCT 


TATATGTGGG 


701 


ACGagggcgA 


CAGGCTGATT 


GCGGCGGTTA 


TTTTCAGCGC 


GAGTATTTTG 


751 


GTGCCGGTGC 


TGAAGATTGC 


GGCAATGTCG 


GTTTTGATTG 


CGGCGGCACG 


801 


GTTCGCTTTG 


CCGGCGGGCG 


CAAAGAAATT 


GTCGCACCTC 


tacCGCATCA 


851 


CCGAAGCGGT 


CGGCCGCTGG 


TCGATGATTG 


ATATTTTTGT 


GATTATTATT 


901 


TTGATGTGTT 


CGTTCCacaC 


TTATGCCGCG 


CGCGTCATTC 


CGGGCAGTGC 


951 


GGCAGTCTAT 


TTCTGCCTGG 


TCGTGATTTT 


GACGATGCTG 


TCCGCCTATT 


1001 


ATTTCGACCC 


GCGCCTGCTT 


TGGGACAAAC 


GCGCTTCAGA 


CGGCATTGCT 


1051 


TTCAACGAAA 


CGGAAAAATA 


TGACTGA 






This corresponds to the amino acid sequence <SEQ ID 626; ORF 153.ng>: 


gl53 .pep 












1 


MGFAYSMTYI 


EVGIPEAASV 


LSLPEMMRLM V FQD YG F LAE 


VMFVLTFGAP 


51 


VLFLLLCLYV 


YAALIRKQAY 


PALRLATRVM VRLRQAMMVD 


VFFVSTLVAY 


101 


IKLSSVAKVR 


FGPAFYLMFA LSVNLIRTSV SVPQHWVYFQ 


IGRL.TGNUAV 


isi 


QTASBGKTCC 


SRCLYFRDSA 


ESPCGVCGAE 


LYGGRPKSLS 


ISSAFLTAAV 


201 


VLYFPANILP 


IMISSNPAAT 


EANTIFSGIA YMWDEGDRLI 


AAVIFSASIL 


251 


VPVLKIAAMS 


VLIAAARFAL 


PAGAKKLSHL YR I TEAVGRW 


SMIDIFVIII 


301 


LMCSFHTYAA 


RVIPGSAAVY 


FCLWILTML 


SAYYFDPRLL 


KDKRASDGIA 


351 


FNETBKYD* 











The following partial DNA sequence was identified in N. meningitidis <SEQ ID 627>: 

xnl53 .seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACTTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGGCCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCAGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

401 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

451 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

701 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 

851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATT T T T GT GATTATTATT 

901 TTGATGTGTT CGTTCCACAC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 628; ORF 153>: 

ml 5 3 .pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLC LYV YAALIRKQAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 
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101 IKLSSVAEVR FG PAPYLMFA LSVMLIRTSV SVPQIWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAB LYRRRPKSLS IS5 AFLTAAV 

201 ILYFPANILP I MISSNPAAT EVWTILNGIA YMWDEGDRLI AAVIFSASIL 

251 VPVLKIAAMS VLIAS ARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

301 LMCSFHTYAA RVIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

3 51 FNETEKHD* 



ml53 / gl53 96.1% identity in 358 aa overlap 



10 20 30 40 50 60 

ml 5 3 . pep MAFAYGMTYI EVGI PGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 

hllhlllllllll IIIIMIIIilllllllllltlllMlllllillMlliiillt 

gl 53 MGFAYSMTYIEVGI PEAASVI^LPEMMRIjMVFQDYGFIjAEVMFVLTFGAPVLFLLLCLYV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 53 . pep YAAL I RKQAYPAIjRLATRVMWLRQAMMVDVFFVS TLVAYI KLSS VAEVRFG PAFYLMFA 

1 1 U 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 H II M h I f 1 1 1 1 1 1 1 1 1 1 

gl 5 3 YAAL I RKQAY PALRLATRVMVRLRQAMMVDVFFVS TLVAYI KLSS VAKVRFG PAFYLMFA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 53 . pep LS VMLIRTS VS VPQHWVYFQ IGRLTGDNAVQTAS E GKTCCSRC LYFRDS AE 3 PCGVCGAE 

l!ltllMlillllllllii]||li|:iiil||illi!tlllllllll||||!||||||| 

gl 53 LSVMLIRTSVSVPQHWVYFQIGRLTGNNAVQTASEGKTCCSRCLYFRDSAES PCGVCGAE 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml53 . pep LYRRRPKSLS IS SAFLTAAVILYFPANILPrMISSNPAATBVKTILNGIAYMWDEGDRLI 

II I i M ! 1 1 1 1 1 1 1 1 ! I h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I ! I •! I ! - 1 1 1 1 !! 1 1 1 1 1 1 1 

gl53 LYGGR PKS LSI S SAFLTAA WLYF PANI LP IM I S SN P AATEANTI FSG I AYMWD EGD RL I 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml5 3 . pep AAVI FSAS I LVPVLKI AAMSVL IAS ARFALPTGAKKLSHLYRI TEAVGRWSMI DIFVI 1 1 

IIIIIMIMII II miUlllhlllllMIIMSMillllillllUlilllill 

gl53 AAVI FSAS I LVPVLKI AAMSVLIAAARFALPAGAKKLSHLYRITEAVGRWSMIDIFVI 1 1 

250 260 270 280 290 300 



310 320 330 340 350 359 

ml5 3 . pep LMC S FHT YAARV I PGS AAVYFC LWI LTMLSAYYFD PR LLWDKRAS DG I AFIJETEKHDX 

IlilllliillMIIMIIIIMIillMlillllllllllMllilNII Nihil 

gl53 LMC 3 FHTYAARV I PG S AAVYFCLWI LTMLSAYYFD PRLLWDKRAS DG I AFNETEKYDX 

310 320 330 340 350 



The following partial DNA sequence 



al53.seq 






1 


ATGGCGTTTG 


CTTACGGTAT 


51 


GGCATCCGTC 


CTTTCGCTGC 


101 


ATTATGGTTT 


TTTGGCCGAA 


151 


GTTCTGTTTC 


TGCTGCTGTG 


201 


ACAGGCGTAT 


CCTGCGCTGC 


251 


GACAGGCGAT 


GATGGTGGAT 


301 


ATCAAGCTCT 


CGTCTGTGGC 


351 


GATGTTCGCG 


CTGTCGGTTA 


401 


AGCATTGGGT 


GTATTTTCAA 


451 


CAGACGGCAT 


CGGAAGGTAA 


501 


CGACAGTGCC 


GAATCCCCCT 


551 


GACGGCCGAA 


AAGTCTGAGT 


601 


ATTTTGTATT 


TCCCTGCCAA 


651 


TGCCGCCACG 


GAGGTCAATA 


701 


ACGAGGGCGA 


CAGGCTGATT 


751 


GTGCCGGTAC 


TGAAGATTGC 


801 


CTTCGCTTTG 


CCAACGGGTG 



identified in N. meningitidis <SEQ ID 629>: 

GACGTATATC GAGGTCGGGA TACCGGGTGC 
CCGAGATGAT GCGCCTGATG GTGTTTCAGG 
GTGATGTTTG TGCTGACCTT CGGCGCGCCG 
CCTGTATGTC TATGCCGCGC TGATACGGAA 
GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 
GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 
AGAGGTTCGC TTCGGATCGG CGTTTTATCT 
TGCTGATTCG GACTTCGGTA TCGGTTCCCC 
ATCGGGCGGC TGACGGGGGA TAATGCGGTT 
AACCTGTTGC AGCCGCTGCC TGTATTTCCG 
GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 
ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 
TATCCTGCCG ATTATGATTT CGTCCAATCC 
CCATCCTTAA CGGCATCGCT TATATGTGGG 
GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 
GGCAATGTCG GTTTTGATTG CGTCCGCCCG 
CAAAGAAATT GTCGCACCTC TACCGCATCA 
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851 CCGAAGCGGT CGGCCGCTGG 

901 TTGATGTGTT CGTTCCACAC 

951 GGCAGTCTAT TTCTGCCTGG 

1001 ATTTCGACCC GCGCCTGCT7 

1051 TTCAATGAAA CGGAAAAACA 



TCGATGATTG ATATTTTTGT GATTATTATT 
TTATGCCGCG CGCGTCATTC CGGGCAGTGC 
TCGTGATTCT GACGATGCTG TCCGCCTATT 
TGGGACAAAC GCGCTTCAGA CGGCATTGCT 
TGACTGA 



This corresponds to the amino acid sequence <SEQ ID 630; ORF 1 53. a>: 

al53.pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLCLYV YAALIRKQAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 

101 IKLSSVAEVR FGS AFYLMFA LSVMLIRTSV SVPQKWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISSAFLTAAV 

201 ILYFPANILP IM ISSNPAAT EVNTILNGIA YMWDEGDRL I AAVIFSASIL 

251 VPVLKIA AMS VLIASARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

301 LMCSFHTYAA R VIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

351 FNETEKHD* 



ml53/al53 99.7% identity in 358 aa overlap 

10 20 30 40 50 60 

ml53 . pep MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 
I 1 1 M 1 I 1 1 I I I I I 1 I I 1 1 I 1 1 I I I I I I I E I I I I I I I 1 1 I 1 1 I I I I 1 1 1 1 M !! i I 1 1 I I 
al53 MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 53 . pep YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLS SVAEVRFGPAFYLMFA 

I I I I I I I I I I I I I I I I I I I I I I I I II i I I I I I I I I I ! I I M I I I I I I I I I i I I I I I I I I 
al53 Y AAL I RKQA YP ALR LAT R VMVRLRQAMMVDV FFVSTL VAY I KLS S VAE VR FGS A FYLM FA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 53 . pep LSVMLIRTSVSVPQHWVYFQ1GRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I ! I I I I I I I I I I I I I I I M I I 1 
al53 LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 

130 140 150 160 170 180 



190 200 210 220 230 240 

m!53.pep LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 e 1 1 1 1 1 1 1 i I 

a!53 LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 
190 200 210 220 230 240 



250 260 270 280 290 300 

ml53.pep AAVI FS AS I LV PVLK I AAMS VLIASARFAL PTGAKKLSHL YR I TEA VGRWSMI DI FVI 1 1 

1 1 I I I M 1 1 I I I 1 1 I I I I M I I M II I I'l 1 1 I II I I I II I I II 1 1 I I I I I I I I I I I I I I I 

a 1 5 3 AAVI FSAS I LVPVLKI AAMSVLI ASARFALPTGAKKLSHLYRITEAVGRWSMI DIFVI 1 1 

250 . 260 270 280 290 300 



310 320 330 340 350 359 

m!53 .pep LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 

imiimii in iii ii imi iii iii iii mini i mm mi iii it ii i ii i 

al53 LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 
310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 63 1>: 

gl54. oeq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCTCAAG CACGCGTCCG 

51 CAAAAACAAC aCCttCCtCT CCGCCGTCTG GCTGGTCCCG CTGATCGCGC 

101 TGATTGCOGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATCGAAG TCAACAATAC 

201 GGTCATTAAG GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTTA CTGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGcgtAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 



WO 99/57280 



PCT/US99/09346 



440 



401 ACATCGCTTT TACACCCGGC AAAAGCGQCG AGGCAAAAGA CGTGTTCCAA 

451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAgcg GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG AccgCATCCT CAACGTcaaC AGCCCTGTTT 

551 TGTATGAAAA CTTTATGGTC GGG CAAATCG AAAGCGCGCA TTTCGAcccG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

651 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT GGAAAGCGGC ATCAATATCG 

701 AAACCACAGG CAGCGGCATC AAACTCAATT COGCCCCTCT GCCTGCCCTG 

751 CTGTCAGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAGGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAATCG 

851 CCAACCTGCC TGACGACCGC TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GACTGACCGT cggTTCGCCT GTcgaATACA AAGGGCtgaA 

951 TGTcggCATG GTTTCCGATG TCCCTTATTT TGACCGCAAt gacagCCTGC 

1001 ACCtgtTTGA aaacggctgg aTTcccGtac gCATCCGCAT cgagccTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAGCAAAGC AAAGAGCATT GGAAACAACA 

1101 ATTCCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGCGGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCC 

1201 TCGCCCAAGC TGCGACCGCA TACCGTTTAT GCAGGCGATA CCGTCATCGC 

1251 CACACGGGGC GGCGGTTTGG ATGACTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACaaatT CAACAATCTG CCATTggata aAACCGTTGC CGAATTGAAC 

1351 GGCTCGCTCG COGAACTCAA GTCCGCACTC AAATCCGCCA ATGCCGCCCT 

1401 AAGCTCCATT GacaAACTGG TCGgcaaTCC GCAGACGCAA AACATCCCGA 

1451 ACGAACTGAA CCAAACTCTG AAAGAGTTGC GCATAACCCT GCAAGGCGTA 

1501 TCGcctCAAT CGCCTATCTa cgGAgacgta caAAATAcgc tgCaAAGTTT 

1551 GGACAAAACC TTAAAagacg TtcaACCCGT CATTAACACT TTGAaAGAAa 

1601 aacCCaaCgc actGATTTtc aacaACAGCA GCAAAGAccc tATCCCGAAA 

1651 GGAAGCCGAT AA 

This corresponds to the amino acid sequence <SEQ ID 632; ORF 154.ng>: 

gl54.pep 

1 MTDNSPPPNG HAQARVRKNN TFLSAVWIA/F LIALIAGG WL WVKEIRNRGP 

51 WTLLMDSAB GIBVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSGEAKDVFQ 

151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQIESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEIANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGM VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RLEINADBQS KEHWKQQFQT ALNKGLTATI SSNNLLTGGK MIELNDQPSA 

4 01 SPKLRPHTVY AGDTVIATRG GGLDDLQVKL ADLLDKFNNL PLDKTVAELN 

4 51 GSIiAELKSAIi KSANAALSSI DKLVGNPQTQ NIPNELNQTL KELRITLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NNSSKDPIPK 

551 GSR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 633>: 

ml54.seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCCCAAG CACGCGTCCG 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 

201 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGCGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 

401 ACATCGCCTT TACACCCGGC AAAAGCGACG AGGCAAAAGA CGTGTTCCAA 

451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCC TGTTT 

551 TGTATGAAAA TTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

651 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT GGAAAGCGGC ATCAATATCG 

701 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 

751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAGTCG 

851 CCAACCTGCC TGACGACCGC TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GCCTGACCGT CGGTTCGCCC GTCGAGTACA AAGGGCTGAA 

951 TGTCGGCGTG GTTTCCGACG TTCCTTATTT CGACCGCAAC GACAGCCTGC 

1001 ACCTGTTTGA AAACGGCTGG ATACCCGTAC GCATCCGCAT TGAACCTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 

1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGAAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCA 
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1201 TCACCTAAGC TGCGACCGCA TACCGTTTAT GCAGGOGATA CCGTTATCGC 

1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AGACGGTTGC CGAATTGAAC 

1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTGCCA ATGCCGCCCT 

1401 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA 

1451 ACGAACTGAA CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA 

1501 TCGCCGCAAT CGCCTATCTA CGGCGACGTA CAAAATACGC TGCAAAGTTT 

1551 GGACAAAACT TTAAAAGACG TTCAACCCGT GATTAATACT TTGAAAGAAA 

1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA 

1651 GGAAGCCGAT AA 

This corresponds to the amino acid sequence <SEQ ID 634 ORF 154.a>: 

ml 5 4 .pep 

1 MTDNS PP PNG HAQARVRKNN TPLSAVWLVP LIALIAGG WL WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 

151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI XLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RliEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 

401 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 

451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 

551 GSR* 

ml 54 / gl54 97.8% identity in 553 aa overlap 

10 20 30 40 50 60 

MTDNS PPPNGHAQARVRKNNTFLS AVWIj VPLI ALI AGGWLWVKE I RNRG PVVTLLMDSAE 

lllll lliltlllllllll lllllllillllll I! (KM Nil Itlll MMI MM II 

MTDNS PPPNGHAQARVRKNNTFLSAVWLVPLIAlil AGGWLWVKEI RNRGPWTLLMDSAE 
10 20 30 40 50 60 

70 80 90 100 110 120 

GIEVNNTVI KVLSI DVGRVTRI KLRDDQKGVEVTAQLNADVSG LI RSDTQFWWKPRIDQ 

iimiiiiiiiiimti iiiiiiimiiimmmimiimiiiimi n 

G I E VNNTVT KVLS I DVGRVTRI KLRDDQKGVEVTAQLNADVSG LI RSDTQFWWKPRIDQ 
70 80 90 100 110 120 

130 140 150 160 170 180 

SGVTGLGTLLSGSY IAETPGKSDEAKDVFQVQD IPPVTAIGQSGLRLNLIGKNDRILNVN 

mmiiiiiiiiiiiiiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SGVTGLGTLLSGSY IAFTPGKSGEAKDVFQVQD I PPVTAIGQSGLRLNLIGKNDRILN\'N 
130 140 150 160 170 180 

190 200 210 220 230 240 

SPVLYENPMVGQVE SAHFDPSDQSVHYTIFIQSPOTJKLIHSASRFWLESGINIETTGSGI 

IIIIIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

SPVLYENFMVGQIESAHFDPSDQSVHYTIFIQSPNDIUilHSASRFWLESGINIETrGSGI 
190 200 210 220 230 240 

250 260 270 2B0 290 300 

KI^SAPLPALLSGAISFDSPKTKNSIQIVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFICQ 

mmm i; ii i iiiii mmiii mil ii ii mi' : immmmiii 

KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEIANLPDDRSLYYTAFFKQ 
250 260 270 2B0 290 300 

310 320 330 340 350 360 

SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWI PVR I RI EPSRLE INADEQS 

lllllllll!!!llllllhllllllllilllMIIIIIIIIII III; MM illl 

SVRGLTVGS PVE YKGLNVGMVSDVP YFDRNDS LHLFENGWI PVRI RI EPSRLE INADEQS 
310 320 330 340 350 360 

370 380 390 400 410 420 

KEHWKQQFQTALNKGLTATI SSNNLLTGSKMI ELNDQPSAS PKLRPHTVYAGDTVIATQG 

I m 1 1 1 1 1 1 1 M I II 1 1 1 1 1 1 M M I : II 1 1 1 1 1 1 M 1 1 M 1 1 ! I M I M 1 1 1 1 1 IM 

KEHWKQQFQTALNKGLTATI SSNNLLTGGKMI ELNDQPSAS PKLR PHTVYAGDTVI ATRG 



ml 54 .pep 
gl54 

ml 54 .pep 
gl54 

ml 54 .pep 
gl54 

ml 54 .pep 
gl54 

ml54 .pep 
9154 

ml54 .pep 
gi54 

ml54 .pep 
gl54 
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370 



380 



390 



400 



410 



420 



430 440 450 460 470 480 

ml 54 . pep GG LDDLQVKliADLIiDKFD KLPLD KT VAELNG S LAELKS TLKSANAALS S IDKLVGKPQTQ 

llliitllllllllllh:||]ii|lllll||||||||:||||li||||||||||:|||| 
gl 54 GGLDDLQVKLADLI^KRINLPLDKTVAEIJJGSLAEI^ 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 54 . pep NI PNELNQTLKBLRTTLQGVSPOSPIYGDVQNTLQSI^KTLKDVQPVrNTLKEKPNALl F 

illlllilllitll lllillillillllllltlMlllllilllNlllllllilllll 
gl 54 NI PNELNQTLKE LRITLQG VS PQS P I YGDVQNT LQSLDKTLKD VQPVINTLKEKPNALI F 

490 500 510 520 530 540 



ml54 .pep 
gl54 



550 

NSSSKDPIPKGSRX 

hlllllllillll 
NNSSKDPIPKGSRX 
550 



The following partial DNA sequence was 

al54 . seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA 

201 GGT CATC AAA GTATTGAGCA TCGATGTCGG 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG 

351 TATCGACCAA AGCGGCGTAA CCGGTTTGGG 

4 01 ACATCGCCTT TACACCCGGC AAAAGCGACG 

4 51 GTGCAGGACA TTCCGCCCGT TACCGCCATC 

501 GAATTTGATT GGTAAAAACG ACCGCATCCT 

551 TGTATGAAAA CTTTATGGTC GGGCAAGTCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC 

651 ACTGATTCAT TCCGCCAGCC GTTTCTGGCT 

701 AAACCACAGG CAGCGGCATC AAACTCAATT 

751 CTGTCGGGCG CGATTTCATT TGATTCGCCG 

801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA 

851 CCAACCTGCC TGATGACCGT TCGCTGTACT 

901 TCCGTGCGCG GACTGACCGT CGGTTCGCCT 

951 TGTCGGCGTG GTTTCCGATG TTCCTTATTT 

1001 ACCTGTTTGA AAACGGCTGG ATTCCCGTAC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC 

1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC 

1151 ACCTGCTGAC CGGCAGCAAA ATGATTGAGT 

1201 TCGCCCAAGC TGCGACCGCA TACCGTTTAT 

1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA 

1301 TGGACAAGTT CGACAAACTG CCTTTAGATA 

1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC 

14 01 AAGCTCCATC GACAAACTGG TCGGCAAACC 

14 51 ACGAACTGAA CCAAACCCTG AAAGAGTTGC 

1501 TCGCCTCAAT CGCCTATCTA CGGCGACGTA 

1551 GGACAAAACC TTAAAAGACG TTCAACCCGT 

1601 AACCCAACGC GCTGATTTTC AACAGCAGCA 

1651 GGAAGCCGAT AA 



identified in N. meningitidis <SEQ ID 635>: 



CACGCCCAAG 
GCTGGTTCCG 
AAATCCGCAA 
GGCATTGAGG 
ACGCGTTACC 
CCGCCCAACT 
TTTTGGGTGG 
TACGCTGCTT 
AGGCAAAAGA 
GGGCAAAGCG 
CAACGTCAAC 
AAAGCGCGCA 
ATCCAAAGCC 
GGAAAGCGGC 
CCGCCCCTCT 
AAAACCAAAA 
CGACAGCCGC 
ACACCGCGTT 
GTCGAGTACA 
CGACCGCAAC 
GCATCCGTAT 
AAAGAACATT 
CGCCACCATC 
TGAACGATCA 
GCAGGCGATA 
GGTCAAATTG 
AGACGGTTGC 
AAATCTGCCA 
GCAGACACAA 
GCACAACCCT 
CAAAATACGC 
CATTAACACT 
GCAAAGACCC 



CACGCGTCCG 
CTGATCGCGC 
CAGGGGGCCT 
TCAACAATAC 
CGAATCAAAC 
CAATGCGGAC 
TCAAGCCGCG 
TCGGGTTCGT 
CGTGTTCCAA 
GGCTGCGCTT 
AGCCCTGTTT 
TTTCGACCCG 
CCAACGACAA 
ATCAATATCG 
GCCTGCCCTG 
ACAGTAAAAA 
AGCGAAGTCG 
TTTCAAACAA 
AAGGGCTGAA 
GACAGCCTGC 
TGAGCCTTCC 
GGAAACAACA 
TCCAGCAACA 
GCCTTCCGCC 
CCGTTATCGC 
GCGGATTTGC 
CGAATTGAAC 
ATGCCGCCCT 
AACATTCCGA 
GCAAGGCGTA 
TGCAAAGTTT 
TTGAAAGAAA 
TATCCCGAAA 



This corresponds to the amino acid sequence <SEQ ID 636; ORF 1 54.a>: 

al54 .pep 

1 MTDNSPPPNG HAQARVRKNN T FLSAVWLVP LIALIAGGW L WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 
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151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFT4V GQVESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 

4 01 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 

451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 

551 GSR* 

ml54/al54 100.0% identity in 553 aa overlap 

10 20 30 40 50 • 60 

ml 54 . pep MT DNS P P PNG HAQAR VRKNN T FL SAVWL V P L I AL I AGGW LW VKE I RNRG P WT L LM D S AE 

I f IUMI UMI I I M I lllllfl II Nil Ml || | IH I! Ill M I I I III I II I III 
al54 MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWKEIRNRGPVVTLLMDSAE 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 5 4 . pep GIEVNNTVIKVLSI DVGRVTRIKLRDDQKGVE VTAQLN ADVS GLI RS DTQFW WKPRI DQ 

I IMMMMIIMIIIMIIIIMMIMIIIMI IIMIMMMII MMM IMI 

al54 GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADV3GLIRSDTQFWWKPRIDQ 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml54 . pep SGVTGLGTLLSGSY1AFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 
III MIMIIIIf I IIIIIIIIIMilllllllllllllllMIIIIIIIIIMII MM 
al54 SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml54 .pep SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 

I I M I M I M M M M 1 1 M 1 1 1 1 1 M I M 1 1 M M M M I M 1 1 M I II M I M I 1 1 1 1 

al54 SPVLYENFMVGQVESAHFDPSDQSVHYT I FIQSPNDKLIHSASRFWLESG INIETTGSGI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml54 .pep KLNSAPLPALLSGAI S FDS PKTKNSKN VKSE DS FTLY DSRSEVANLPDDRSLYYTAFFKQ 

I 1 1 I M 1 1 I 1 1 1 1 I 1 1 I M I M I I M I i M M I 1 1 I 1 1 1 I I I II f M I 1 1 I I I t M M M 
a!54 KLNSAPLPALLSGAI SFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml54.pep SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 

IMMMIMMMMMMMMMMIMMMMMMMI Mill 

al54 SVRGLTVGS PVEYKGLNVGWSDVPYFDRN DSLHLFENGWI PVRIRI E PSRLE INADEQS 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml54 .pep KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTYYAGDTVIATQG 
IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII M I M M M M M M M I 
al54 KEHWKQQFQTALNKGLTAT I SSNNLLTGSKM IE LNDQ PS AS PKLRPHTVY AGDTVIATQG 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 54 .pep GG L D DLQ VKLA DLL DKFDKLPLDKTVAELNGSLAELKSTLKSANAALSS I DKLVGKPQTQ 

I I I I I I I I < I N M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | M I I I 
al54 GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALSS I DKLVGKPQTQ 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml54 . pep NI PNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALI F 

"HI H I I M I M 1 1 I I M M M U 1 1 1 I IMIII Illllll 

a 154 NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 
490 500 510 520 530 540 
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550 

ral54 .pep NSSSKDPIPKGSRX 
I i I I ! I I I I I ( I I I 
al54 NSSSKDPIPKGSRX 
550 



The following partial DNA sequence was identified ini\T. gonorrhoeae <SEQ ID 637>: 

gl55 . seq 

1 atGAAaatcg GtatcCCACG CGAGTCAtta tcCGGCGAAA cccgcgtagc 

51 ctgcAcgccc gCCACCGTTG CCctgctggg caAactAGGC TTTGAAACCG 

101 TTGtcgaAAG CGGTGCAggt TTGGCGGCAA GTTTggaCGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGCCTGCCC 

201 TTTAATTTAT AAGGTCAACG CGCCGTCCGA AGGCGAGCTG CCGCTGCTCA 

251 AAGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 

301 TTGGTCGAGG CCTTGCGCGC CAAGAAAGTC AACGCGCTGG CGATGGACAT 

351 GGTTCCCCGC ATTTCCCGCG CTCAGGCCTT GGACGCTTTG TCTTCAATGG 

401 CAAACATCAG CGGCTACCGC GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 

451 CGTTTCTTCA CCGGTCAAAT CACTGCCGCC GGCAAAGTGC CGCCTGCGCA 

501 GGTTTTGGTG ATTGGCGCCG GTGTGGCGGG TTTGGCGGCA ATCGGTACGG 

551 CAAATTCGCT CGGCGCAGTG GTGCGCGCGT TCGATACCCG CTTGGAAGTG 

601 GCGGAACAAA TCGAATCGAT GGGCGGTAAG TTcctGAAAC TCGACTTCCT 

651 GCAAGAATCG GGCGGCAGCG GAGACGgctA CGCCAAAGTG ATGAGCGACG 

701 AATTTATCGC CGCCGAAATG AAGCTCTTTG CCGAACAGGC GAAAGAAGTG 

751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CTCCCAAGCT 

801 GAT T AC C AAA GAAATGGTGG AAAGCATGAA ATCCGGATCC GTCATCGTCG 

851 ATTTGGCGGC GACGGGCGGC AACTGCGAAC TCACCCGACC GGGCGAATTG 

901 TCCGTAACCG GCAAC3GCGT GAAAATCATC GGCTACACCG ACATGGCAAA 

951 CCGCCTTGCC GGACA3TCTT CCCAGCTTTA CGCCACCAAC TTGGTGAACC 

1001 TGACCAAGCT GTTAAGCCCG AACAAAGAcg gcgaAATCAC GCTGGACTTC 

105.1 GAAGacgtGA TTATCCGCAA TATGACCGTT ACCCGcgacc gcgaaATCAC 

1101 CTTCCCGCCT CCGccgaTTc aggtTTCcgc ccggccgCAG CAAAcgccgt 

1151 ctgaAAAagc cgcGCCTGCC GCCAagcccg AgccGaaacc tgttCCcctg 

1201 tggaAAAaac tcgCGCCCGC CGCcatcgCC GCCGTATTGG tgctgtgGgt 

1251 cggCgcggtc gcacccgcag CATTCTTGAA CCACTTTATC GTCTTCGTCC 

1301 TCGCCTGCGT CATCGGCTAC CATGTCGTTT GgaacgTCAG CCACTCGCTG 

1351 CACACACCGC TGAtgtcggt aaccaaCgcc atctccGGCA tcatggtcgt 

1401 cggCGCGCTG CTGCAAATCG GTCAGGGcaa cggcttcgtT TCgctGCTGT 

14 51 CGTTTGTTGC CATCCTGATT GCCGGCATCA ATATCTTCGG CGGCTTTGCG 

1501 GTTACACGGC GTATGCTGAA TATGTTTAAG AAAGGGTAA 

This corresponds to the amino acid sequence <SEQ ID 638; ORF 155.ng>: 

gl55.pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWACPLIY KVNAPSEGEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTA NSLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFLQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAATGG NCELTRPGEL 

301 SVTGNGVKII GYTDMANRLA GQSSQLYATN LVNLTKLLSP NKDGEITLDF 

351 EDVIIRNMTV TRDGEITFPP PPIQVSARPQ QTPSEKAAPA AKPEPKPVPL 

401 WKK LAPAAIA AVLVLWVGAV AP AAFLNHFI VFVLACVIG Y HWWNVSHSL 

451 HTP LMSVTNA ISGIMWGAL LQIGQGNG FV SLLSFVAILI AGINI FGGFA 

501 VTRRMLNMFK KG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 63 9>: 

ml 55. seq 

1 ATGAAAATCG GTATCCCACG CGAGTCATTA TCCGGCGAAA CCCGCGTCGC 

51 CTGTACGCCC GCCACCGTCG CCCTGCTGGG CAAACTGGGC TTTGAAACCG 

101 TTGTCGAAAG CGGTGCAGGT TTGGCGGCAA GTTTGGACGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGTCTGCCC 

201 TTTGATTTAT AAGGTCAACG CGCCGTCCGA ACAGGAACTG CCGCTTTTGA 

251 ACGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 
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301 


TTGGTCGAAG 




351 


GGTGCCCCGC 


ATTTCGCGCG 


401 


CAAACATCAG 


CGGCTACCGC 


451 


CG T TT CTTC A 


CCGGTCAAAT 


501 


GGTTTTGGTG 


ATTGGTGCAG 


551 


CAAACTCGCT 


CGGCGCAGTG 


601 


GCGGAACAAA 


T C G AATCG AT 


651 


AC AAG AAT C G 


GGCGGCAGCG 


701 


AATTTATCGC 


AGCCGAGATG 

flu vvufivn X VJ 


751 


GACATCATCA 


TCACCACCGC 


801 


GATTACCAAA 


GAAATGGTGG 


851 


ATTTGGCGGC 


GGCGACGGGC 


901 


TTGTCCGTAA 


CCGGCAACGG 


951 


AAACCGCCTT 


GCCGGACAGT 


1001 


ACCTGACCAA 


GCTGTTAAGC 


1051 


TTCGAAGACG 


TGATTATCCG 


1101 


CACCTTCCCG 


CCTCCGCCGA 


1151 


CGTCTGAAAA 


AGCCGTGCCT 


1201 


CTGTGGAAAA 


AACTCGCGCC 


1251 


GGTCGGCGCG 


GTCGCACCCG 


1301 


TTCTCGCCTG 


CGTCATCGGC 


1351 


CTGCACACAC 


CGCTGATGTC 


1401 


CGTCGGCGCG 


CTGCTGCAAA 


1451 


TGTCGTTTGT 


TGCCATCCTG 


1501 


GCGGTAACAC 


GGCGTATGCT 



CAAGAAAGTG AACGCGCTGG CGATGGATAT 
CGCAGGCTTT GGACGCTTTG TCTTCGATGG 
GCCGTAATTG AAGCCGCCAA CGCCTTCGGC 
TACCGCCGCC GGCAAAGTGC CGCCCGCGCA 
GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 
GTACGCGCGT TCGATACCCG CTTGGAAGTG 
GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 
GAGACGGCTA CGCCAAAGTG ATGAGCGACG 
AAGCTCTTTG CCGAGCAGGC GAAAGAAGTG 
CGCCATTCCG GGCAAACCCG CGCCCAAGCT 
AAAGCATGAA ATCCGGCTCC GTCATCGTCG 
GGCAACTGCG AACTCACCCG CCCGGGCGAA 
CGTGAAAATC ATCGGCTACA CCGACATGGC 
CTTCCCAGCT TTACGCCACC AACTTGGTCA 
CCGAACAAAG ACGGCGAAAT CACGTTGGAC 
CAACATGACC GTTACCCACG ACGGCGAAAT 
TTCAAGTTTC CGCCCAGCCG CAGCAAACGC 
GCCGCCAAGC CCGAGCCAAA ACCCGTTCCC 
CGCCGTCATC GCCGCCGTCT TGGTACTGTG 
CAGCATTCCT GAACCACTTT ATCGTGTTCG 
TACTACGTCG TCTGGAACGT CAGCCACTCG 
GGTAACCAAC GCCATCTCCG GCATCATCGT 
TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 
ATTGCCGGCA TCAACATCTT CGGCGGCTTT 
GAATATGTTT AAGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ED 640; ORF 155>: 
ml 55. pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWVCP LIY KVNAPSEQEL PLLN EGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAAATG GNCELTRPGE 

301 LSVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTHDGEITFP PPPIQVSAQP QQTPSEKAVP AAKPEPKPVP 

401 LWK KLAPAVI AAVLVLWVGA VAPAAFLNHF IVFVLACVIG YYWWNVSHS 

451 LHTP LMSVTN AIGGIIWGA L LQIGQGNG F V3LLSFVAIL IAGINI FGGF 

501 AVTRRMLNMF KKG* ~ 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 155 shows 97.9% identity over a 513 aa overlap with a predicted ORF (ORF 155.ng) 
from N. gonorrhoeae: 

ml55 / gl55 97.93 identity in 513 aa overlap 

10 20 30 40 50 60 

ral55.pep MK I G I PRE S LSGETRVACT PATVALLGKLGFETWES GAG L AAS L DDAAYQT AGAT VADK 

I I I I I 1 1 I I f I M I I I 1 1 1 1 I It II I I [I I 1 1 I II N I I I I II II I 1 1 I I I M I M I II i 
gl 55 MKIGI PRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ral55 . pep AAVT^CPLIYKV^APSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

MM: mm I I I I I : I I I I I M! II I I I I I I I I I I I I I t I I I I I I I I II I I 

g!55 AAVWACPLIYKVNAPSEGELPLLKEGQTIVS FLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 55 . pep I SRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 

I I I I I I I I I I I I I I I I I I I I I II I I I I II III I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
gl55 I SRAQALDALS SMAN I S G YRAVIEAANAFGRFFTGQI TAAGKV PPAQVLVI GAGVAGLAA 
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130 



140 



150 



160 



170 



180 



190 200 210 220 23C 240 

ml55.pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 
IIIIMIilllt I I I I I I I I I I I I I I I I M It I M 1 I I I 1 I I I J I I I I t I I I I 1 I I I t | 
gl55 I GTAN SLGAWRAFDTRLEVAEQIESMGGKFLKLDFLQESGGSGDGYAKVMS DEFI AAEM 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml55.pep KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I II I I I I I I I I i I I I 
gl55 KLFAEQAKEVDIIITTAA1PGKPAPKLITKEMVESMKSGSVIVDLAA-TGGNCELTRPGE 

250 260 270 280 290 

310 320 330 340 350 360 

ml55 . pep LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 
llll! Illllllllllll IIIIIIIIIMMMII III IIIIIIIIMIIMIMIIMI 
gl55 LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 
300 310 320 330 340 350 

370 380 390 400 410 420 

ml55 . pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA 
I I : I I I I I I I I I I I I I I I : I I I I I I I I I : I I I II I I I I I I I I I I I I I I : I I I I I I I I I I I 
gl55 VTRDGEITFPPPPIQVSARPQQTPSEKAAPAAKPEPKPVPLWKKLAPAAIAAVLVLWVGA 
360 370 380 390 400 410 

430 440 450 460 470. 480 

ml55 . pep VAPAAFLNHFIVFVLACVIGYYVVWNVSHSLHTPLMSVTNAISGIIWGALLQIGQGNGF 
M I I I li I I I I I M M I I I 1 1 : I I I I I I I I I I I I I I I I I I I I I I I: I I I I I I I I I I I I I I 
gl55 V APAAFLNH F I VFVL ACVI G YH WWN V SH S LH T P LM S VTN AI SGI MWG ALLQI G QGN G F 

420 430 440 450 460 470 

490 500 510 

ml55 . pep VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
\ I I I I II II I I I II I I I I I II I I I I I I I I I I I I I 
gl55 V S LL S FV AI L I AG IN I FGG FAVT RRM LNM FKKGX 

480 490 500 510 



The following partial DNA sequence was identified in JV. meningitidis <SEQ ID 641 >: 



al55 . seq 
1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 



ATGAAAATCG 
CTGTACGCCC 
TTGTCGAAAG 
CAAGCAGCAG 
TTTAATTTAT 
AAGAAGGACA 
TTGGTCGAAG 
GGTGCCCCGC 
CAAACATCAG 
CGTTTNTTCA 
GGTTTTGGTG 
CAAAC TCGCT 
GCGGAACAAT 
GCAAGAATCG 
AATTTATCGC 
GACATCATCA 
NNTNANCAAA 
ATTTGGCGGC 
TTGTTCGTAA 
AAACCGCCTT 
ACCTGACCAA 
TTCGAAGACG 
CACCTTCCCG 
CGTCTGAAAA 



GTATCCCACG 
GCCACCGTCG 
CGGCGCAGGT 
GCGCAACCGT 
AAGGTTAACG 
GACCATCGTC 
CCTTGCGCGC 
ATTTCGCGCG 
CGGCTACCGC 
CCGGCCAAAT 
ATTGGTGCAG 
CGGCGCAGTG 
TAGAATCGAT 
GGCGGCAGCG 
CGCCGAGATG 
TCACCACCGC 
GAAATGGTCG 
GGCGACGGGC 
CCGGCAACGG 
GCCGGACAGT 
GCTGTTAAGC 
TGATTATCCG 
CCTCCGCCGA 
AGCCGCGCCT 



TGAGTCATTA 
CCCTGCTGGG 
TTGGCGGCAA 
TGCCGACAAA 
CGCCGTCCGA 
AGCTTCCTGT 
CAAGAAAGTG 
CGCAGGCTTT 
GCCGTGATTG 
TACTGCCGCA 
GTGTGGCAGG 
GTACGCGTGT 
GGGCGGCAAG 
GCGACGGCTA 
AAGCTTTTTG 
CGCCATTCCG 
AAAGCATGAA 
GGCAACTGCG 
CGTGAAAATC 
CTTCGCAGCT 
CCGAACAAAG 
CAACATGACC 
TTCAAGTTTC 
GCCGCCAAGC 



TCCGGCGAAA 
CAAACTGGGC 
GTTTGGACGA 
GCAGCGGTTT 
AGACGAGCTG 
GGCCGCGCCA 
AACGCGCTGG 
GGACGNTTTG 
AAGCCGCCAA 
GGCAAAGTGC 
TTTGGCGGCG 
TCGATACCCG 
TTCCTGAAAC 
CGCCAAAGTG 
CCGAGCAGGC 
GGCAAACCCG 
ACCCGGCTCC 
AACTCACCAA 
ATCGGCTACA 
TTACGCCACC 
ACGGCGAAAT 
GTTACCCGCG 
CGCCCAACCG 
CCGAACCGAA 



CCCGCGTCGC 
TTTGAAACCG 
TGCCGCTTAC 
GGGCATACCC 
CCGCTGCTCA 
AAACGAGGCT 
CAATGGACAT 
TCTTNGATGG 
CGCCTTCGGC 
CGCCCGCGCA 
ATCGGTACGG 
CCTG.AAGTG 
TCGACTTCCC 
ATGAGCGACG 
GAAAGAAGTG 
CGCCCAAGCN 
GTCATCGTCG 
ACAGGGCGAA 
CCGACATGGC 
AACTTGGTCA 
CACGCTGGAC 
ACGGCGAAAT 
CAGCAAACGC 
ACCCGTTCCC 
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1201 CTGTGGAAAA AACTCGCGCC CGCCNTNATC GCCGCCGTGT TGGTACTGTG 

1251 GGTCGGCGCG GTCGCACCCG CAGCATTCCT GAACCACTTT ATCGTCTTCG 

1301 TCCTCGCCTG CGTCATCGGC TACTATGTCG TTTGGAACGT CAGCCACTCG 

1351 CTGCACACAC CGCTGATGTC GGTGACCAAC GCCATTTCCG GCATCATCGT 

1401 CGTCGGCGCG CTGCTGCAAA TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 

1451 TGTCGTTTGT TGCCATCCTG ATTGCCAGCA TCAACATCTT CGGCGGCTTC 

1501 TTTGTAACGC GGCGGATGCT GAATATGTTT AGGAAAGGGT AA 

This corresponds to the amino acid sequence <SEQ ED 642; ORF 155.a>: 

al55.pep 

1 MKIGI FRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QAAGATVADK AAVWAYPLIY KVNAPSEDEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDXL SXMANISGYR AVIEAANAFG 

151 RXFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTA NSLGAV VRVFDTRLXV 

201 AEQLESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKXXXK EMVESMKPGS VIVDLAAATG GNCELTKQGE 

301 LFVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTRDGEITFP PPPIQVSAQP QQTPSEKAAP AAKPEPKPVP 

401 LWK KLAPAXI AAVLVLWVGA V A PAAFLNHF IVFVLACVIG YYWWNVSHS 

4 51 LHTP LMSVTN AISGIIWGA L LQIGQGNG F VSLLSFVAIL IASINI FGGF 

501 FVTRRMLNMF RKG* 

ml55/al55 S5.3% identity in 513 aa overlap 

10 20 30 40 50 60 

ml55 . pep MKIGI PRESLSGETRVACT PATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 

I I I i I I I I I 1 1 1 I I i I I I I I I I I I I I I I I I I I I 1 E I I I I I I I I I I I I M I I : I I I I i I I I 
al55 MKIGI PRESLSGETRVACT PATVALLGKLGFET WES GAGLAASLDDAAYQAAGATVADK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml55 . pep AAVWVCPLI YKVNAPSEQELPLLNEGQT IVS FLWPRQNEALVEALRAKKVNALAMDMVPR 

MM: IIIIIMIM|:|||||:|||||I!IMMIMIIIIIIIIIIIIIIMIIIII 
a 1 55 AAVWAYPLI YKVNAPSEDELPLLKEGQT IVS FLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 

130 140 150 160 170 180 

ral55.pep ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
M I I II M ; 1 M M M M M M M M M I I I I I I I I I I M II I I I I I I I I I I I II I I 
al55 ISRAQALDXLSXMANISGYRAVIEAANAFGRXFTGQITAAGKVPPAQVLVIGAGVAGLAA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml55 . pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 

II II I I I I I M I : I M I I I I I I M I I I II I I II ! I I I I I I M I I M II I I II I II II I I 
al55 I GT AN S LG AWR V FDTR LXVAEQLE SMGGK FLKL D FPQES GG S G DG Y AKVM S DE F I AAEM 

190 200 210 220 230 240 

250 260 270 280 290 300 

ral55 . pep KLFAEQAKEVDII ITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE 

I I II I I I I I : I I I I I I I I I I M I I I I : I I I I II I I I I I I I I I I I I I M II I II : II 
al55 KLFAEQAPCEVDIIITTAAIPGKPAPKXXXKEMVESMKPGSVIVDLAAATGGNCELTKQGE 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 55 . pep L S VTGNG VK 1 1 G YT DMAN RLAGQS S QL Y ATN LVN LT KLLS FNK DGE ITLDFEDVII RNMT 

I I t I I I I I M I I I I I I I I M I II I I M I I I I I I I II I I I I I I I I I I I M I II I M II I I 
al55 LFVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGE ITLDFEDVII RNMT 

310 320 330 340 350 360 

370 380 390 400 410 420 

ral55 . pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVI AAVLVLWVGA 

I hi I I I I I I I I II I I I I I I II II I I I I : I I II I I I I I II I II I I I I I M I I I I I I I I I 
al55 VTRDGEITFPPPPIQVSAQPQQT PS EKAAPAAKPEPKPVP LWKKLAPAXI AAVLVLWVGA 

370 380 390 400 410 420 
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430 440 450 460 470 480 

ml55 . pep VA PAAFLNH FI V FVLACVI G YTVVWN VS H S LHT PLM S VTNAI SG 1 I WG ALLQI GQGNG F 

I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I 1 I I I I I I 
al55 VA P AA FLN H F I V FVLACV I G YYWWNV S H S LH T PLM S VTN A I SG 1 1 WGAL LQI GQGNG F 

430 440 450 460 470 480 

490 500 510 

ml55.pep V S LL S FVAI L I AG IN I FG G FAVT RRMLN M FKKGX 

I I I I I I I ( I I I I : I II I I I I I I I I II I I I : I I I 
a 1 55 VS LLS FVAI LIAS IN I FGGFFVTRRMLNMFRKGX 

490 500 510 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 643>: 

gl56. seq 

I ATGACTTTCG CCTATTGGTG CATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCTCG CGGTTTTCTG GCACATACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCGC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 

251 CGCTTGCCGG ATTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 

301 ATCGCAGACA AAGCAGCATT GCGCTCGCTG ATGTGGGCGG GCGGATTTGC 

351 CTGCACCGTC GGACTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 644; ORF 156.ng>: 

gl56.pep 

1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 IADKAALRS L MWAGGFACTV GLFVAA A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 645>: 

ral56.seq 

1 ATGACTTTCG CCTATTGGTG TATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCGCG CGGTTTTCTA GCGCACACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCAC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC GGCGCAATCG ACCATCAACA 

251 CGCTTGCCTG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAT 

301 ATCGCCGACA AAGCCGCTAT GCGCTCACTG AT GTGGGC AG GCGGATTTGC 

351 CTGCACCGTC GGGCTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 646; ORF 156>: 

ml 5 6. pep 

1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAAQS TINT LACLFI LFRLAFIWCY 

101 IADKAAMRS L MWAGGFACTV GLFVAAA * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml56 / gl56 96.13 identity in 127 aa overlap 

10 20 30 40 50 60 

ml56 . pep MT FAY W C I L I AC L LP L FC AA YAKKAGG FR FK DNHN PRG FLAHTQG AAARAHAAQQNG FEA 

I I II I I I I M I I I 1 1 I I M I I II I it I 1 1 1 1 I I I I I I I M I II I I I I I M I I I I 1 1 I I 1 1 
gl56 MT FA YW C I L I ACL LP L FC AAYAKKAG G FR FKDNH N PRG FLAH TQG AAARAHAAQQNG FEA 

10 20 30 40 50 60 

"70 80 90 100 110 120 

ml56 . pep FAPFAAAVLTAHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV 
I M I I I I I I I! I I I I I I: I : I : I I | | I i I I I I I I I 1 I I I I I ( I I I : I I I I I M i I II I I 
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gl56 



FAPFAAAVLTAHATGNAGQATVNTLAGLFILFRLAFIWCYI/VDKAALRSLMWAGGFACTV 
70 80 90 100 110 120 



ml 56. pep 
gl56 



GLFVAAAX 
I I I I I I I I 
GLFVAAAX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 647>: 



al56. seq 
1 
51 
101 
151 
201 



ATGACTTTCG 
TTGTGCGGCG 
ACAAT CCGCG 
CACGCCGCGC 
CGTTTTGACG 
251 CGCTTGCCGG 
301 ATCGCAGACA 
351 CTGCACCGTC 



CCTATTGGTG 
TATGCCAAAA 
CGATTTTCTG 
AGCAAAACGG 
GCACACGCAA 
CCTGTTCATC 
AAGCAGCATT 
GGGCTGTTTG 



TATTCTGATT 
AAGCGGGCGG 
GCGCGCACGC 
TTTTGAAGCC 
CCGGCAATGC 
CTGTTCCGCC 
ACGCTCGCTG 
TCGTGGCTGC 



GCCTACCTAT 
ATTCCGGTTT 
AAGGCACAGC 
TTTGCACCGT 
CGGACAAGCA 
TCGCCTTTAT 
ATGTGGGTGG 
TTGA 



TGCCGCTTTT 
AAAGACAACC 
CGCCCGTGCC 
TTGCAGCCGC 
ACCGTCAACA 
CTGGTGCTAC 
GCGGATTTGT 



This corresponds to the amino acid sequence <SEQ ID 648; ORF 156.a>: 

a!56. pep 

1 MTFAYWCILI AYLLPLFCAA YAKKAGGFRF KDNHNPRDFL ARTQGTAARA 

51 HAAQ QNGFEA FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 IADKAALRS L MWVGGFVCTV GLFWA A* 

ml56/al56 90.6% identity in 127 aa overlap 

10 20 30 40 50 60 

m!56.pep MTFAYWCI L IACLLPL FCAAYAKKAGGFRFKDNHN PRGFLAHTQGAAARAHAAQQNG FEA 

1 I I I I I I I II I I I I I I I I I I ! I I I I I I I I I I I I I I t I I I : I I I : I t I I I ! It I I I I 1 I 
al56 MTFAYWCILIAYLLPLFCAAYAKKAGGFRFKDNHNPRDFLARTQGTAARAHAAQQNGFEA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml56 . pep FAPFAAAVLTAHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV 
I I I I I I I I I II I I I I I I : I : I: I I I I I I I I I I I I I I I I I I I I I I I : I I I I I : I II : I I I 
a 1 5 6 FAPFAAAVLTAHATGNAGQATVNTLAGLFI LFRLAFIWCY I ADKAALRSLMWVGGFVCTV 

70 80 90 100 110 120 



ml 56. pep GLFVAAAX 
1111:111 
al56 GLFWAAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 649>: 

g!57 . seq 

1 atgaggaacg aggAAAAACg cgccctgcgc cgcgaattgC gCgGgcggcg 

51 ttcgcAAATg GGgcgagacg tGCGggCGGC GGCGgCgatA Aaaatcaacc 

101 gcctgctcaa aCGTtatatc AAGCGCggtc gGaAaatcgG CGTGTATTgg 

151 cCGATGGGCA AGGAATTGcg TTTGGGCGgc tTtgtcCGCG CGGCGCAAAA 

201 ACGCgGCGCA AAactctatc tgccttATAT CGAACCCCAC ACGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GAACGCGGAA TGGAACGGGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGGCGCA AAATCCGCGT 

351 GCACGGTTTG TCGGTATTGC TCGTCCCGCT TGTCGGCATA GACCGCGAAG 

401 GCTACCGTTT GGGGCAGGCA GGCGGCTATT ACGATGCGAC GCTTTCGGCG 

451 ATGAAATACC GTTTGCAGGC GAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTCCCAC GCGAGGCGCA CGACCTGCCG CTGGACGGTT 

551 TTGTATCGGA AGCGGGGATA TTGTGTTTTT AG 
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This corresponds to the amino acid sequence <SEQ ID 650; ORF 157.ng>: 

gl57.pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAAAI KINRLLKRYI KRGRKIGVYW 

51 PMGKELRLGG FVRAAQKRGA KLYLPYIEPH TRRMWFTPYP ERGMERERKR 

101 GRAKLHVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPREAHDLP LDGFVSEAGI LCF* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 65 1>: 
ral57 .seq 

1 ATGAGGAACG AGGAAAAACG CGCCCTGCGC CGCGAATTGC GCGGGCGGCG 

51 TTCGCAAATG GGGCGGGACG TGCGGGCGGC GGCAACGGTA AAAATCAACC 

101 ACCTGCTCAA ACGTTATATT AAAAAAGGGC GGAAAATCGG CGTGTATTGG 

151 CCGATGGGCA AGGAATTGCG TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 

201 ACGCGGTGCG GAACTCTACC TGCCTTATAT CGAACCGCGT TCGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GCCGATGGAG TAAAACAAGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGTCGGA AAAAGCGTGT 

351 GCATGATTTG AACCTCCTGC TTGTGCCAGT GGTCGGTATG GACAGGCTGG 

4 01 GCTACCGCTT GGGACAGGCA GGCGGCTATT ACGATGCGAC GCTTTCAGCG 

451 AT GAAAT ACC GTTTGCAGGC AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTGCCGG TCGAGGCGCA CGACCGGTCT TTGGACGGTT 

551 TTGTGTCGGA GGCGGGGATA TTGTGTTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 652; ORF 157>: 

ml57 .pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAATV KINHLLKRYI KKGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA ELYLPYIEPR SRRMWFTPYP ADGVKQERKR 

101 GRAKLHVPQF AGRKKRVHDL NLLLVPWGM DRLGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPVEAHDRS LDGFVSEAGI LCF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m!57 / gl57 B8.1% identity in 193 aa overlap 



10 20 30 40 50 60 

ml 57 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKINHLLKRYI KKGRKIGVYW PMGKELRLDG 

I I I I I I I I I I I I I I I M M I I I I I I I I I : : I I I : I i I I I I I: I II I I I I I I I I I I I I I I 
gl57 MRNEEKRALRRELRGRRSQMGRDVRAAAAI KINRLLKRYI KRGRKIGVYWPMGKELRLGG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 51 .pep FVRAAQKRGAELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 

I I I I I I I I I I : I I I I I I I I:: M II I : : : I II I I I I I I I I I ! I II I I III I 

gl57 FVRAAQKRGAKLYLPYIEPHTRRMWFTPYPERGMERERKRGRAKLHVPQFAGRKIRVHGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

nvl57 . pep NLLLVPVVGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 

: : II I 1 : I I : I I I II I I I I I I I I I I I I i I I I I I M i I I I I I I I I I I i i I I I I I I I I 
gl57 S VLLV P LVG I DREGYRLGQAGGYY DAT LS AMKYRLQAKTVG VG FACQLVDRLPREAH DLP 

130 140 150 160 170 180 



190 

ml57 . pep LDGFVSEAGILCFX 
I I I I II I I I I I 1 I I 
gl57 LDGFVSEAGILCFX 
190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 653>: 

al57 . seq 

1 ATGAGGAACG AGGAAAAACA CGCCTTGCGC CGAGAGTTGC GCCGCGCCCG 
51 CGCGCAGATG GGGCATCAAG GGCGGTTGGC GGCGGGGCAA ACGATTAACC 
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101 GCCTGCTCAA ACGTTATATC 

151 CCGATGGGCA AGGAATTGCG 

201 ACGCGGTGCA AAACTTTATC 

251 TGTGGTTTAC GCCGTATCCT 

301 GGCAGGGCGA AGTTGAACGT 

351 GCACGGTTTG TCGGTATTGC 

401 GCTACCGCTT AGGACAGGCA 

4 51 AT G AAATACC GTTTGCAGGC 

501 GTTTGTGGAC AGGCTGCCGC 

551 TTGTGTCGGA GGCGGGGATA 



AAGCGTGGTC GGAAAATCGG CGTGTATTGG 
TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 
TGCCTTATAT CGAACCGCGT TCGCGGCGGA 
GAAAGCGGAA TGGAACGGGA GCGCATACGG 
GCCGCAGTTT GCAGGGCGCA AAATCCGCGT 
TCGTCCCGCT TGTCGGCATA GACCGCGAGG 
GGCGGCTATT ACGATGCGAC GCTTGCGGCG 
AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 
GCGAACCGCA CGATCTGCTG CTGGACGGTT 
TTGTGCTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 654; ORF 157.a>: 

al57.pep 

1 MRNEEKHALR RELRRARAQM GHQGRLAAGQ TINRLLKRYI KRGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA KLYLPYIEPR SRRMWFTPYP ESGMERERIR 

101 GRAKLNVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLAA 

151 MKYRLQAKTV GVGFACQFVD RLPREPHDLL LDGFVSEAGI LCF* 

ml57/al57 82.4% identity in 193 aa overlap 



10 20 30 40 50 60 

ml 57 . pep MRN EE KRALRRE LRGRRSQMGR D VRAAAT VK I N HL LKR Y IKKGRK IG V YW PMGKE LRL DG 

I I II I I : I II I I I I 1 : I I I : : I II I I : I I I I I I I : I I I I I I I I I I I I I I I I I I 
al57 MRN EEKHALRRELRRARAQM GHQG RLAAGQT I N RL LKR Y IKRGRK I G V YW PMGKE LRL DG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m!57 . pep FVRAAQKRGAELYLPYIEPRSRRMWETPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
I I I I II I I I I : I I I I I I I I I I I I I I I I I II : I : : : I I I I I I I I : I I I I I I I I III I 
a 157 FVRAAQKRGAKLYLPYIEPRSRRMWFTPYPESGMERERIRGRAKLNVPQFAGRKIRVHGL- 

70 80 90 100 110 120 



130 140 ■ 150 160 170 180 

ml 57 . pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 

: : I I I I : I I : I I II I II I II I I I I I I I : I II I I I I I I I I II I I I I I : I I I I I I II 
al57 SVLLVPLVGIDREGYRLGQAGGYYDATLAAMKYRLQAKTVGVGFACQFVDRLPREPHDLL 

130 140 150 160 170 180 



190 

ml57 .pep LDGFVSEAGILCFX 
I I I I I II I I I I I I I 
al57 LDGFVSEAGILCFX 
190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 655>: 

gl58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGgagcAGTT GGAGAtggCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGCGT GAAcCTGCtC 

151 aACCGCACCA CGCGGCAACT CAATCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAGTA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCcgatgg TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCcgaC TTTCGCTCGT TTCTTCCGAa ggctatatca 

401 atctGattGA Acgcaaagtc gAtatTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCACT TCCGCGtagt 

501 cgCCAGTCCT GAATATTTAG CAAAACACGG CACGCCACAA TCTGCAGAAG 

551 atcTTGCCAA CCATCAATGT TTAGGCTTCA CAGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGAtgC GCAGGGAAAT CCCTATAAAA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAAGtt 

701 gCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCACTGAA 

751 GGAAAGTTAA TTCCcctatt cgCCGAACAA ACCTCCAATA AAACACACCC 
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801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTT ACGCG 

851 TATTTTTGGA TTTTTTAGTG AAGGAACTGG GAAAAAATAT GAATAGAACG 

901 AATACCAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 656; ORF 158.ng>: 

gl58 .pep 

1 MKTNSEELTV FVQWESGSF SRAA EQLEMA NSAVSRIVKP. LEEKLGVNLL 

51 NRTTRQLNLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEV PQGVLRVDSA 

101 MPMVLHLLAP LAAKFNERYF HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSHFRWASP EYLAKHGTPQ SAEDLANHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEILR SLC LSSCGIACLS DFLV DNDITE 

251 GKLIPLFAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV KELGKNMNRT 

301 NTK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 657>: 

ml58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCC- CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG TGTTGAGCGT GGATTCCGCG 

301 ATGCCGATGG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

401 ATCTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

4 51 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

701 GCGGTATTGT TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCGATA AAACACACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAATCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 



This corresponds to the amino acid sequence <SEQ ID 658; ORF 158>: 

ml58.pep 

1 MKTNSEELTV FVQWESGSF SRAA EQLAMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLSLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEI PQGVLSVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSRFRVIASP EYLAKHGTPQ STEELAGHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEIL RSLC LSGCGIVCLS DFLV DNDIAE 

251 GKLIPLLAEQ TSDKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml58 / gl58 94.3% identity in 297 aa overlap 

10 20 30 40 50 60 

ml58 . pep MKTNSEELTVFVQVVESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
I I ! I I I I II I I I I I I I I I II I I I I I I I I I I \ I I I I I I I I I I I I II I II I I II I I I I : I I 
gl58 MKTNSEELTVFVQVVFISGSFSRAAEQLEM^SAVSRIVKRLEEKLGVNLLNRTTRQLNLT 

10 20 30 4C 50 60 



70 80 90 10C 110 120 

ml58 . pep E EG AQ Y FRR AQR I LQEMAAAET EMLAVHE I PQG VLS V DSAM PMVLHLLA P LAAK FN ERYP 

I II I I I I I I I I I I I I I I I I II I I I I I II I : I I I i I I I I I I I I I S I I I I I I I II I I I I I I 
gl58 EEGAQYFRRAQRILQEMAAAETEMLAVHEVPQGVLRVDSAMPMVLHLLAPLAAKFNERYP 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml 58 . pep HIE^LSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
M IN II Ml III 1 I III III llllll lllllllill IN 11:1 II: II II II 111! II! 
gl58 HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSHFRVVASPEYLAKHGTPQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml58.pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
I : I : I I: I I I I 1 1 I I I I I 1 1 I 1 1 I 1 1 I I II I I I I J I I I 1 1 1 1 I I I I I I I I I I: I I 1 : 1 1 I 
gl58 SAEDLANHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSSCGIACLS 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 58 .pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

I I I I I II I : I I I I I I I : I I I I I : I I II I I I I I I I I I I I I I I I I | | I | I | | : I I I : I : 
gl58 DFLVDNDITEGKLIPLFAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVKELGKNMNRT 

250 260 270 280 290 300 

gl58 NTKX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65 9>: 

al58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCCG AT GG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

401 ATCTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

701 GCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCAATA AAACGCACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 

This corresponds to the amino acid sequence <SEQ ID 660; ORF 158.a>: 

al58.pep 

1 MKTNSEELTV FVQWESGSF SRAA EQLAMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLSLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEI PQGVLRVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSRFRVIASP EYLAKHGTPQ STEELAGHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEIL RSLC LSGCGIACLS DFLV DNDIAE 

251 GKLIPLLAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 

ml58/al58 99.0% identity in 299 aa overlap 

10 20 30 40 50 60 

iul58 . pep MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKEILEEKLGVNLLNRTTRQLSLT 
I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I 1 I II I I I I I I I I II I I I I I I I I I I I I I 
al58 MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ral58 . pep EEGAQY FRRAQR I LQEMAAAETEMLAVHE I PQGVL S VD SAM PMVLHLLAPLAAK FNERY P 

I M I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I | I | | | | | I I I I || | | | I | | | | | || 
al58 EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLRVDSAMPMVLHLLAPLAAKFNERYP 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml58.pep HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
I I I I I II I I 1 1 I I I M I I I i I I I I I I 1 1 I I I I M 1 1 I I 1 1 I I I I I 1 1 I 1 1 I ! I I I I I I I I 
al58 HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml58 . pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I : I M 
a!58 STEELAGHQCLGFTEPGSLNTWAVI.DAQGNPYKTSPHFTASSGEILRSLCLSGCGIACLS 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 58 . pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 
I I I I I I I I I I I I I I I I I II I I I: I I I I I 1 I I I II I I I I I I I I I I I I I I I I I I 1 1 I I I I I I 
al58 DFLVDNDIAEGKLIPLLAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 66 1>: 

gl60.seq 

1 ATGGAcattc tgGACAAact ggtcgatCTC GCccaATTGA CGGGCAGTGC 

51 GGATGTGCAG TgcctTTTGG GCGGACAATG gcATGaaacc TTGCAACGCG 

101 AAGGGCTGGT ACACATTGTT ACGGCGGGCA GCGGTTATCT CTGCATCGAC 

151 GGCGAAACTT CCCCGCGTCC GGTCGGCACG GGCGATATTG TATTTTTCCC 

201 GCGCGGCTTG GGTCATGTGT TGAGCCACGA CGGAAAATAC GGAGAAAGTT 

251 TACAACCGGA CATACGACAA AACGGCACAT TTATGGTCAA ACAGTGCGGC 

301 AACGGGCTGG ATATGAGCCT GTTTTGCGCC CGTTTCCGCT ACGACACCCA 

351 CGCCGATTTG ATGAACGGGC TGCCGGAAAC CGTTTTTCTG AACATTGCCC 

401 ATCCAAGTTT GCAGTATGTG GTTTCAATGC TGCAACTGGA AAGCGAAAAA 

451 CCTTTGACGG GGACGGTTTC CGTGGTCAAC GCATTACCGT CCGTCCTGCT 

501 GGTGCTTATC CTGCGCGCCT ATCTCGAACA GGATAAGGAT GTCGAACTCT 

551 CGGGCGTATT GAAAGGTTGG CAGGACAAAC GTTTGGGACA TTTGATCCAA 

601 AAGGTGATAG ACAAACCGGA AGACGAATGG AATATTGACA AAATGGTTGC 

651 CGCCGCCAAT ATGTCGCGCG CGCAACTGAT GCGCCGCTTC AAAAGCCAAG 

701 TCGGACTCAG CCCGCACGCC TTTGTGAACC ATATCCGCCT GCAAAAAGGC 

751 GCATTGCTGC TGAAGAAAAC CCCGGATTCG GTTTTGGAGG TCGCGCTGTC 

801 GGTGGGCTTT CAGTCGGAAA CGCATTTCGG CAAGGCGTTC AAACGGCAAT 

651 ATCACGTTTC GCCGGGGCAA TACCGGAAAG AAGGCGGGCA AAAATAA 

This corresponds to the amino acid sequence <SEQ ID 662; ORF 160.ng> : 

gl60.pep 

1 MDILDKLVDL AQLTGSADVQ CLLGGQWHET LQREGLVHIV TAGSGYLCID 

51 GETSPRPVGT GDIVFFPRGL GHVLSHDGKY GESLQPDIRQ NGTFMVKQCG 

101 NGLDMSLFCA RFRYDTHADL MNGLPETVFL NIAHPSLQYV VSMLQLESEK 

151 PLTG TVSWN ALPSVLLVLI L RAYLEQDKD VELSGVLKGW QDKRLGHLIQ 

201 KVI DKPEDEW NIDKMVAAAN MSRAQLMRRF KSQVGLSPHA FVNHIRLQKG 

251 ALLLKKTPDS VLEVALSVGF QSETHFGKAF KRQYHVSPGQ YRKEGGQK* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 663>: 

ml 60. seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 

51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACC? 

101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TATGAGCCTG TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

401 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

451 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 
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501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AggCGGGCAA 

901 AAATAA 



This corresponds to the amino acid sequence <SEQ ID 664; ORF 160>: 
ml 60. pep 

1 MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQREGLV HIVTSGSGYL 
51 CIDGETSPRP VSTGDTVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 
101 QCGKGQDMSL FCARFRYDTH ADLMNGLPET VFLNIAHPSL QYWSMLQLE 
151 SKKPLTG TVS MVNALSSVLL VLILRAYLEQ DKDVELSGVL KGWQDKRLGH 
201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 
251 QKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 
301 K* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



xaXSO / gl60 93.4% identity in 301 aa overlap 



10 20 30 40 50 60 

ml60.pep MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
I I I I I I I t I : I I I I ! I: I I I I I II I I I I I I I I I I I I M I I I : I I ! I I I I I i I 1 I I I I 
gl60 MDILDKLVDLAQLTGSADVQCLLGGQW HETLQREGLVHIVTAGSGYLCIDGETSPRP 

1C 20 30 40 50 

7C 80 90 100 110 120 

ml 60 . pep VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
1 : I I I M I I I I I I I I i I I I I I I i I I I I I : I I : I : I I I I I I I I I I I I I I I I I I I I I I 
gl60 VGTGDIVFFPRGLGHVLSHDGKYGESLQPDIRQNGTFMVKQCGNGLDMSLFCARFRYDTH 
60 70 80 90 100 110 



130 140 150 160 170 1B0 

ml 60 . pep ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 

I I II I II I I I I I i I i I I I I I I I I I I ! I I I I I : I I 1 I I I I I : I I I I I I I II I I I I I I I I I 
gl60 ADLMNGLPETVFLNIAHPSLQYWSMLQLESEKPLTGTVSVVNALFSVLLVLILRAYLEQ 

120 130 140 150 160 170 



190 200 210 220 230 240 

ml 60 . pep DKD VE L S G VLKGWQ DKR LG HL I QKVI DKPE DEWN V DKMVAAANMS RAQ LMRRFKS RVG LS 

I E I I I 1 I I I 1 E I ) I I 1 1 I I t f I 1 1 t 1 1 I 1 1 I I 1 1 r | t i 1 I f I I I f f I t I I f 1 I I I r 1 1 [ | 
gl60 DKD VELSGVLKGWQDKRLGHLI QKVI DKPEDEWNIDKMVAAANMSRAQLMRRFKSQVGLS 

180 190 .200 210 220 230 



250 260 270 280 290 300 

ml 60 . pep PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

I II I II I I II I I I I I I I I I : II I I I I I II I I I I I I Ml I I I I I I I I I I I I I I I I I I I II 
gl60 PHAFVNHIRLQKGALLLKKTPDSVLEVALSVGFQSETHFGKAFKRCYHVSPGQYRKEGGQ 

240 250 260 270 2B0 290 



ml 60. pep KX 
I I 

gl60 KX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 665>: 

al60.seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 
51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACCT 
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101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TATGAGCCTG TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

401 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

451 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 

501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AGGCGGGCAA 

901 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 666; ORF 160.a>: 
al60.pep 

1 MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQRSGLV HIVTSGSGYL 

51 CIDGETSPRP VSTGDIVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 

101 QCGNGQDMSL FCARFRYDTH ADLMNGLPET VFLNIAHPSL QYWSMLQLE 

151 SKKPLTG TVS MVNALSSVLL VLIL RAYLEQ DKDVEL5GVL KGWQDKRLGH 

201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 

251 QKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 

301 K* 

ml60/al60 100.0% identity in 301 aa overlap 

10 20 30 40 50 60 

ml 60 . pep MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I II I j I I I I I I I I I I I I 1 M t M I I I I I I 
al60 MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 

1C 20 30 40 50 60 

70 80 90 100 110 120 

ml 60 . pep VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGA7TVKQCGNGQDMSLFCARFRYDTI1 

I I I I I II I I I I I I I I I II I I I I I I I II II I I I II M H I 1 1 I 1 1 1 1 II I i II I I I I 1 1 I 

al 6 0 VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
70 83 90 100 110 120 

130 143 150 160 170 180 

ml 60 . pep ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 
I I I I I I 1 1 I I M I I I 1 1 I I I I 1 1 I M I I li I I 1 1 I I I I I I 1 I I I I 1 1 I 1 1 I I I I I M I I I 
a!60 ADLMNGLPETVFLNIAHPSLQYVVSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 60 . pep DKD VE L SG V LKGWQ DKR LGH L I QKV I DK PE DE WNV D KMVAAANM S RAQLMRR FK S R VG L S 

I I M I I I II I I I II I I I I I I I I I I I I I I I I I 1 1 I I I I I I M I I II 1 1 I II I I I I I M I I I 

al60 DKD VEL SG V LKGWQ DKRLG H L I QKV I DKPE DE WNV D KMVAAANM S RAQ LMRR FK S R VG L S 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 60 . pep PH A FVN H I R LQKG AL LLKKN P D S VL S V AL S VG FQS E T H FGKA FKR Q YH VS PGQ YRKEGG Q 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
al60 PHAFVNH I RLQKGALLLKKNPDSVLSVAL SVG FQS ETH FGKA FKRQYHVS PGQYRKEGGQ 

250 260 270 280 290 300 



ml 60. pep 
al60 



KX 
I I 
KX 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 667>: 

gl61. seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTCACCGTTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTACGC TCGGTGCTGC CGCCGTATTG CGGCGCGACA CCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGAC AACCGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTttg GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 CCGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGCAACC GGCGTGGCGA TGTCGTCggt ttgggcgacg 

601 Ctgaccggct ggCACAcccT GTCCTTTcca tcggcagttt ATCtgtCGGG 

651 CATCGGCGTG tccgcgCtgA TTGCCCAaCT GtcgatcAcg cGCGcctaca 

701 aaGTCGGCGA CAAATTCACG GTTGCCTCGC tttCCtaTAt gaccgtcGTC 

751 TTTTCCGCCC TGTCTGCCGC ATTTTTTCTg ggcgaagagc ttttctggCA 

801 GGAAATACTC GGTATGTGCA TCATTATcct CAGCGGCATT TTGAGCAGCA 

851 TCCGCCCCAT TGCCTTCAAA CAGCGGCTGC AAGCCCTCTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 668; ORF 16Lng>: 

g!61 .pep 

1 MDTAKKDILG SGWMLVAAA C FTVMNVLIKE ASAKFALGSG ELVFWRMLFS 

51 TVTLGAAAVL RRDTFRTPHW KNHLNRS MVG TGAMLLLFYA VTHL PLTTGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGVVLL LNPSF RSGQE 

151 PAALAGLAGG AMSGWAYLKV RELSLAGEPG WRVVFYLSAT GVAMSSVWAT 

201 LTGWHTLS FP SAVYLSGIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 FSALSAAFFL GEELFW QEIL GMCIIILSGI LSSI RPIAFK QRLQALFRQR 

301 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 669>: 

ml 61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG CGTCGGGACA mCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGGC CACTGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTTTG GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA TGTCGTCGGT TTGGGCGACG 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA TCGGCAGTTT ATCTGTCGTG 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT GTCGATGACG CGCGCCTACA 

701 AAGTCGGCGA CAAATTCACG GTTGCCTCGC TTTCCTATAT GACCGTCGTT 

751 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG GGCGAAGAGC TTTTCTGGCA 

801 GGAAATACTC GGTATGTGCA TCATCATCCT CAGCGGTATT TTGAGCAGCA 

851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 670; ORF 16 1>: 

ml 61 .pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAVL RRDXFRTPHW KNHLNRSMVG TGA MLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGVVLL LNPSF RSGQE 
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151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR WFYLSVT GVAMSSVWA T 
201 LTGWHTLS FP SAVYLSCIGV SALIA QL5MT RAYKVGDKFT VAS LSYMTW 
251 FSALSAAFFL GEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 
301 * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml 61 / gl61 97.0% identity in 300 aa overlap 

10 20 30 40 50 60 

MDTAKKDI LGSGWMLVAAACFT IMNVLI KEASAKFALGSGELVFWRMLFSTVAXGAAAVL 
I II 1 1 1 I I 1 1 I 11 I 1 1 1 I 1 1 1 1 : 1 I I I I I I I 1 1 I II M I I II I II I I I I I I I: 1 1 1 1 1 I I 
MDTAKKDILGSGWMLVAAACFTVMNVLIKEASAKFALGSGELVFWRMLFSTVTLGAAAVL 
10 20 30 40 50 60 

70 80 90 100 110 120 

RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
I I I : I I I I I I I I I I II I I I I I I I I I I I I I I I I I I N : I I I || I I I I M I I I I I I I I M ! I 
RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLTTGVTLSYTSSIFLAVFSFLILKE 
70 80 90 100 110 120 

130 140 150 160 170 180 

RISVYTQAVLLLGFAGWLLLNPSFRSGQETAALAGLAGGAI4SGWAYLKVREL3LAGEPG 

MMIMIIilllllimiMIIMIIIi I ! I I I [ i I I I I I I I I I I I I Kill 

RISVYTQAVLLLGFAGVVLLLNPSFRSGQEPAALAGLAGGAI4SGWAYLKVRELSLAGEPG 
130 140 150 160 170 180 

190 200 210 220 230 240 

WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 
I I I I I i I I : I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I M 
WRWFYLSATGVAMSSVWATLTGWHTLSFPSAVYLSGIGVSALIAQLSMTRAYKVGDKFT 
190 200 210 220 230 240 

250 260 270 280 290 300 

VA S LS YMT W FS AL S AA FFLGEE L FWQE I LGMC IIILSGILSSIRPTA FKQRLQS L FRQR 

1 1 1 1 1 J i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 r r 1 1 1 1 1 i r 1 1 1 1 1 

VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPIAFKCRLQALFRQR 
250 260 270 280 290 300 

X 
I 

X 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 67 1>: 

al61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

131 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG CGTCGGGACA CCTTCCGCAC 

231 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGGC CACCGGCGTT 

331 ACCCTGAGTT ACACCTCGTC GATTTTTTTG GCGGTATTTT CCTTCCTGAT 

351 TTT GAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA TGTCATCGGT TTGGGCGACG 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA TCGGCAGTTT ATCTGTCGTG 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT GTCGATGACG CGCGCCTACA 

701 AAGTCGGCGA CAAATTCACG GTTGCCTCGC TTTCCTATAT GACCGTCGTT 

751 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG GCCGAAGAGC TTTTCTGGCA 

801 GGAAATACTC GGTATGTGCA TCATCATCCT CAGCGGTATT TTGAGCAGCA 



ml 61 .pep 
g!61 

ml 61 .pep 
gl61 

ml 61. pep 
gl61 

ml 61 .pep 
gi6i 

ml61 .pep 
g!61 

ml 61 .pep 
gl61 
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851 TCCGCCCCAC TCCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 
901 TAA 

This corresponds to the amino acid sequence <SEQ ID 672; ORF 1 61 .a>: 

al61.pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG E LVFWRMLFS 

51 TVALGAAAV L RRDTFRTPHW KNHLNRSMVG TG AMLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGWLL LNPSF RSGQE 

151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR WFYLSVT GVAMSSVWA T 

201 LTGWHTLS FP SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 FSALSAAFFL AEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 

301 * 

ml61/al61 99.3% identity in 300 aa overlap 

10 20 30 40 50 60 

ml 61 . pep MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 

r 1 1 1 f 1 1 1 1 1 1 1 1 1 f 1 1 1 j 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 r i f 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 

al61 M DT AKK D I LG S G WML V AAAC FT I MN VL I KEAS AK FALG SGELVFWRMLF S T V ALG AAAVL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 61. pep". RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
M I : I I ! I I I I I I I II M I II I I I I I I I II I I I I I I I I I I I M I I I I I M I I I I I I I I M 
al61 RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml61 . pep RI SV YTQAVLLLGFAGVVLLLN PS FRSGQETAALAGLAGGAMSGWAYLKVRELS LAGEPG 

I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II ! I II I I I I! I I I I I I I I I I 
al61 R I S V YTQAVLLLGFAGWLLLN PS FRSGQETAALAGLAGGAMS GWAYLKVRELS LAGE PG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml61 . pep WR VVBTLSVTGV AM SSVWATLTGWHTLSFPSAVYLSCIGVSAL I AQLSMT RAYKVGDKFT 

IIIMIill III III IIMI IIIIIIIIIIIIIIMIIIMMIIIII lltllllllll I 
al61 WRWFYLSVTGV AM SSVWATLTGWHTLSFPSAVYLSC I GVSAL I AQLSMT RAYKVGDKFT 

190 200 210 220 230 240 

250 260 270 280 290 300 

Itliei.pep VASLSYMTWF S AL S AA F FLGEE L FWQE I LGMC IIILSGILSSIR PT A FKQRLQ SL FRQR 

II I \ I I I I I I M I I I II II 1:1 I II | | | I || | | | | | | Ml II I I I II I I II I I I I I I II I 
al61 VASLSYMTWFSALSAAFFLAEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 

250 260 270 280 290 300 



ml 61. pep X 
I 

al61 X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 673>: 

gl63 .seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG gctCGACCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTt 

151 ctgGGTTTtC t get GAT ACT CTCGGTCAGC GGTTTGGGAA ACATcagGCT 

201 AGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGCC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGTCGGCG CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

401 CCTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 
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451 CGCTACAAAC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCGG 

651 CGTGCAGGTC TTGATTATCG CCGCCGTAAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 

7 51 GGCCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG ACCCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTGAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGgc 

951 gcCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGg cgcaccatCc 

1001 gcgagtttgt CTTCGGGGTT T7GCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAATG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAC TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CCCTGTTTTT TGTAACTTCT GCCGACTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGatgtcTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

1401 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGCT 

14 51 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTCAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCGGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACATACC GCATCGCCCG 

1601 CTATGCACGA GTTGCAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AGATGTTTCA TCAGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CACCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 

This corresponds to the amino acid sequence <SEQ ED 674; ORF 163.ng>: 

g 163. pep 

1 M VILTTLFFV CVLWLVL TV PDQVQMWLDR AKEVIFTEFS WFY VLTFSIF 

51 LGFLLILSVS GLGNIRLGRD EDVPEFGFLS WL AMLFAAGM GVGLMFFGV A 

101 EPLMHYFSDI TVGAPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI I?TLGF GASQ 

201 LGAGLQEMGW IAENSFGVQ V LIIAAVMSLA WSAIS GVGK GVKVLSSLNL 

251 GLAFLLLFFV LAAD PTVYLL SAFGDNIGNY LGNLVRLSLK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAFFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 

401 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PR WQAVMWGV LMSAVAVLLM 

451 RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVRIMSQTQE QDILKFLKHT AS PAMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 675>: 

ml 63. seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACATCAGGCT 

201 CGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGCCGGCA CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

401 CTTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 

451 CGCTACAAGC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATCGCGTTGC 

S51 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCAG 
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651 CGTGCAGGTT TTGATTATCG CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGCCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG GACCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTTAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 

951 GCCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGG CGCACCATCC 

1001 GCGAGTTTGT CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTfGC 

1101 GGGGGGAATG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAT TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CTCTGTTTTT TGTAACTTCT GCCGATTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGATGTCTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

1401 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGCT 

14 51 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTTAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCAGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACAGACT GCATCGCCCG 

1601 CTATGCACGA GTTGCAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AAATGTTTCA TCGGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1831 CAT CAG AC AA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1931 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 676; ORF 163>: 

ml 63 .pep 

1 MVTLTTLFFV CVLWLVLTV PDQVQMWLDR AKE VTFTEFS WFYVLTFSIF 

51 LGFLLILSVS SLGNIRLGRD EDVPEFGFLS WL AMLFAAGM GVGLMFFGV A 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEMGW IAENSFSV QV LIIAAVMSLA WSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 

401 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM . 

4 51 RSGGLGNLQS MTLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKQT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHRDE FAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml63 / gl63 98.6% identity in 660 aa overlap 

10 2C 30 40 50 60 

ml 63. pep M V I LT T L FFVC VL WL VLT V P DQVQMW LDRAKE V I FTE FS W FYVLT FS I FLG FLL I LS V S 

I I I I I I I 1 II ! N I 1 I I i I 1 1 I 1 1 M I I I I I M I I I I I I I I I I I I I I 1 1 I I I II I I I II I 
gl 63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAJCEVIFTEFSWFYVLTFSIFLGFLLILSVS 

10 2C 30 40 50 60 



70 8C 90 100 110 120 

ml 63 . pep SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
•IIII1IMIMII IMil llliilll inililMIIIIMIIIII III II 
gl63 GLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITVGAPEHRQQ 

70 80 90 100 110 120 



130 140 150 160 170 180 
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jul 63 . pep QALLHTVFHWGVHAWSVYGTIAIALAYFGFRYKLP1ALRSCFYPLLKEKI3GRFGDAIDI 
I I I I 1 I I II I 1 1 I 1 1 1 1 M I I II II II I I I I I I I I 1 1 I I I I i I I I I II I 1 1 I I I I I I | I | 
gl63 QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml63.pep MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 
I I I I I I I I I I I II I I I I I I I I I I II I I I I II I I i I I : I I I I I I I I II I I I I I I I I I I I M 
gl63 MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFGVQVLIIAAVMSLAWSAISGVGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
I I I I I I I I I I I I I I I I II I Ml I I I I I I I I I I I I I I I I II I I I I I I I : I I I I I I ! I II I 
g!63 GVKVLSELNLGLAFLLLFFVLAADPTVYLLSAFGDNIGNYLGNLVRLSLKTYAYEREHKP 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml63 . pep W FE S WT V LYWAWW C SWAP FVGLFI AR I SKGRT I RE FV FG VLL I PGLFG VLW FT V FGNTA I 

I I I I II I I I 1 1 I I II I 1 1 1 1 I I I 1 1 M II I I I 11 I 1 1 I I II I I I I I M 1 1 1 I I I I I I I I I 
gl63 W FE S WT V L YW AWW C SWA P FVG LF I AR I S KGRT I RE FV FG V LL I PGL FG VLW FT VFGNT A I 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml63.pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

I t I I 1 J I I I I I I II I I I M I M I I I I I I I I I II I I I I I I I I I I I I I I I II I i I I I 

gl63 WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 63 . pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 
I I 1 I I I I I I I I I I I I I I f I I I I I I I I I I | I | I | I I M I I I I I I I I I I I I I I I I I I I I I I I 
gl63 ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 6 3 . pep WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTASPAMHELQR 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I : I I I I I II I I I I M I I : I I I I I I II I I I 
g!63 WKGLSADKKYFETRVNPTSVFWTGGKWKERLVRIMSQTQEQDILKFLKHTASPAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 63. pep ELSEEYGLSVRVDKMFHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 
I I I ! I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
gl63 ELSEEYGLSVRVDKMFHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 
I t I I M I I I I I M I II I I I I I I I M I I I I ! I I M I I I I I I I I I I II I I I I I I I I I I I I I 
gl 63 H QT T YK P YA Y F FDGRVG Y D VQ YMNKDE L I AD I LKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml 63. pep X 
I 

gl63 X 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 677>: 

al63. seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACAT GAG GCT 
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201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 



CGGACGGGAT 
TGCTGTTTGC 
GAGCCGTTGA 
CAGGCAGCAG 
CTTGGTCGGT 
CGCTACAAGC 
AGAAAAAATT 
TTGCTACTTT 
CTGGGCGCCG 
CGTGCAGGTT 
CAATATCCGG 
GGTCTTGCGT 
TTACCTGTTG 
TGGTGCGCCT 
TGGTTTGAAT 
GCCGTTTGTG 
GCGAGTTTGT 
TGGTTTACCG 
GGGGGGAGTG 
AATTCTTTAA 
CTGGTCATTT 
CCTGAACAAT 
AGGCGGTTAT 
CGCTCGGGCG 
GCCGTTTGCC 
TGAGTGCGGA 
TTTTGGACGG 
GACGCAGGAG 
CTATGCACGA 
CGGGTCGATA 
TCGGAAAGAG 
AGGATGTATC 
CAT CAGAC AA 
GTACGATGTG 
AAAACTACGA 
ATGGCGCACG 



GAAGATGTGC 
GGCCGGGATG 
TGCATTATTT 
CAGGCATTGC 
GTACGGTACG 
TGCCGCTTGC 
TCCGGAAGGT 
TTTCGGCATC 
GATTGCAGGA 
TTGATTATCG 
CGTGGGGAAG 
TTTTGCTGCT 
TCGGCATTCG 
CAGTTTTAAA 
CTTGGACGGT 
GGTTTGTTTA 
CTTCGGGGTT 
TCTTCGGCAA 
CTCGAAAAGA 
TTACCTCCCC 
CTCTGTTTTT 
ATTACCTCTC 
GTGGGGCGTG 
GACTCGGCAA 
CTGCTGATGC 
TAAGAAATAT 
GCGGCAAGTG 
CAGGATATTT 
GTTACAACGG 
AGATGTTTCA 
ACGATGCGCG 
CGACCAGTTG 
C7TACAAACC 
CAGTATATGA 
ACGTTATTTG 
AGCAGGTGGA 



CGGAATTCGG 
GGCGTGGGTC 
TTCGGACATT 
TGCACACGGT 
ATTGCATTGG 
CCTGCGTTCT 
TCGGCGATGC 
ATCACCACAT 
AATAGGCTGG 
CCGCCGTCAT 
GGTGTGAAGG 
GTTTTTTGTT 
GCGACAACAT 
ACTTATGCGT 
GCTTTATTGG 
TCGCGCGCAT 
TTGCTCATCC 
TACGGCGATT 
TGACCTCCTC 
CTGCCCGAAT 
TGTAACTTCT 
GGGACAAAGG 
CTGATGTCTG 
CCTGCAGTCT 
TGATAATGTG 
TTTGAGACCC 
GAAAGAACGG 
TAAAATTCCT 
GAGCTTTCGG 
TCAGGACGAG 
ATTTTATGTA 
ATTAACGACG 
CTACGCTTAT 
ACAAGGACGA 
ATGTTGTTGG 
ATTGGCAGAG 



CTTCCTGTCG 
TGATGTTTTT 
ACGGCCGGCA 
GTTCCATTGG 
CTTTGGCTTA 
TGTTTTTACC 
CATTGATATT 
TGGGGTTCGG 
ATTGCCGAAA 
GTCCCTCGCC 
TGTTGAGCGA 
TTGGCGGCGG 
AGGGAACTAC 
ACGAACGGGA 
GCGTGGTGGT 
TTCAAAGGGG 
CCGGCCTGTT 
TGGCTGAATG 
TCCGGAAACG 
TGACGAGCAT 
GCCGATTCCG 
CTTGAGCGCG 
CCGTTGCCGT 
ATGACCCTGA 
TTTCAGCCTG 
GGGTTAACCC 
CTGGTGCAGA 
CAAACATACC 
AAGAATACGG 
CCCGCAATCG 
CGGGATTAAG 
GCAAGCTGCC 
TTTTTCGACG 
GCTGATTGCC 
ATGATGTCGG 
TAA 



TGGCTGGCGA 
CGGCGTGGCA 
CGCCGGAACA 
GGCGTTCACG 
TTTCGGTTTC 
CCCTGTTGAA 
APGGCGTTGC 
GGCTTCGCAA 
ACAGCTTCAG 
GTCGTTTCGG 
GTTGAACCTG 
GTCCCACTGT 
CTCGGAAATC 
ACACAAGCCG 
GTTCTTGGGC 
CGCACCATCC 
CGGCGTTTTG 
ACGGGGTTGC 
CTGCTTTTTA 
CGTCAGCCTG 
GGATTTATGT 
CCACGGTGGC 
TTTGCTGATG 
TTGTTTCCCT 
TGGAAAGGAT 
TACCAGTGTA 
TAATGAGCCA 
GCATCGCCCG 
CTTGAGCGTC 
AGTTCGTCAT 
TCTGTCGGGC 
GCATATCCGG 
GGCGCGTCGG 
GACATTTTGA 
TCAGGAACTG 



This corresponds to the amino acid sequence <SEQ ID 678; ORF 163.a>: 

a!63.pep 

MVILTTLFFV CVLWLVL TV PDQVQMWLDR AKEVIFTEFS WFYVLTFSI? 
LGFLLILSVS SLGNIRLGRD EDVPEFGFLS WLAMLFAAGM 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT 
CFYPLLKEKI SGRFGDAIDI MALLATFFGI 



EPLMHYFSDI 
RYKLPLALRS 

LGAGLQEIGW IAENSF5VQ V LIIAAVMSLA WSAI5 GVGK 
GLAFLLLFFV LAAGPTVYLL SAFGDNIGNY LGNLVRLSFK 



GVGLMFFGVA 
IALALAYFGF 
ITTLGFGASQ 



WFESWTVLYW 
WFTVFGNTAI 
LVISLFFVTS 
RSGGLGNLQS 
FWTGGKWKER 
RVDKMFHQDE 
HQTTYKPYAY 
MAHEQVELAE 



GVKVLSELNL 
TYAYEREHKP 
LLIPGLFGVL 



AWWCSWAPFV GLFIARISKG RTIREFVFGV 

WLNDGVAGGV LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 
ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM 
MTLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 
LVQIMSQTQE QDILKFLKHT ASPAMHELQR ELSEEYGLSV 
PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 
FFDGRVGYDV QYMNK DELIA DILKNYERYL MLLDDVGQEL 



m!63/al63 99.4% identity in 660 aa overlap 



10 20 30 40 50 60 

ml 63 . pep MVILTTLFFVCVLWLVLTVPDQVC^WLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 

MiiiiiiiiiMiiiiiiiiiMiiii nun iiMMiiiinii nun nun 

al63 MVILTTLFFVCVLV^/LVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 63 . pep SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
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I I I I I I I I I I I I I I I I I I i I I ! I I I I I i I I I 1 I I I I I I I I I I I I I I I I M I I 1 1 I I I t I I 
al63 S LGN IRLGRDEDVP2FG FLS W LAML FAAGMG VGLM FFG VAE P LM H Y FS DI T AGT PE HRQQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 63 . pep QALLHTVFHWGVHAWSVYGTIALA1JVYFGFRYKLPLALRSCFYPLLKEKI3GRFGDAIDI 
I I I I I I I I I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 1 1 M I i I I 1 1 I 1 1 I I I I 1 1 I I 1 1 1 1 I I I M I 

al63 QALLHTVFHVJGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml 63 . pep MALLATFFGI ITTLGFGASQLGAGLQEMGWIAENSFSVQVLI IAAVMSLAWSAISGVGK 

I I I I I II I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
al63 MALLATFFGI ITT LGFGASQLGAGLQEIGWIAENSFSVQVLI IAAVMSLAWSAISGVGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSET.NLGLAFTJJ.FFVI.AAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
I I I I I I I I I I 1 1 I I I I I I I I I II 1 1 I I I I I II M I I I I I I 1 1 I I I I I I 1 1 II I M I I I I I 

al63 G VKVLSELNLG LAFLL L FFVLAAG PTV Y LLS AFGDN I GN YLGN LVR.LS FKTYAYEREHK P 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml63.pep WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I 
al63 W FE S WT V L YW AWW C SWA P FVG LF I AR I S KGRT I RE FV FG VLL I PG L FGVLW FT V FGNT A I 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml63.pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 
I I I I I I I I I: I I I I I I I I I II I I II II I II I I I II I I I I I I I I I II I I M I I II II I I I I 
al63 WLNDGVAGGVLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 63 . pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 
I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
al63 ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml63 . pep WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTASPAMHELQR 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I : I I I 11 I I I I I I 
al63 WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKHTASPAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml63 . pep ELSEEYGLSVRVDKI4FHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

II I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I II I I I I 1 I I I I I 1 I I I I I II I I I I I I 
al63 ELSEEYGLSVRVDK14FHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQT T YKP Y A Y F FDG RVG Y D VQ YMNKDE L I AD I LKN YER Y LMLL D D VG QE LMAHEQVE LAE 

I I I II I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
al63 HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml63.pep 
al63 



X 
I 

X 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 679>: 

gl64.seq (partial) 

1 . .ATGAACACAT TTTTGAAAAA CAGCGAATAC GCGTATATCC TGAACGACTG 

51 CAAGGCGCGC TTCCTGTTCG CCTCGGCCGG CCTGTCAAAA GAATTGGCGG 

101 GCCTGAAGGC GCAAACGCCC GTCGAAAAAA TCATTTGGAC GGACAAAAGC 

151 CGGCCGGCCG GCGAAACGGC GGAAGGCGAT GCCTTTTTTG AAAACGTGCG 

201 CCGCTTCCCC GAAAAACCCG ACTTGGGCCG CCAACCCCGG ATAAATGATT 

251 TGGCACACAT CATCTACACC TCCGGCACGA CGGGGCATCC CAAAGGCGCG 

301 CTAATCAGTT ACGCCAACCT GTTCGCCAAC CTGAACGGCA TCGAACGCAT 

351 CTTtaaAATT TCCAAACGCG ACCGCTTTAT CGTTTTCctg ccgatgTTCC 

401 ACAGCTTCAC GCTGACGGCT ATGGTGCTGC TGCCGATTTA TATGGCGTGT 

451 TCGATTATTT TGGTCAAAtc cgttttCCCc ttttccaacG TTTTGAAACA 

501 GGCCCTGCTC AAACGCGCAA CCGTGTTTTT GGGCGTACCC GCGATTTACA 

551 CCGCGATGAG CAAGGCAAAA ATCCCTTGGT ATTTCAGATG GTTCAACCGC 

601 ATCCGCCTGT TTATCAGCGG CGGCGCGCCT TTGGCGGAAC AAACCATCCT 

651 CGATTTTAAA GCCAAGTTCC CCCGCGCCAA ATTGCTGGAA GGCTACGGAC 

701 TGAGCGAAGC CTCGCCCGTC GTCGCCGTCA ATACGCCCGA ACGGCAAAAA 

751 GCCCGCAGCG TCGGCATCCC CCTGCCCGGT TTGGAAGCCA AAGCCGTCGA 

801 TGAAGAATTG GTCGAAGTGC CGCGCGGCGA AGTGGGCGAA CTGATCGTCA 

851 GGGGCGGTTC GGTGATGCGG GGCTACCTCA ATATGCCTGC CGCCACCGAT 

901 GAAACCATCG TCAACGGCTG GTTGAAAACG GGCGATTTCG TTACCATAGA 

951 CGAGGACGGC TTTATCTTTA TCGTCGACCG CAAAAAAGAT TTGATTATTT 

1001 CCAAAGGTCA AAACGTCTAT CCGCGCGAGA TCGAAGAAGA AATCCACAAA 

1051 CTCGATGCCG TCGAAGCCGC CGCCGTCATC GGCGTGAAAG ACCGTTATGC 

1101 CGACGAGGAA ATCGTCGCCT TCGTCCAATT GAAGGAAGGT ATGGATTTGG 

1151 GCGAGGACGA aatccgccgc caccTGCGTA CCGTGCTGGC AAATTTCAAA 

1201 ATCCCCAAAC AGATCCACTT TAAAGACGGG CTGCCGCGCA ACGCTACGGG 

1251 CAAAGTATTG AAACGGGTGC TGAAGGAGCA GTTTGAAGGA AACAAATGA 

This corresponds to the amino acid sequence <SEQ ID 680; ORF 164.ng>: 

gl64.pap (partial) 

1 . .MNTFLKNSEY AYILNDCKAR FLFASAGLSK ELAGLKAQTP VEKIIWTDKS 

51 RPAGETAEGD AFFENVRRFP EKPDLGRQPR INDLAHIIYT SGTTGHPKGA 

101 LISYANLFAN LNGIERIFKI SKRDRFIVFL PMFHSFTL TA MVLLPIYMAC 

151 SIILV KSVFP FSNVLKQALL KRATVFLGVP AIYTAMSKAK IPWYFRWFNR 

201 IRLFI SGGAP LAEQTILDFK AKFPRAKLLE GYGLSEASPV VAVNTPERQK 

251 ARSVGIPLPG LEAKAVDEEL VEVPRGEVGE LIVRGGSVMR GYLNMPAATD 

301 ETIVNGWLKT GDFVTIDEDG FIFIVDRKKD LIISKGQNVY PRE IEEE IHK 

351 LDAVEAAAVI GVKDRYADEE IVAFVQLKEG MDLGEDEIRR HLRTVLANFK 

401 IPKQIHFKDG LPRNATGKVL KRVLKEQFEG NK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 68 1>: 

m!64 . seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTCGCCG CCGCCTGCCG 

51 CAAAAACGGA AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAG GCCGAAGCCG TCGCGGCGTA TCTGCAAAAT 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 AGAATTTATT ACCGCCTATT TCGCCATCTC CGCCATCGGC GCGGTCGCCG 

251 TACCGATGAA CACATTTTTG AAAAACAGCG AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGCTTG AAGGCGCAAA CGCCCGTCGA AAAAAT CAT T TGGACGGACA 

401 AAAGCCGTCC GACCGGCGAA ACGGCGGAAG GCGATGCCTT TTTTGAAGAC 

451 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

501 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CATCCCAAAG 

551 GCGCGCTAAT CAGTTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

601 CGCATCTTTA AAATTTCCAA GCGCGACCGC TTTATCGTTT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CGGCTATGGT GCTGCTGCCG ATTTATATGG 

701 CGTGTTCGAT TATTTTGGTC AAATCCGTTT TTCCGTTTTC CAACGTTTTG 

751 AAACAGACAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TACCCGCGAT 

801 TTACACCGCG ATGAGCAAGG CGAAAATCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATTCG CCTGTTTATC AGCGGCGGCG CGCCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGAAGGCTA 
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951 CGG ACT GAGC GAAGCCTCTC 

1001 AAAAAGCCCG CAGCGTCGGC 

1051 GTCGATGAAG AATTG3TCGA 

1101 CGTCAGGGGC GGTTCGGTGA 

1151 CCGATGAAAC CATCGTCAAC 

1201 ATAGACGAAG ACGGCTTTAT 

1251 TATTTCCAAA GGTCAAAATG 

1301 ACAAACTCGA TGCCGTCGAA 

1351 TATGCCGACG AGGAAATCGT 

14 01 TTTGGGCGAG AACGAAATCC 

14 51 TCAAAATCCC CAAACAAATC 

1501 ACGGGCAAGG TATTGAAACG 

1551 ATGA 



CCGTCGTCGC CGTCAATACG CCCGAGAGGC 
ATCCCCCTGC CCGGTTTGGA AGCCAAAGCC 
AGTGCCGCGC GGCGAAGTGG GCGAACTGAT 
TGCGGGGCTA CCTCAATATG CCTGCCGCCA 
GGCTGGTTGA AAACGGGCGA TTTCGTTACC 
CTTTATCGTC GACCGCAAAA AAGATTT GAT 
TCTATCCGCG CGAGATTGAA GAAGAAATCT 
GCCGCCGCCG TCATCGGCGT GAAAGACCGT 
CGCCTTCGTC CAATTGAAGG AAGGTATGGA 
GCCGCCACCT GCGTACCGTG CTGGCAAATT 
CACTTTAAAG ACGGGCTGCC GCGCAACGCT 
GGTGTTGAAG GAGCAGTTTG ACGGAAACAA 



This corresponds to the amino acid sequence <SEQ ID 682; ORF 164>: 

ml 64 . pep 

1 mnrtyanfye mlaaacrkng ngtavfdgke ktayralkqe aeavaaylqn 

51 IGVKUXjDTVA lavsnste fi tayfaisaig avavpm ntfl knseyayiln 

101 DCKARFLFAS AGLSK3LAGL KAQTPVEKII WTDKSRPTGE TAEGDAFFED 

151 VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPKGALISYA NLFANLNGIE 

201 RIFKISKRDR FIVFLPMFHS FTL TAMVLLP IYMACSIILV KSVFPFSNVL 

251 KQTLLKRATV FLGVPAIYTA MSKAKIPWYF RWFNRIRLFI SGGAPLAEQT 

301 ILDFKAKFPR AKLLEGYGLS EASPWAVNT PERQKARSVG IPLPGLEAKA 

351 VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAATDETIVN GWLKTGDFVT 

4 01 IDEDGFIFIV DRKKDLIISK GQNVYPREIE EEIYKLDAVE AAAVIGVKDR 

451 YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LANFKIPKQI HFKDGLPRNA 

501 TGKVLKRVLK EQFDGNK* 



Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from TV. gonorrhoeae m!64 / gl64 98.6% identity in 432 aa 

overlap 



60 70 80 90 100 110 

ml 64 , pep GDTVALAVSNSTEFITAYFAISAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSK 

I t I t I I I I I I I I I I ! I I I I I i I I I I I I 1 I I 
gl 64 MNT FLKNS E YAY I LN DCKARFL FAS AGLS K 

10 20 30 



120 130 140 150 160 170 

ml 64 . pep ELAGLKAQTPVEKI IWTDKSRPTGETAEGDAFFEDVRRFPEKPDLGRQPRINDLAHIIYT 

I I I M 1 1 I I 1 1 I 1 II I I I I I I I : I I 1 1 I M I 1 1 I: I II I I I I I MM I I I I 1 1 M I I I I I 
gl64 EliAGLKAQT PVEKI IWT DKS RPAGETAEGDAFFENVRRFPEKPDLGRQPRINDLAHII YT 

40 50 60 70 80 90 



180 190 200 210 220 230 

ml64.pep SGTTGHPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMAC 
M I I II I I I I I M M I I II I I I I I I I II 1 1 I I I I I I I M II I I M II I II I I I I I I II I I 
gl64 SGTTGHPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMAC 
100 110 120 130 140 150 



240 250 260 270 280 290 

ml64.pep SII LVKSVFPFSNVLKQTLLKRATVFLGVPAI YTAMSKAKI PWYFRWFNRIRLFI SGGAP 

II II M I I II I II I I I I : I I I I | I ! I I I I I I I I I I I I M I M I I I I I I I II I I I II I I I I 
gl64 SII LVK S VF P F S N V LKQ ALLKRAT V FLG V PA I YT AM SKAK I PWY FRW FNRI RL FI SGGA P 

160 170 180 190 200 210 



300 310 320 330 340 350 

ml 64 .pep LAEQTILDFKAKFPRAKLLEGYGLSEASPWAVNTPERQKARSVGIPLPGLEAKAVDEEL 
M I I M I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I 
g 1 64 LAEQT I LDFKAKFPRAKLLEGYGLS EAS P VVAVNT PERQKARSVG I PLPGLEAKAVDEEL 

220 230 240 250 260 270 

360 370 380 390 400 410 
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ml 64 .pep VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 

I I I I U I I 1 1 I I I I 1 1 I II II M III I I I I 1 1 I 1 1 I 1 1 1 1 I I I II I I 1 1 I II I I I I I I I I 
gl64 VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 
280 290 300 310 320 330 

420 430 440 450 460 470 

ml64 . pep LIISKGQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRR 
I 1 ! I M I i I I I I I I I I M : I M I I I I I I II I M I II M I I ! I I I I I I I I I I M I i : I M I 
gl64 LIISKGQNVYPREIEEEIHKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGEDEIRR 
340 350 360 370 380 390 

480 490 500 510 

ml64 . pep HLRTVLANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I 
g 1 6 4 HLRTVLAN FKIPKQ IHFKDGL PRN ATGKVLKRVLKEQFEGNKX 

400 410 420 430 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 683>: 

al64 .seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTGACCG CCGCCTGCCG 

51 CAAAAACGGC AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAA GCCGAAGCCG TTGCGGCGTA TCTGCAAAAT 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 GGAATTTATT ACCGCCTATT TCGCCGTATC CGCCATCGGC GCGGTTGCCG 

251 TACCGATGAA CACATTTTTG AAAAACAGCG AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGC TTG AAGGCGCAAA CGCCCGTCGA AAAAATCATT TGGACGGGCC 

4 01 AAAGCCGTCC GGACGGCGAA ATGGCGGAAG GCGATGCCTT TTTTGAAGAC 

4 51 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

5C1 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CATCCCAAAG 

551 GTGCGCTAAT CAGCTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

6C1 CGCATCTTTA AAATCTCCAA GCGCGACCGC TTTATCG7TT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CGGCTATGGT GCTGCTGCCG ATTTATATGG 

7 CI CGTGTTCGAT TATTTTGGTC AAATCCGTTT TCCCCTTTTC CAACGTTTTG 

7 51 AAACAGGCAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TGCCCGCGAT 

801 TTACACCGCG AT GAGCAAGA CGAAAATCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATCCG CCTGTTTATC AGCGGCGGAG CACCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGAAGGCTA 

951 CGGACTGAGC GAAGCCTCGC CCGTCGTCGC CGTCAATACG CCCGAGAGGC 

1001 AAAAAGCCCG CAGCGTCGGC ATCCCCCTGC CCGGTTTGGA AGTCAAAGCC 

1051 GTCGATGAAG AATTGGTCGA AGTGCCGCGC GGCGAAGTGG GCGAACTGAT 

1101 CGTCAGGGGC GGTTCGGTGA TGCGGGGCTA CCTCAATATG CCTGCCGCCA 

1151 CCGATGAAAC CATCGTCAAC GGCTGGTTGA AAACGGGCGA TTTCGTTACC 

1201 ATAGACGAAG ACGGCTTTAT CTTTATCGTC GACCGCAAAA AAGATTTGAT 

1251 TATTTCCAAA GGTCAAAATG TCTATCCGCG CGAAATCGAA GAAGAAATCT 

1301 ACAAACTCGA TGCCGTCGAA GCCGCCGCCG TCATCGGCGT GAAAGACCGT 

1351 TATGCCGACG AGGAAATCGT CGCCTTCGTC CAATTGAAGG AAGGTATGGA 

14 01 TTTGGGCGAG AACGAAATCC GCCGCCACCT GCGTACCGTG CTGGCAAATT 

1451 TCAAAATCCC CAAACAAATC CACTTTAAAG ACGGGCTGCC GCGCAACGCT 

1501 ACGGGCAAGG TATTGAAACG GGTGTTGAAG GAGCAGTTTG ACGGAAACAA 

1551 ATGA 



This corresponds to the amino acid sequence <SEQ ID 684; ORF 164.a>: 
al64 .pep 

1 MNRTYANFYE MLTAACRKNG NGTAVFDGKE KTAYRALKQE AEAVAAYLQN 

51 IGVKFGDTVA LAVSNSTE FI TAYFAVSAIG AVAVP MNTFL KNSEYAYILN 

101 DCKARFLFAS AGLSKELAGL KAQTPVEKII WTGQSRPDGE IVJAEGDAFFED 

151 VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPJCGALISYA NLFANLNGIE 

201 RIFKISKRDR FIVFLPMFHS FTL TAMVLLP IYMACSIILV KSVFPFSNVL 

251 KQALLKRATV FLGVPAIYTA MSKTKIPWYF RWFNRIRLFI SGGAPLAEQT 

301 ILDFKAKFPR AKLLEGYGLS EASPVVAVNT PERQKARSVG IPLPGLEVKA 

351 VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAATDETIVN GWLKTGDFVT 

4 01 IDEDGFIFIV DRKKDLI3SK GQNVYPREIE EE I YKLDAVE AAAVIGVKDR 

451 YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LANFKIPKQI HFKDGLPRNA 
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501 TGKVLKRVLK EQFDGNK* 

ml64/al64 98.3% identity in 517 aa overlap 

10 20 30 40 50 60 

ral64 . pep ^RTYANFYEMLAAACRKNGNGTAVFDGKEKTAYRALKQEAEAVAAYLQNIGVKFGDTVA 

I I I I I I I I t I II: I 1 1 I I I I I I I 1 I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I 
a 1 6 4 MNRT YANFYEMLTAACRKNGNGTAVFDGKEKTAYRALKQEAEAVAAYLQNIGVKFGDTVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 64 .pep LAVS N ST E FI T A YFAI S A IG AVA V PMN T FLKN S E Y A Y I LN DCKAR FL FAS AG L S KE LAG L 

II IMIIMIIII Ihllillllllllll Mlil! IIIMIIIIIilllllillllilll 

al64 LAVSNSTEFITAYFAVSAIGAVAVPMNTFLKNSEYAYILNDCECARFLFASAGLSKELAGL 
70 80. 90 100 110 120 

130 140 150 160 170 180 

ml 6 4 . pep KAQTPVEKIIWTDKSRPTGETAEGDT^FFEDVRRFPEKPDLGRQPRINDLAHIIYTSGTTG 
I II I I I I II I I I : I I I II I I I I I ! I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 1 
al64 KAQTPVEKIIWTGQSRPDGEMAEGDAFFEDVRRFPEKPDLGRQPRINDLAHIIYTSGTTG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml64 . pep HPKGALI S YANLFANLNGIERI FKISKRDRFIVFLPMFHSFTLTAMVLLPI YMACS I ILV 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I ! I I I M I I I ! I II 
al64 HPKGALISYAKLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMACSIILV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 64 . pep KSVFPFSNVLKQTLLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAPLAEQT 
I I I I 1 1 I I I I II : I II 1 1 I 1 1 I I I I I I I I 1 1 I I: I 1 1 I 1 1 MM 1 1 I II I I 1 1 I 1 1 I I I I 
al64 KSVFPFSNVLKQALLKRATVFLGVPAIYTAMSKTKIPWYFRWFNRIRLFISGGAPLAEQT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 64 . pep ILDFKAKFPRAKLLEGYGLSEAS PWAVNTPERQKARSVGIPLPGLEAKAVDEELVEVPR 

I I I II 1 1 I I I I I I I M II I I I I 1 1 I I 1 1 II I M II II ■ I I Ml I 1 1 I : I I I I I I 1 1 1 I I I 
al64 ILDFKAKFPRAKLLEGYGLSEAS PWAVNT PERQKARSVGI PLPGLEVKAVDEELVEVPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 64 . pep GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTID5DGFIFIVDRKKDLIISK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I M I I I I I I I I I I I 
al64 GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTID2DGFIFIVDRKKDLIISK 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 64 . pep GQNVYPREIEEEI YKLDAVEAAAVIGVKDRYADEEIVAFVQLXEGMDLGENE I RRHLRTV 

I M 1 1 1 1 1 m ii 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 : 1 I I I I I I I I M I I I I I I I I I I I 

al 64 GQNVYPREIEEEI YKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRRHLRTV 

430 440 450 460 470 480 

490 500 510 

ml 64 . pep LANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 

I I I I I I I I I I I I I I U I I I I I I I I I II I I I I I I I II I 
al64 LANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 

490 500 510 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 685>: 

gl 65. seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 
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51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctgGTCGC GGAAGGCAAG 

301 TTGGAagaCA ATTCCTTCAT CAATGCcgtg ccgcatatGT Ctttggtgat 

351 gAacgaagac cactgCCgtt acCTGCAAAA ACGCTATGAT GTGTTTAAAA 

401 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGgacgaaA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG Aaggcacgga tgtcgATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CcTGCTGGgC gAaTTGCgtt aa 



This corresponds to the amino acid sequence <SEQ ID 686; ORF 165.ng>: 

gl65.pep 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALESSNAWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 687>: 

m!65.seq (partial) 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCCTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGgTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCGCTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAA. . . 



This corresponds to the amino acid sequence <SEQ ID 688; ORF 165>: 

ml 65. pep (partial) 

1 MAEAT DWLV GGGIMSATLG VLL KELEPSW EITLIERLED VALESSNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
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201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR N5NPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLCAGWA 

351 NMPLTK. . . 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml65 / gl€5 97.2% identity in 356 aa overlap 



10 20 30 40 50 60 

m!65 . pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 

gl65 MAE AT D WL VGGG IMS ATLG VLLKELE PS WE I T L I ERLE D VALE S S NAWNN AGTG H S ALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml65 . pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
I I I I I I I I I : I : I: I I I I M I I I I I I I I I I I I I I I I H I I I I I I I! I i I I I I I I I I I I I I 
gl65 ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml65 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRD3NQPVAANYSAEGTDVDF 
I I I II I I I I : I I I I I I I I I I I I I I I (I I It I I I I I: i I I II I I II I I II I ! I I I I I I I I 
gl65 HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRGRDENQPVAANYSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 6 5 . pep GRLTRQMVKYIiQGKG\ r KTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
gl 6 5 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDWQLTLRTRFLFLGA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml65 . pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAXVYGQASVGAPPMSVPHL 
I I I I I I I I I I I I I I II I I I 1 I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl65 G GG ALT L LQK SG I PEG KG YGGLPV S G L FFRN S N P E T AEQHN AKVY GQAS VG A PPM S V P H L 

250 260 270 280 290 300 



310 320 330 340 350 

ml65 . pep DTRN VDGKRHLMFG PYAGFR SN FLKQG S LM DL PLS I HMDNLY PMLCAGW ANM PLT K 

I I I I I I I I I I I I I 1 I I I I I I I I II I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl65 DTRN V DG KR H LMFG P YAG FRSN FLKQG SFMDLPLSI HMDN L Y PMLRAGW ANM P LT KYLLG 

310 320 330 340 350 360 



gl65 ELRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 689>: 

al65. seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

.101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

4 01 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

4 51 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 
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601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 



AACCGCCACG 
CAAAACCGCC 
GCTTCCTCTT 
TCCGGCATCC 
GTTCTTCCGC 
TGTACGGGCA 
GACACACGCA 
AGGCTTCCGT 
TGTCCATCCA 
AATATGCCGC 
AGAACGCTTC 
ACTGGGAACT 
TCCGAAAAAG 
CGACGGCTCG 
CCGTGCCGCT 
CCGTCTTGGG 
GTTGAACGAA 
AAGTGTTGGA 



TCGAAGACAT 
GATACCCGCA 
CCTCGGCGCG 
CCGAAGGCAA 
AACAGCAACC 
GGCTTCCGTC 
ACGTGGACGG 
TCCAACTTCC 
TATGGACAAC 
TGACCAAATA 
GCCTCCCTGC 
CATCACCGCA 
GCGGCGTGTT 
CTCGCCGCAT 
GATGATCCGG 
AAGGCCGTCT 
AACCCCGAAA 
TATTTAA 



CAAACGCGAA 
ACCCCGACGG 
GGCGGCGGCG 
AGGCTACGGT 
CCGAAACCGC 
GGCGCGCCGC 
CAAACGCCAC 
TCAAGCAAGG 
CTCTATCCTA 
CCTGCTGGGC 
TGGAATACTA 
GGG CAACGCG 
GCAGTTTGGT 
TGCTGGGCGC 
CTGATGCACC 
GAAAGAGCTG 
GGGCGGATGA 



TCCGACGGCG 
GCAGCTCACC 
CGCTGACCCT 
GGCTTTCCCG 
CGAACAACAC 
CGATGTCCGT 
CTTATGTTCG 
CTCACTTATG 
TGCTGCGCGC 
GAATTGCGTA 
CCCCGAGGCA 
TTCAAATCAT 
ACGGAGATTG 
GTCGCCGGGC 
AATGCTTCCC 
GTACCGGGTT 
AATTATCGCC 



CGTGGGTGCT 
CTCCGTACCC 
GCTGCAAAAA 
TGTCCGGCCT 
AACGCCAAAG 
CCCGCACCTC 
GCCCTTACGC 
GATTTGCCGC 
CGGCTGGGCG 
AAACCAAAGA 
AACCCCGACG 
TAAAAAAGAC 
TCGCACACGC 
GCATCGACCG 
CGAACGCACC 
ACGGCATCAA 
TATACCGCGA 



This corresponds to the amino acid sequence <SEQ ID 690; ORF 165. a> 

al65.pep 

MAEATD VVLV GGGIMSATLG VLL KELEPSW EITLIERLED 

IAEQFHVSRQ 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 



VALES SNAWN 

NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 
LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 
SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 
SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 
DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 
NMPLTKYLLG ELRKTKEERF AS L LEY Y PEA NPDDWELITA GQRVQIIKKD 
SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 
PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI * 



ml65/a!65 



99.7% identity in 356 aa overlap 



10 20 30 40 50 60 

ml 65 . pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I I I I I I I I I I I I t I I i I I E I M I I I I I I I I I I [ I ! I I i I I I I I I I I I M I I I I ! I I I I I 
al65 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml65.pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
H M II II II MM II MM Ml! Ml III II! MMIItllll M MMHllll II II 
al65 ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml65 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKI SDWAPLMMRGRDENQPVAAN YSAEGTDVDF 

I 1 I I I I I I I 1 = I I I I I I I I I I 1 I I t t I I I I I I I I I i t I I I I I I I t I I f I I I I I I 1 I I I I 1 
al65 HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml65 . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
M II II II II MM M I III Ml I MMII Ml Ml MIMIII II I II IMMII II II 
a 1 65 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRN PDGQLTLRTRFLFLGA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml65.pep G GGALTLLQK SGI P EGKG YGG F P V S G L F FRN S N PET AE QHNAKV YGQAS VG A PPMS V PH L 

I I M M M I I I I I M I I M I I I M I I I I I I M I I I I M I i M I M M t I I M M I 

al65 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPM3VPHL 

250 260 270 280 290 300 
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310 320 330 340 350 

ml 65 . pep DT RN V DGKRH LM FG P Y AG FR S N FLKQG SLMDLPLSIHMDN L Y PMLC AGW ANM PLT K 

I I I I I I I I I I I 1 I I I I t I I I II I I II I I I I I i II II I I I I I I I I I I I I I I II I I I 
al65 DTRNVDGKRHLMFGPYAGFRSNFLKOGSLMDLPLSIHI1DNLYPMLRAGWANMPT.TKYLLG 

310 320 330 340 350 360 

al65 E LRKT KEERFAS LLE Y Y P E AN P D DWE L I T AGQR VQ 1 1 KK DS E KG G V LQ FGT E I V AHADG S 

370 380 390 400 410 420 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 69 1>: 

gl65-l.seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctggTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAACGAAGAC CACTGCCGTT ACCTGCAAAA ACGCTATGAT GTGTTTAAAA 

401 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CtCCgCTGAT TATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

8C1 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTtt gCCTCCCTGC TGgaatacta cccGaggcag acccGACGAc 

1151 tggtactcat cacgcaggnc acGCGTcata tcattanata tgactCgaaa 

1201 ctgcgcgtgc tgcagttgta cgagattgtg ccaCGCGacg ctcgctcgcg 

1251 cattctggag cgtcgcggcg catcacgctn tgcgctgata tccgctgatg 

1301 acactgctcc gaGCGcgccc gtcttggaaa gtgtctga 

This corresponds to the amino acid sequence <SEQ ID 692; ORF 165-l.ng>: 

gl 65-1. pep 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LUGKGVKTEF 

201 NRHVEDI KRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPRQ TRRLVLITQX TRHIIXYDSK 

401 LRVLQLYEIV PRDARSRILE RRGASRXALI SADDTAPSAP VLESV* 

The following partial DNA sequence was identified in K meningitidis <SEQ ED 693>: 

nl65-l.eeq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 
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851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 



TGTACGGGCA 
GACACACGCA 
AGGCTTCCGT 
TGTCCATCCA 
AATATGCCGC 
AGAACGCTTC 
ACTGGGAACT 
TCCGAAAAAG 
CGACGGCTCA 
CTGTGCCGCT 
CCGTCTTGGG 
GTTGAACGAA 
AAGTATTGGA 



GGCTTCCGTC 
ACGTGGACGG 
TCCAACTTCC 
TATGGACAAC 
TGACCAAATA 
GCCTCCCTGC 
CATCACCGCA 
GCGGCGTGCT 
CTCGCCGCAT 
GATGATCCGG 
AAGACCGTCT 
AACCCTGAAA 
TATTTAA 



GGCGCGCCGC 
CAAACGCCAC 
TCAAGCAAGG 
CTCTATCCTA 
CCTGCTGGGC 
TGGAATACTA 
GGGCAACGCG 
CCAGTTTGGT 
TGCTGGGCGC 
CTGATGCACC 
GAAAGAGCTG 
GGGC5GATGA 



CGATGTCCGT 
CTTATGTTCG 
CTCGCTTATG 
TGCTGTGCGC 
GAATTGCGTA 
CCCCGAGGCA 
TCCAAATCAT 
ACGGAGATTG 
GTCGCCGGGC 
AATGCTTCCC 
GTACCGGGTT 
AATTATCGCC 



CCCGCACCTC 
GCCCTTACGC 
GATTTGCCGC 
CGGCTGGGCG 
AAACCAAAGA 
AACCCCGACG 
TAAAAAAGAC 
TCGCCCACGC 
GCATCGACCG 
CGAGCGCGCC 
ACGGCATCAA 
TATACCGCGA 



This corresponds to the amino acid sequence <SEQ ED 694; ORF 165-1>: 

ml 65-1 .pep 

1 MAEATDWLV 



51 NAGTGHSALC 

101 LEDNSFINAV 

Ibl SDWAPLMMRG 

201 NRHVEDIKRE 

251 SGIPEGKGYG 

301 DTRNVDGKRH 

351 NNPLTKYLLG 

401 SEKGGVLQFG 

451 PSWEDRLKEL 



GGGIMSATLG 
ELNYAPLGAN 
PHMSLVMNED 
RDLNQPVAAN 
SDGAWVLKTA 
GFPVSGLFFR 
LMFGPYAGFR 
ELRKTKEERF 
TEIVAHADGS 
VPGYGIKLNE 



VLLKELEPSW 
GIIDPARALN 
HCSYLQKRYD 
YSAEGTDVDF 
DTRNPDGQL7 
NSNPETAEQH 
SNFLKQGSLM 
ASLLEYYPEA 
LAALLGASPG 
NPERADE2 IA 



EITLIERLED 
IAEQFHVSRQ 
AFKTQKLFEN 
GRLTRQMVKY 
LRTRFLFLGA 
NAKVYGQASV 
DLPLSIHMDN 
NPDDWELITA 
ASTAVPLMIR 
YTAKVLDI * 



VALES SK AWN 
FWATLVAEGK 
MEFSTDRNKI 
LQGKGVKTEF 
GGGALTLLQK 
GAPPMSVPHL 
LYPMLCAGWA 
GQRVQIIKKD 
LMHQCFPERA 



ml65-l/gl65-l 89.7% identity in 428 aa overlap 

10 20 30 40 50 60 

m!65-l . pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I t I I I I I I E i I ! i i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ( I I I I I I I I I! I I I ! I 
gl65-l MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 65-1. pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

I I I I i I II I: I: 1:1 I I I I I I 1 I I I I ! I I I I I I ! I I I I I I I I II I I I I I (I ! I I I I I I I I 
g!65- 1 ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 65-1 . pep HCSYLQKRYDAFKTQKLFENMEF3TDRNKISDWAPLMMRGRDENQPVAAN YSAEGTDVDF 

II I I I I I I I : I I I ! II I I I I I II I I I I II I I I I I I ; I I I I I I I I I I I I I I I I I I I I II I 
g 165-1 HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRGRDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 65-1 . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGOLTLRTRFLFLGA 
I II I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl 65- 1 GRLTRQMVKYLQGKGVKTEFNRHVED3 KRESDGAWVLKTADTRNPDWQLTLRTRFLFLGA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 65- 1 . pep GGGALTL LQKSG I PEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I I I I I I I I I I i I I i I I I I I I I : I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 1 I I ! I 
gl65-l GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 65-1. pep DTRKVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 
I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 1 1 1 1 I 1 1 I I I I • I 
g 165-1 DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 65-1. pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

I I I I I I I I I I I I I I 1 I I I : : III I:|| I t I I I I : : I : 

gl65-l ELRKTKEERFASLLEYYPR-O/TRRLVLITQXTR-HIIXYDS-KLRVLOLYEIVPRDARSR 

370 380 390 400 410 



430 



440 



450 



4 60 



470 



480 
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ml65-l . pep LAALLGASPGASTAVPI^IRLMHOCFPERAPSWEDRLKELVPGYGIKLNENPERADEIIA 
III 

gl65-l ILERRGASRXALI SADDTAPSAPVLESVX 

420 430 440 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 695>: 

al65-1.8oq 

1 ATGGCTGAAG CGACAGACGT TGTCT7GGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGCATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCC3T 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGT GGCTTTCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCACTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAA?ACTA CCCCGAGGCA AACCCCGACG 

1151 ACT3GGAACT CATCACCGCA GGGCAACGCG TTCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGTT GCAGTTTGGT ACGGAGATTG TCGCACACGC 

1251 CGACGGCTCG CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CCGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAACGCACC 

1351 CCGTCTTGGG AAGGCCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

1401 GTTGAACGAA AACCCCGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

1451 AAGTGTTGGA TATTTAA 

This corresponds to the amino acid sequence <SEQ ID 696; ORF 165-l.a>: 

al65-l.pep 

1 MAEAT DVVIV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SHAWN 

51 NAGTGHSALC ELNYAPLGAN GItDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNK1 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVXQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 

451 PSW3GRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 

al65-l/ml65-l 99.4% identity in 486 aa overlap 

10 20 30 40 50 60 

al 65- 1 . pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIEKLEDVALESSNAWNNAGTGHSALC 
I I I I I I I 1 I I I 1 I I I I I I I I I ! I I I I M I I I > I I ! I I ] I I I I I I I I I I I I I I I I I I I I M 
ml €5-1 MAEATDVVLVGGGIMSATLGVLLKEI»EPSWEITLIERLEDVAIiESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

al 65-1 .pep ELN Y A P L G AN G 1 1 D P ARALH I AEQFH V S RQFW ATLVAEG KL EDNS F I N AVPHM S L VMN E D 
I I I 1 I I I I I I I I 1 I t I t I I r t I I I I I I I I I I I I I I I I I I I I | f I I I I I | I | | | 1 | | 1 | | | 
ml 65- 1 ELNYAPLGANGI I DP ARALW I A EQFHVSRQFWATLVAEGKLEDNSFINAV PHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

al65-l.pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 

i ti tiiiiiiiifiiiiiiiiiiiiintiiiiiiiii iiiiiiiiiin ii ii ti nil 

ml 65-1 HCS YLQKRYDAFKTQKLFENMEFSTDRNK IS DWAPLMMRGRDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 
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al65-l .pep GRLTRCMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1! I I I I I I I I I I I I I I I I I f I I I t I I | | E | 
ml 65-1 G RLTRQMVKYLQGKG VKTE FN RHV ED I KRE S DGAW VLKT A DT RN P DGQLTLRTRFL FLG A 

190 200 210 220 230 240 

250 260 270 280 29C 300 

a 165-1 . pep GGGALTLLQKSGIPEGRGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

mmmmmmmmmmmmmmmmm mmmmmmmmmmi 

ml65-l GGGALTLLQKSGIPEGKGYGGFPVS3LFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
250 260 270 280 290 300 

310 320 330 340 350 360 

a 165-1 .pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

I I 1 1 I I I I J I 1 1 1 1 1 1 1 1 I 1 1 I I I r 1 t I 1 1 I 1 I I I I I t r I I I 1 I I I I! I MM MINI 
ml 65-1 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 

310 320 330 340 350 360 

370 380 390 400 410 420 

a 1 65-1 . pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQI IKKDSEKGGVLQFGTEI VAHADGS 
M M M M M 1 1 i I M t I M M M H M M M M M M M M M M M M M M M I M I 
ml 65-1 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQI IKKDSEKGGVLQFGTEI VAHADGS 

370 380 390 400 410 420 

430 440 450 460 470 480 

a 1 65-1 . pep LAALLGASPGASTAVPLMIRLMHQCFPERTPSWEGRLKELVPGYGIKLNENPERADEIIA 

MMIIIilllll Illlllllllhllll IIIIIIMIIIIIIIIIIIIIMM 

ml 65-1 LAALLGASPGASTAVPLMIPJiMHQCFPERAPSWEDRLKELVPGYGIKLNENPERADEIIA 

430 440 450 4G0 470 480 



489 

al65-l.pep YTAKVLDIX 
M i M M M 
ml 65-1 YTAKVLDIX 



al65-l/p33940 

sp|P33940|YOJH_ECOLI HYPOTHETICAL 60.2 KD PROTEIN IN ECO-ALKB INTERGENIC REGION 
>gi|1736851|gnl|PID|dl016718 (D90850) ORF_ID: o372#5; similar to [SwissProt Accession Number 
P33940] {Escherichia coli] >gi| 1788539 (AE000310) f548; This 548 aa ORF is 100 pet identical 
to 490 residues of YOJH_ECOLI SW: P33940 {492 aa) but contains 56 additional N-ter aa; 100 pet 
identical to GB: ECOHU49_33 

ACCESSION: U00008 (490 aa) but contains 58 aditional N-term resi... Length = 548 
Score = 458 bits (1167), Expect = e-128 

Identities = 233/490 (47%), Positives = 303/490 (61%), Gaps = 5/490 (1%) 

Query: 3 EATDVVLVGGGI MS ATLGVLLKELE PS WEI TLI ERLEDVALES SNAWNN AGTGHS ALCEL 62 

+ TDV+L+GGGIMSATLG L+ELEP W -HT++ERLE VA ESSN WNNAGTGHSAL EL 
Sbjct: 30 QETDVLL I G GG I MS AT LGT YLRELE PEWSMTMVERLE GVAOES SNGWNN AG TGHS ALMEL 89 

Query: 63 NYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLED-NSFINAVPHMSLVMNEDH 121 

NY P A+G I +A+ I E F +SRQFWA V G L SFIN VPHMS V ED+ 
Sbjct: 90 NYTPQNADG S I S I EKA VAI NE AFOI S RQFWAHQVERG VLRT PR S F I NTV PHMS FVWG EDN 14 9 

Query: 122 CSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDFG 181 

++L+ RY A + LF M +S D +1 +WAPL+M GRD Q VAA + GTDV++G 
Sbjct: 150 VNFLRARYAALQQSS LFRGMRYS EDHAQ I KEWAPL VMEGRDPQQKVAATRTEI GT DVN YG 209 

Query: 182 RLTRQMVKYLQGKG-VKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTXXXXXXXXXX 240 

+TRQ++ LQ K + + V +KR D W + AD +N Q 

Sbjct: 210 E I TRQL I ASLQKKSN FSLQLS S E VRALKRNDDNTWTVTVADLKNGT AQ- N I RAKFVFI GA 268 

Query: 241 XXXXXXXXQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 300 

Q+SGIPE K Y GFPV G F + NP+ H AKVYG+ASVGAPPMSVPH+ 
Sbjct: 269 GGAALKLLQESGIPEAKDYAGFPVGGQFLVSENPDVVNHHLAKVYGKASVGAPPMSVPHI 328 

Query: 301 DTRNVDGKRHI^FGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 360 

DTR +DGKR ++FGP+A F + FLK GSL DL S N+ PM+ G N L KYL+ 
Sbjct: 329 DTRVLDGKRWLFGPFATFSTKFLKNGSLWDLWSSTTTSNVMPMMHVGLDNFDLVKYLVS 388 

Query: 361 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVXXXXXX 420 

++ ++E4RF +L EYYP+A +DW L AGQRVQIIK+D+EKGGVL+ GTE+V 
Sbjct: 389 QVMLSEEDRFEALKEYYPQAKKEDWRLWQAGQRVQI I KRDAEKGGVLRLGTEWSDQQGT 448 



Query: 421 XXXXXXXXXXXXXXVPLMIRLMHQCFPER-- TPSWEGRLKELVPGYGIKLNENPERADEI 478 
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P+M+ L+ + F +R + P W+ LK +VP YG KLN + + 
Sbjct: 449 I AA L L G AS PG AS T AAP I M LN LL EKV FG DRVS S PQWQAT LKA I V PS YGRKLNG D V AAT ERE 508 

Query: <79 IAYTAKVLDI 4 88 

+ YT++VL + 
Sbjct: 509 LQYTSEVLGL 518 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 697>: 

g204 .seq 

1 atggcggcgg cggaaataaa acgccccctc gctgtcgatt tccagcacat 

51 agcgtccgtt ctgcacggcg gcatagccgc ttttgcctgc ctgatagggt 

101 tgcagggcgg aatgcgaaat caggtaatca gtcagtttgc cgccgtcttc 

151 ggcgatattg cccaccagtt tggcaaacaa ggtatggcac acgccgtttt 

201 ccgcccagcc cgaaggcgcg tcctttccgt cggtttccat acatctgccg 

251 acgacggctt ccaagtcgtt cggatgcttt ccggtcagcc ggacggcgtt 

301 ttgttccggc aagcctttaa tcggataact gatttgtttt ttgccgtcgt 

351 tggttttgcc ttcgctactt tgtcccaaag ccaaaccggc aatcgccgta 

4 01 ttgtcgatgt atttgacttt gaaaaccggt ttcggcgcgc tttgtgccgc 

451 attttgcggc tgttccgccg tattttcgga tttgccgcag gcggcaagca 

501 gcaggcagcc gcccaacacg gcaaaaggta ttttcagcat tccgcactcc 

551 tgatggtttc aaaatgccgt ctgaaatgcc gtctgaaacg tggcaggcgg 

601 aggttcggac ggcattgggt ttatttcaac gggcggatgc cgaccgcatc 

651 gcgtacttta tccaacaact cgcgcgcttc tttgcgcgct ttttgcgcgc 

701 ctgcctgcaa aatctcttcg atttgcgaag gattagaggt caatgcgttg 

751 tag 

This corresponds to the amino acid sequence <SEQ ID 698; ORF 204.ng>: 
g204 .pep 

1 MAAAEIKRPL AVDFQ HIASV LHGGIAAFAC LIG LQGGMRN QVISQFAAVF 

51 GDIAHQFGKQ GMAHAVFRPA RRRVLSVGFH TFADDGFQW GMLSGQPDGV 

101 LFRQAFNR IT DLFFAWGFA FATL SQSQTG NRRIVDVFDF ENRFRRALCR 

151 ILRLFRRIFG FAAGGKQQAA AQHGKRYFQH SALLMVSKCR LKCRLKRGRR 

201 RFGRHWVYFN GRMPTASRTL SNNSRASLRA FCAPACKISS ICEGLEVNAL 

251 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 699>: 

ra204 .seq 

1 ATGGCGGCGG CGGAAATAAA ACGCCCCTTC GCTGTCGATT TCCAGCACAT 

51 AGCGTCCGTT CTGCACGGCG GCATAGCCGC TTTTGCCTGC CTGATAGGGT 

101 TGCAGGGCGG CATGCGAAAC TAGGTAATCC GTCAGTTTGC CGCCGTCTTC 

151 GGCGATATTG CCCACCAGTT TGGCAAACAA GGTATGGCAC ACGCCGTTTT 

201 CTGCCCAACC TGCCGGACTG TCCTTATCAT CGGTTTCCAT ACATTTGCCG 

251 CTGACGGCTT CCAAGTCGCC GGGATGCTTG CCGATCAGTC GGATAACATT 

301 TTGTTCCGGC AAGCCTTTAA TCGGATAACT GATTTGTTTT TTGCCGTCGT 

351 TGGTTTTGCC TTCGCTGCTT TGTCCCAAAT CCAAACCGGC AATCGCCGTA 

401 TTGTCGATAT ATATGACTTT GAAAACCGGT TTCGGCGCGC TTTGTACCGC 

451 GTTTTGCGGC TGTACCGCCG TATTTwCGGA TTTGCCGCaC GGCaArGCAG 

501 CAGGCAGCCG CCCAATACGG CAAAArAwGT wTTCAGCATT CCACAyTCCT 

551 GATGGTTTCA AAATGCCGTC TGAAACGCGG CAGGCGGAGG TTCGGACGGC 

601 ATCGGGTTCA TTTCAACGGG CGGATGcCGA CCGCATCgGT ACTTTGTCCA 

651 ATAATTCGCG TGCTTCTTTA CGCGCTTTCG CCGCGCCTGC CTGCAAAATC 

701 TCTTCGATTT GCGAAGGGTC GGCGGTCAGC TCGTTGTAG 

This corresponds to the amino acid sequence <SEQ ID 700; ORF 204>: 

m204 .pep 

1 MAAAEIKRPF AVDF QHIASV LHGGIAAFAC LIG LQGGMRN *VIRQFAAVF 

51 GDIAHQFGKQ GMAHAVFCPT CRTVLIIGFH TFAADGFQVA GMLADQSDNI 

101 LFRQAFN RIT DLFFAWGFA FAAL SQIQTG NRRIVDIYDF ENRFRRALYR 

151 VLRLYRRIXG FAATAXQQAA AQYGKXXXQH STXLMVSKCR LKRGRRRFGR 

201 HRVHFNGRMP TASGTLSNNS RAS LRAFAAP ACKISSICEG SAVSSL* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from K gonorrhoeae 

ORF 204 shows 82.0% identity over a 250 aa overlap with a predicted ORF (ORF 204.ng) 
from AT. gonorrhoeae: 

m204/g204 

10 20 30 40 50 60 

m204 . pep MAAAEI KRPFAVDFQHIASVLHGGIAAFACLI GLQGGMRNXVI RQFAAVFGDI AHQFGKQ 

IMMMI MINI III II hill III MINIMI III II IIIIIIMINIIIM 

g2 0 4 MAAAEI KRPLAVDFQHI ASVLHGGIAAFACLI GLQGGMRNQVI SQFAAVFGD I AHQFGKQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 04 . pep GMAHAVFCPTCRTVLI I GFHTFAADG FQVAGMLADQSDN I LFRQAFNR I TDLF FAWG FA 

I II II 1 1 h I II MM Ml MM hi lh I hM I MUM II MM Mill I 

g2 0 4 GMAHAV FRPARRRVLS VGFHT FAEDG FQWGML SGQ PDGVLFRQAFNRI TDLFFAWGFA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 04 . pep FAALSQ I QTGNRRI VDI YDFENRFRRALYRVLRLYRRIXGFAATAXQQAAAQYGKXXXQH 

I hi II I MM M!h: II I II II Ml hill MM MM : II II lh II II 

g2 04 FATLSQSQTGNRRIVDVFDPENRFRRALCRILRLFRRIFGFAAGGKQQAAAQHGKRYFQH 
130 140 150 160 170 180 

190 200 210 220 230 

m2 0 4 . pep STXLMVSKCRLK RGRRRFGRHRVHFNGRM PTASGTL SNNSRAS LRAFAAP AC KISS 

h IIIIIMM IMIIIIII h ill: II MIIIMIIIIM llllllll 

g2 04 SALLMVSKCRLKCRLKRGRRRFGRHWVYFNGRMPTASRTLSNNSRASLRAFCAPACKISS 
190 200 210 220 230 240 



m2 04 . pep 
g204 



240 
ICEGSAVSSLX 

I I I I hM 
ICEGLEVNAL 
250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 701 >: 

a204.seq 

1 ATGGCGGCGG CGGAAATAAA ACGCCCCCTC GCTGTCGATT TCCAGCACAT 

51 AGCGTCCGTT CTGCACGGCG GCATAGCCGC TTTTGCCTGC 3TGATAGGGT 

101 TGCAGGGCGG AATGCGAAAT CAGGTAATCC GTCAGTTTGC CGCCGTCTTC 

151 GGCGATATTG CCCACCAGTT TGGCAAACAA GGTATGGCAC ACGCCGTTTG 

201 CCGCCCAGCC CGAAGGCGCG CCCTTTCCGT CGGTTTCCAT ACATTTGCCG 

251 ACGACGGCTT CCAAGTCGTT GGGATGCTTG CCGGTCAGCC GGACGACGTT 

301 TTGTTCCGGC AAGCCTTT 

351 

401 

451 ; 

501 

551 AAGAG GTTCGGACGG 

601 CATTGGGTTT ATTTCAACGG GCGGATACCG ACCGCATCAC GTACTTTGCC 

651 CAATAATTCG CGTGCTTCTT TACGCGCTTT TTGCGCGCCT GCCTGCAAAA 

701 TCTCTTCGAT TTGCGAAGGG TCGGCGGTCA GCTCGTTGTA G 

This corresponds to the amino acid sequence <SEQ ID 702; ORF 204.a>: 

a204.pop 

1 MAAAEIKRPL AVDFQH IASV LHGGIAAFAC LIG LOGGMRN QVIRQFAAVF 

51 GDIAHQFGKQ GMAHAVCRPA RRRALSVGFH TFADDGFQW GMLAGQPDDV 

101 LFRQAF 

151 KRFGR 

201 HWVYFNGRIP TASRTLPNNS RASLRAFCAP ACKISSICEG 5AVSSL* 
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m204/a204 54.5% identity in 246 aa overlap 

10 20 30 



40 



50 



60 



m204 .pep MAAAEIKRPFAVDFQHIASVLHGGIAflFACLIGLQGGMRNXVIRQFAAVFGDIAHQFGKQ 
I I I I I I I I I: I i I I I I I I I 1 I I I I I I I I I I ! I I I I I I I I I { I i i ( I I I I I I I I I I I i I I 
a204 MAAAEIKRPLAVDFQHIASVLHGGIAAFACLIGLQGGMRNQVIRQFAAVFGDIAHQFGKQ 
10 20 30 40 50 60 

70 80 90 100 110 120 

m204 .pep GMAHAVFCPTCRTVLI IGFHTFAADGFQVAGMLADQSDN I LFRQAFNRI TDLFFAVVGFA 

I I I I I I I: I :l rlllltl 11111 = 1111 I |::IIUII 
a 2 0 4 GMAIIAVCRPARRRALSVGFHTFADDGFQVVGMLAGQPDDVLFRQAF 

70 BO 90 100 



m204 .pep 



130 140 150 160 170 180 

F AALSQIQTGNRRI VD I YD FEN RFRRAL YRV LRL Y RR I XGFAAT AXQQAAAQ YGKXXXQH 



a204 



190 200 210 220 230 240 

m204 .pep STXLMVSKCRLKRGRRRFGRHRVHFNGRMPTASGTLSNNSRASLRAFAAPACKISSICEG 

:IMII hllllMIM II I t i 1 I I I J 1 I I 1 1 t I I I I I t M 

a204 KRFGRHWVYFNGRIPTASRTLPNNSRASLRAFCAPACKISSICEG 

110 120 130 140 150 



m204 .pep 
a204 



SAVSSLX 

iimi ii 

SAVSSLX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 703>: 

g205. seq 

1 atgctgaaaa taccttttgc cgtgttgggc ggctgcctgc tgcttgccgc 

51 ctgcggcaaa tccgaaaata cggcggaaca gccgcaaaat gcggcacaaa 

101 gcgcgccgaa accggttttc aaagtcaaat acatcgacaa tacggcgatt 

151 gccggtttgg ctttgggaca aagtagcgaa ggcaaaacca acgacggcaa 

201 aaaacaaatc agttatccga ttaaaggctt gccggaacaa aacgccgtcc 

251 ggctgaccgg aaagcatccc aacgacttgg aagccgtcgt cggcaaatgt 

301 atggaaaccg acggaaagga cgcgccttcg ggctgggcgg aaaacggcgt 

351 gtgccatacc ttgtttgcca aactggtggg caatatcgcc gaagacggcg 

401 gcaaactgac tgattacctg atttcgcatt ccgccctgca accctatcag 

451 gcaggcaaaa gcggctatgc cgccgtgcag aacggacgct atgtgctgga 

501 aatcgacagc gagggggcgt tttatttccg ccgccgccat tattga 
This corresponds to the amino acid sequence <SEQ ID 704; ORF 205 .ng>: 

g205 .pep 

1 MLKIPFAVLG GCZjLLA ACGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 

51 AGLALGQSSE GKTNDGKKQI SYPIKGLPEQ NAVRLTGKHP NDLEAWGKC 

101 METDGKDAPS GWAENGVCHT LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ 

151 AGKSGY AAVQ NGRYVLEIDS BGAFYFRRRH Y 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 705>: 

m205 . seq 

1 ATGCTGAAwA CwTyTTTTGC CGTATTGGGC GGCTGCCTGC TGCyTtGCCG 

51 tGCGGCAAAT CCGwAAATAC GGCGGTACAG CCGCAAAACG CGGTACAAAG 

101 CGCGCCGAAA CCGGTTTTCA AAGTCATATA TATCGACAAT ACGGCGATTG 

151 CCGGTTTGGA TTTGGGACAA AGCAGCGAAG GCAAAACCAA CGACGGCAAA 

201 AAACAAATCA GTTATCCGAT TAAAGGCTTG CCGGAACAAA ATGTTATCCG 

251 ACTGATCGGC AAGCATCCCG GCGACTTGGA AGCCGTCAGC GGCAAATGTA 

301 TGGAAACCGA TGATAAGGAC AGTCCGGCAG GTTGGGCAGA AAACGGCGTG 

3 51 TGCCATACCT TGTTTGCCAA ACTGGTGGGC AATATCGCCG AAGACGGCGG 

4 01 CAAACTGACG GATTACCTAG TTTCGCATGC CGCCCTGCAA CCCTATCAGG 

4 51 CAGGCAAAAG CGGCTATGCC GCCGTGCAGA ACGGACGCTA TGTGCTGGAA 

5 01 ATCGACAGCG AAGGGGCGTT TTATTTCCGC CGCCGCCATT ATTGA 
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This corresponds to the amino acid sequence <SEQ ID 706; ORF 205>: 
m205.pep 

1 MLXTX F AVIiG GCLLX CRCGK SXNTAVQPQN AVQSAPKPVF KVIYIDNTAI 
51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKHP GDLEAVSGKC 
101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 
151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y+ 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 205 shows 88.4% identity over a 181 aa overlap with a predicted ORF (ORF 205.ng) 
from N. gonorrhoeae: 

m205/g205 

10 20 30 40 50 60 

MLX TXFAVLGGCLLXCRCGKSXNTAVQ PQNAVQ SAP KP V F KV I Y I DNTAI AGLDLGQS S E 

it i ii i iii ii 1 1 ii iii iMihiiiiimii ii ii i ii mi ii nil 

MLKIPFAVLGGCLLLAACGKSEhrTAEQPQNAAQSAPKPVFKVKYIDNTAIAGLALGQSSE 
10 20 30 40 50 60 

70 80 90 100 110 120 

GKTNDGKKQ I S YP I KGLPEQNVI RLI GKHPGDLEAVSGKCMETDDKDS PAGWAENGVCHT 

IIIMIIIIMIIIMIIIII-II lllhlllll lllllll Ihhllllllllll 

GKTNDGKKQ I S YP I KGLPEQNAVRLTGKH PNDLE AWG KCMETDGKDAPSGWAENG VCHT 
70 80 90 100 110 120 

130 140 150 160 170 180 

LFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNGRYVLEIDSEGAFYFRRRH 

III IIIIMII III MllhlhllM III III II lllllll lllllll Ml III Mil 

LFAKLVGNI AEDGGKLTDYLI SHSALQP YQAGKSGYAAVQNGRYVLEI DSEGAF YFRRRH 
130 140 150 160 170 180 



m205 .pep 
g205 

m205 -pep 
g205 

m205 -pep 
g205 



m2 05.pep YX 
I 

g205 Y 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 707>: 

a205.aeq (partial) 

1 TCCGAACCTC TTAAAGGCTT GCCGGAACAA AACGTCGTCC GGCTGACCGG 

51 CAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT ATGGAAACCG 

101 ACGGAAAGGG CGCGCCTTCG GGCTGGGCGG CAAACGGCGT GTGCCATACC 

151 TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG GCAAACTGAC 

201 GGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG GCAGGCAAAA 

251 GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGC?GGA AATCGACAGC 

301 GAGGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 708; ORF 205. a>: 

a205.pep (partial) 

1 SEPLKGLPEQ NVVRLTGKHP NDLEAVVGKC METDGKGAPS GWAANGVCHT 

51 LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ AGKSGYAAVQ NGRYVLEIDS 

101 EGAFYFRRRH Y* 

m205/a205 88.3% identity in 1 1 1 aa overlap 

50 60 70 80 90 100 

m205 . pep KVIYIDNTAIAGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKC 

i I ' I I ; I II I I : I I I II I : I I I I I Mi 
a 2 0 5 SE PLKGLPEQNWRLTGKHPNDLEAWGKC 

10 20 30 



110 120 130 140 150 160 

m205 . pep METDDKDSPAGWAENGVCHTLFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQ 
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I I I I I : I : I I I I I I I I I I I ! I t I I I t I I I f I I I II I i : I I : I I I I I I I I I I I I I I I I 
a205 METDGKGAPSGWAANGVCHTLFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQ 
40 50 60 70 80 90 



170 180 
m2 0 5 . pep NGRYVT.E I DSEGAFYFRRRHYX 

I I I I I I I I I I II II I I I I I I I I 
a2 0 5 NGRYVLE I DSEGAFYFRRRHYX 

100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 709>: 

g205-l.seq (partial) 

1 ATGCTGAAAA TAcCTTTTGC CGTGTTGGGC GGCTGCCTGC TGCTTGCCGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGGAACA GCCGCAAAAT GCGGCACAAA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT ACATCGACAA TACGGCGATT 

151 GCCGGTTTGG CTTTGGGACA AAGTAGCGAA GGCAAAACCA ACGACGGCAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT GCCGGAACAA AACGCCGTCC 

251 GGCTGACCGG AAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT 

301 ATGGAAACCG ACGGAAAGGA CGCGCCTTCG GGCTGGGCGG AAAACGGCGT 

351 GTGCCATACC TTGTTTGCCA AACTGGTGGG ♦CAATATCGCC GAAGACGGCG 

401 GCAAACTGAC TGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG 

451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA 

501 AATCGACAGC GAGGGGGCGT TTTA 



This corresponds to the amino acid sequence <SEQ ID 710; ORF 205-1 .ng>: 

g205-l.pep (partial). 

1 MLKI PPAVLG GCLLIAA CGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 
51 AGLALGQSSE GKTNDGKKQI SYPIKGLPBQ NAVRLTGKHP NDLEAWGKC 
101 METDGKDAPS GWAENGVCHT LFAKLVGNIA SDGGKLTDYL ISHSALQPYQ 
1S1 AGKSGYAAVQ NGRYVLEIDS EGAF 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 71 1>: 

m205-l.seq. . 

1 ATGCTGAAAA CATCTTTTGC CGTATTGGGC GGCTGCCTGC TGCTTGCCGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGGAACA GCCGCAAAAC GCGGTACAAA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT ATATCGACAA TACGGCGATT 

151 GCCGGTTTGG ATTTGGGACA AAGCAGCGAA GGCAAAACCA ACGACGGCAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT GCCGGAACAA AATGTTATCC 

251 GACTGATCGG CAAGCATCCC GGCGACTTGG AAGCCGTCAG CGGCAAATGT 

301 ATGGAAACCG ATGATAAGGA CAGTCCGGCA GGTTGGGCAG AAAACGGCGT 

351 GTGCCATACC TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG 

401 GCAAACTGAC GGATTACCTA GTTTCGCATG CCGCCCTGCA ACCCTATCAG 

451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA 

501 AATCGACAGC GAAGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 



This corresponds to the amino acid sequence <SEQ ID 712; ORF 205-l>: 

xo205-l. pep 

1 MLKTSFAVLG GCLLLAA CGK SENTAEQPQN AVQSAPKPVF KVKYIDNTAI 
51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPBQ NVIRLIGKHP GDLEAVSGKC 
101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 
151 AGKSGYAAVQ NGRYVL.EIDS EGAFYFRRRH Y* 



m205-l/g205-l 92.0% identity in 1"4 aa overlap 



10 20 30 40 50 60 

g205- 1 . pep MLJCIPFAVLGGCLLIAACGKSENTAEQPQNAAQSAPKPVFKVKYIDNTAIAGIiAIiGQSSE 

III 1 1 1 1 1 ! I! 1 1 1 1 1 1 !! 1 1 1 1 1 1 1 1 1 1 - II 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 II II II 

m2 0 5 - 1 MLKTS FAVLGGtHJ^IJUVCGKSENTAEQPQNAVQS APK P VFKVKY I DNTA LAG LD LGQS S E 

10 20 30 40 50 60 



70 80 90 100 110 120 

g205 - 1 . pep GKTNDGKJ<QISYPIKGLPEQJIAVRLTGKHPNDLEAVVGKCMETDGKDAPSGWAENGVCHT 

IIIIIIIIIIIIIIIIIIM|::|I lllhlllll M M Ihhllllllllll 
m205- 1 GKTNDGKKQI SYPIKGLPEQNVIRliIGKHPGDLEAVSGKCMErrDDKDSPAGWAENGVCHT 
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70 80 90 100 110 120 

130 140 150 160 170 

g205 -1 . pep LFAKLVGN IAEDGGKLTDYLI SHS ALQP YQAGKSG YAAVQNGR YVLE ID S EGAF 

I llllllll Ml Ml II MMhM MMIIIMMI II III II lllllllll 
m2 0 5 - 1 LFAKLVGNI AEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNGRYVLE IDSEGAFYFRRRH 

130 140 150 160 170 180 

m205-l YX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 713>: 

a205-l.seq (partial) 

1 CCTCTTAAAG GCTTGCCGGA ACAAAACGTC GTCCGGCTGA CCGGCAAGCA 

51 TCCCAACGAC TTGGAAGCCG TCGTCGG CAA ATGTATGGAA ACCGACGGAA 

101 AGGGCGCGCC TTCGGGCTGG GCGGCAAACG GCGTGTGCCA TACCTTGTTT 

151 GCCAAACTGG TGGGCAATAT CGCCGAAGAC GGCGGCAAAC TGACGGATTA 

201 CCTGATTTCG CATTCCGCCC TGCAACCCTA TCAGGCAGGC AAAAGCGGCT 

2 51 ATGCCGCCGT GCAGAACGGA CGCTATGTGC TGGAAATCGA CAGCGAGGGG 

301 GCGTTTTATT TCCGCCGCCG CCATTATTGA 

This corresponds to the amino acid sequence <SEQ ID 714; ORF 205-1. a>: 

a205-l.pep (partial) 

1 PLKGLPBQNV VRLTGKHPND LEAWGKCME TDGKGAPSGW AANGVCHTLF 
51 AKLVGNIABD GGKLTDYLIS HSALQPYQAG KSGYAAVQNG RYVLEIDSEG 
101 AFYFRRRHY* 

xn205-l/a205-l 89.0% identity in 109 aa overlap 

50 60 70 80 90 100 

m205 -1 . pep KYIDNTAIAGLDIGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKCME 

MMMMMMI MINIMI MM! 

a20 5 -1 PLKGLPEQNWRLTGKHPNDLEAWGKCME 

10 20 30 

110 120 130 140 150 160 

m2 0 5 - 1 . pep TDDKDS P AGWAENG VCHTLFAKLVGN I AEDGGKLTD YL VSHAALQ PYQAG KSG YAAVQNG 

\\ I MMM M M M M M M 1 1 1 M M M M M h M M I M M M M M M M M 

a205-l TDGKGAPSGWAANGVCHTLFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNG 
40 50 60 70 80 90 

170 180 
m20 5 -1 . pep RYVLEIDSEGAFYFRRRHYX 

I I I I I I I I I I I > I I I I I I I I 
a205-l RYVLEIDSEGAFYFRRRHYX 
100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 715>: 

ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 
ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 
gcaaatccaa gccgtccgca tcagccacat cggccgcaca 
aggaactcat gctccacagc ctcggactca tcggcacgcc 
ggcggcagca gcaccgcaac cggcttcgac tgcagcggca 
ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 
tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 
atcgtattct tcaacaccgg cggcgcacac cgctactcac 
ctacatcggc aacggcgaat tcatccatgc ccccggcagc 
tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 
gcgcacacgt tttttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 716; ORF 206.ng>: 

g206 -pep 

1 MFSPPKTLFL CLGALLLASC GTT5G KHRQP KPKQTVRQIQ AVRISHIGRT 



g206 . seq 




1 


atgtttcccc 


51 


cgcctcatgc 


101 


agacagtccg 


151 


caaggctcgc 


201 


ctacaaatgg 


251 


tgattcaatt 


301 


gcccgcgaca 


351 


ggccggcgac 


401 


acgccggact 


451 


ggcaaaacca 


501 


ctacctcgga 
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51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 
101 ARDMAAASRK I PDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEF1HAPGS 
151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 717>: 

m206 . seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGOTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA 

3 51 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 
4 51 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 
501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 71 8; ORF 206>: 

m2 0 6 . pep - . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 
51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 
151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from JV. gonorrhoeae 

ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from N. gonorrhoeae: 

m206/g206 

10 20 30 40 50 60 

MFPPDKTLFLCLSALLLAS CGTTSGKHRQ PKPKQTVRQ I QAVRI SHI DRTQGSQELMLHS 

ii iiiiiiiihiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiii iiiimimi 

MFSPDKTLETjCLGALLIjAS CGTTSGKHRQ PKPKQTVRQ I QAVR I SH I GRTQGSQELMLHS 
10 20 30 40 50 60 

70 80 90 10O 110 120 

LGLIGTPYKWGGSSTATGFECSGMI QFVYKNALNVKLP RTARDMAAASR KI PDSRXKAGD 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mil 

LG L I GT P Y KWGGS S TATG FDCS GMI QLVYKNALNVKLP RTARDMAAAS RKI PDS RLKAGD 
70 80 90 100 110 120 

130 140 150 160 170 

LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAICNYLGAHTFFTEX 

M II II 1 1 III II I III II II II INI hi III I II II IMII III 111 I Nlll II 

IVFFOTGGAHRYSHVGLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE 
130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 719>: 

a206.seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 



m206 .pep 
9206 

m206 .pep 
g206 

m206 -pep 
g206 
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This corresponds to the amino acid sequence <SEQ ID 720; ORF 206.a>: 

a206. pep 

1 MFPPD KTLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 99.4% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m206.pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 
1 1 1 1 I I I I I I I I I I I I I I II I I 1 1 I I 1 1 I 1 1 1 I I I I I II I I I I I I I 1 1 I I I i I M I II I I 
a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 
10 20 30 40 50 60 

70 80 90 100 110 120 

m20 6.pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 
I I I II I I I i I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I 
a206 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

m206.pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 : 1 I I I I I I I I I I I I I I M I I I 

a206 L V FFNT GGAH R Y S H VG L Y IGNGEFIHAPSSGKTI KTEKL S T P F Y AKN YLGAHT F FT EX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 721>: 

g209. seq 

1 atgctgcggc atttaggaaa cgacttcgcc ttgggcgcgt tgtttttcga 

51 tgctgcggtt gatgtgccac tgctgggcga tggtcaggag gttgttgacc 

101 acccagtaga gaaccaaacc ggcagggaag aagaagaaca tgacggagaa 

151 aaccaacggc atgattttca tcattttcgc ctgcatcggg tcggtcggcg 

201 gcgggttcag ataggtttgg gcgaacatcg ttgccgccat aatgatgggc 

251 aggatgtagt aggggtcggc gcggctgagg tcggtaatcc agcccagcca 

301 aggtgcctgg cgcaattcta cggaggcgaa caatgcccag tacaagccga 

351 tgaagacggg gatttgcaac agcataggca gacagccgcc cagcgggttg 

401 atttcctcgt cttcgaaaag ctgcatcatc gcttgctgtt gcgccatacg 

451 gtcgtcgccg tatttttctt tgatggtctg cagttcgggt gcggcggcac 

501 gcattttcgc catcgaacgg taggaggcgt tggtcaatgg atacagtacg 

551 gctttgacga tgatggtcaa aacgacgatt gcccagcccc agttgccgat 

601 aatgttgtgc agttggttca ggagccagaa gagcggcgat gcgaaccagt 

651 gtactttacc gtagtctttt gccagttgca ggttgtcggc gatgtttgcg 

701 ataacggatg tggtttgcgg accggcatac aggttgaccg ccattttcgg 

751 ttttggcccc cgggttggga tagcggttaa 

This corresponds to the amino acid sequence <SEQ ID 722; ORF 209.ng>: 
g209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLLG DGQE WDHPVENQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPAQP 

101 RCLAQFYGGE QCPVQADEDG DLQQHRQTAA QRVDFLVFEK LHHRLLLRHT 

151 VVAVFFFDGL QFGCGGTHFR HRTVGGVGQW IQYGFDDDGQ NDDCPAPVAJD 

201 NWQLVQEPE ERRCEPVYFT VVFCQLQWG DVCDNGCGLR TGIQVDRHFR 

251 FWPPGWDSG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 723>: 

m209. seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGgGCGTT GTTTTTCGAT 

51 GCTGCGGTTG ATGTGCCATT GCTGGGCGAT GGTCAGGAGG TTGTTGACTA 

101 CCCAGTACAA TACCAGAC CG GCAGGGAAGA AGAAGAACAT GACGGAGAAA 

151 ACCAACGGCA TGATTTTCAT CATTTTCGCC TGCATCGCGT CGGTCGGCGG 

201 CGGGTTCAGA TAAGTTTGGG CGAACATCGT TGCCGCCATA ATGATGGGCA 

251 GGATGTAGTA GGGGTCGGCG CGGCTGAGGT CGGTAATCCA ACCCAGCCAA 
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301 GGTGCCTGGC GCAATTCTAC GGAGGCGAAC AATGCCCAAT ACAATCCGAT 

351 GAAGACGGGG ATTTGCAACA GCATAGGCAG GCAGCCGCCC AGCGGGTTGA 

401 TTTTCTCGTC TGTGTAAAGC TGCATCATCG CCTGTTGTTG CGCCATACGG 

451 TCGTCGCCGT ATTTCTCTTT GATGGCTTGC AGTTTGGGTG CGGCGGCACG 

501 CATTTTCGCC ATAGAGCGGT AAGAGGCGTT GGTCAATGGA TACAGTACGG 

551 CTTTGACGAT GATGGTTAAA ACGATAATCG CCCAGCCCCA GTTGCCGATG 

601 ATGTTGTGCA GTTGGTTCAG GAGCCAGAAG AGCGGGGAGG CGAACCAGTG 

651 TACTTTGCCG TAGTCTTTGG CCAGTTGCAG GTTGTCGGCG ATGTTTGCGA 

701 TGACGGATGT GGTCTGCGGG CCGGCGTAGA GGTTGATGGA GGCTTCGgTT 

751 TCGCGCCGTT TTGGATGGCG GCTAAAGGCA CGCTGACGCT GGTGCTGTAC 

801 AGCTTGTCGT TGCGGCGTTT GATGTCGATG TTGCACTCGC CTGCGGCGCA 

851 AACGCTTTGT CTGCCTTTAG GTTGGAGAAT CCAGGTGGAC ATGAAGTGGT 

901 GTTCAATCAT GCCGAGCCAG CCGGTCGGGG TTTTGCGGAT GTATTCGGCC 

951 TCGGATTTGC CGGATTTGGC ATCGTCGTCC AAGTCGGAAA AGCTGACTTT 

1O01 TTGGAAGTTG CCTTCAGGGG TATAA 

This corresponds to the amino acid sequence <SEQ ID 724; ORF 209>: 

m209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE WDYPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ ISLGEHRCRH NDGQDVVGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHRLLLRHT 

151 WAVFLFDGL QFGCGGTHFR HRAVRGVGQW IQYGFDCDG* NDNRPAPVAD 

201 DWQLVQEPE ERGGEPVYFA WFGQLQWG DVCDDGCGLR AGVEVDGGFG 

251 FAPFWMAAKG TLTLVLYSLS LRRLMSMLHS PAAQTLCLPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 209 shows 88.5% identity over a 253 aa overlap with a predicted ORF (ORF 209.ng) 
from N. gonorrhoeae: 

m209/g209 

10 20 30 40 50 60 

m2 0 9 . pep MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDYPVQYQTGREEEEHDGENQRHDFHHFR 
I I I I I I I II f II I I I I II I I I I II I 1 II I I I ( I : I I : I I I I I I I I I I I 1 I I I I I I I I I I 
g209 MLRHLGNDFALGALFFDAAVDVPLLGDGQEVVDHPVENQTGREEEEHDGENQRHDFHHFR 
10 20 30 40 50 60 

70 80 90 100 110 120 

m2 0 9 . pep LHRVGRRRVQI SLGEHRCRHNDGQDWGVGAAEVGN PTQPRCLAQFYGGEQC P IQS DE DG 

I I I I I I I I I I I : I I I I I I II I I I I I II I I I M I I I I I : M II I I i I I I I II I I: I: I M I 
g2 0 9 LHRVGRRRVQ IGLGEHRCRHN DGQ DWGVGAAE VGN PAQ PRC LAQFYGGEQC P VQADE DG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m209.pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHTWAVFLFDGLQFGCGGTHFRHRAVRGVGQW 
I I II I I I - I I I ! I I I I I I I I I I I I I I I I I I I I I : | M i M I I I I I I I I I I : I Mill 
g2 0 9 DLQQHRQTAAQRVDFLVFEKLHHRLLLRHT WAVFFFDGLQFGCGGTHFRHRTVGGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209.pep IQYGFDDDGXNDNRPAPVADDWQLVQEPEERGGEPVYFAVVFGQLQWGDVCDDGCGLR 
I II I I I I I I II: I II I I |: I I I I I I ! I I I I t I I I I : I I I I I I II I I M I : I I II I 
g209 IQYGFDDDGQNDDCPAPVADNWQLVQEPEERRCEPVYFTVVFCQLQWGDVCDNGCGLR 
190 200 210 220 230 240 

250 260 270 280 290 299 

m2 0 9 . pep AGVEVDGGFG F- APFWMAAKGTLTLVL YS LS LRRLMSMLHS PAAQTLCLPLG WRI QVDMK 

: I : : I I II II 
g2 0 9 TGIQVDRHFRFWPPGWDSG 

250 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 725>: 

a209.seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGGCGCGT TGTTTTTCGA 
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51 TGCTGCGGTT GATGTGCCAT TGCTGGGCGA 

X01 ACCCAGTACA ATACCAGACC GGCAGGGAAG 

151 AACCAAAGGC ATGATTTTCA TCATTTTCGC 

201 GCGGGTTCAG ATAGGTTTGG GCGAACATCG 

251 AGGATGTAGT AGGGGTCGGC GCGGCTGAGG 

301 AGGTGCCTGG CGCAATTCTA CGGAGGCGAA 

351 TGAAGACGGG GATTTGCAAC AGCATAGGCA 

401 ATTTTCTCGT CTGTGTAAAG CTGCATCATG 

451 GTCGTCGCCG TATTTCTCTT TGATGGCTTG 

501 GCATTTTCGC CATCGAACGG TAAGAGGCGT 

551 GCTTTGACGA TGATGGTTAA AACGATAATC 

601 GATGTTGTGC AGTTGGTTCA AAAGCCAAAA 

651 GTACTTTGCC GTAGTCTTTG GCCAGTTGCA 

701 ATAACGGATG TGGTCTGTGG GCCGGCGTAG 

751 TTCGCACCGT TTTGGATAGC GGCTAAAGGC 

801 CAGCTTGTCG TTGCGGCGTT TGATGTCGAT 

851 AAACGCTTTG TCCGCCTTTG GGTTGGAGGA 

901 TGTTCAATCA TGCCGAGCCA GCCGGTCGGG 

951 CTCGGATTTG CCGGATTTGG CATCGTCGTC 

1001 TTTGGAAGTT GCCTTCAGGG GTATAA 

This corresponds to the amino acid sequence <SEQ ID 726; ORF 209.a>: 

a209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE VVDHPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHGLLLRHT 

151 WAVFLFDGL QFGRGGTHFR HRTVRGVGQW IQYGFDDDG* NDNRPAPVAD 

201 DWQLVQKPK EGGGEPVYFA WFGQLQWG DVCDNGCGLW AGVEVDGGFG 

251 FAPFWIAAKG TLTLVLYSLS LRRLMSIRQS PAAQTLCPPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

m209/a209 95.6% identity in 341 aa overlap 

10 20 30 40 50 60 

m209.pep M LRHLGND FALGALFFDAAVDV PL LGDGQEWDYPVQYQT GREEEEHDGE NQRHDFHHFR 

IIIIIIMM III IMtll llllllll III 111:1111111 IIMI III IIMIIIIIII 
a209 MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDHPVQYQTGREEEEHDGENQRHDFHHFR 
10 20 30 40 50 60 

70 80 90 100 110 120 

m209.pep L HRVGRRRVQ ISLGEHRCRHNDGQ D WG VG AAEVGN PT Q PRC L AQFYGGEQ C P I Q S DE DG 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I 
a209 LHRVGRRRVQIGLGEHRCRHNDGQDWGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m209.pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHTVVAVFLFDGLQFGCGGTHFRHRAVRGVGQW 
I M I I I I I I I I I I I M I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I 1 I I I : I I I II I I 
a209 DLQQHRQAAAQRVDFLVCVKLHHGLLLRHTVVAVFLFDGLQFGRGGTHFRHRTVRGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209 . pep I QYGFDDDGXN DNRPAPVADDWQLVQE PEERGGE PVYFAWFGQLQWGDVCDDGCGLR 

I I I I I I I I I I I I I I I I I I I I I! I I I I I : I : I I I I I I I I I I I I I II I I I I I I I I : I I I I 
a209 IQYGFDDDGXNDNRPAPVADDVVQLVQKPKEGGGEPVYFAVVFGQLQVVGDVCDNGCGLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

m209.pep AGVE V DGG FG FAP FWMAAKGT LTL VL YS L S LRRLM SMLH S P AAQT LC LPLGWR I QV DMKW 

I I I I I I I I I I I II I I : I I I I I i I I I I I I I I I I I I I I : : I I I I I 1 1 I I II I I I II I I I I 
a209 AGVEVDGGFGFAPFWIAAKGTLTLVLYSLSLRRLMSIRQSPAAOTLCPPLGWRIQVDMKW 

250 260 270 280 290 300 



TGGTCAGGAG 
AAGAAGAACA 
CTGCATCGGG 
TTGCCGCCAT 
TCGGTAATCC 
CAATGCCCAA 
GGCAGCC3CC 
GCTTGTTGCT 
CAGTTTGGGC 
TGGTCAATGG 
GCCCAGCCCC 
GAGGGGGGAG 
GGTTGTCGGC 
AGGTTGATGG 
ACGCTGACGC 
ACGGCAGTCG 
TCCAGGTGGA 
GTTTTGCGGA 
CAAGTCGGAG 



GTTGTTGATC 
TGACGGAGAA 
TCGGTCGGCG 
AATGATGGGC 
AACCCAGCCA 
TACAATCCGA 
CAGCGGGTTG 
GCGCCATACG 
GCGGCGGCAC 
ATACAGTACG 
AGTTGCCGAT 
GCGAACCAGT 
GATGTTTGCG 
AGGCTTCGGT 
TGGTGCTGTA 
CCAGCGGCGC 
CATGAAGTGG 
TGTATTCGGC 
AAGCTGACTT 



m209.pep 



310 320 330 340 

CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
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I M I I 1 1 I I I I I II 1 i I I 1 1 I 1 1 M 1 I 1 t I I I I I I I I I 1 1 1 1 
a209 CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 727>: 

g211 .seq 

1 atgttgcgga ttgctgctgc caatcagttg ggcggtcgaa atggtgcggc 

51 ggtgggaaac ggggtcgata agtttgggcg tggtgctgat aatcaggctg 

101 agtttttgga aggaaacctg attgtagtcg gcgcgtccgg gcgtgccgct 

151 gtaacggtag ccgtggcgca attcgagcgt gcgtttgttg tccttcagcg 

2 01 agaagttacc ttctttggcg aagatgatgt tgtcgccgcc gtttttgtcc 
251 tgttcgcgca ggaacaggtt tttcatgatg ccggattcgg tgtcaaaggt 

3 01 ttcgacgaaa taaaccctgc cgttgcgctt gcccaagtta ttgaactcgc 

3 51 cggcttccac caaagacaat tcctgcttct gcttcaaaat ttcggcatat 

4 01 tcgcggctgc gcagctctgc ccacggtatc acccaaagct gcatgacggc 
4 51 aatcaggatg gcaaacggca cggcaaactg catgacgggg cgtatccact 
501 gtttcaacgc caatccgcag gatag 

This corresponds to the amino acid sequence <SEQ ID 728; ORF 21 l.ng>: 

g211 .pep 

1 MLRIAAANQL GGRNGAAVGN GVDKFGRGAD NQVEFLEGN L IWGASGRAA 

51 VTVAVA QFER AFWLQRE VT FFGEDDWAA VFVLFA QEQV FHDAGFGVKG 

101 FDEINPAVAL AQV IELAGFH QRQFLLLLQN FGIFAAAQLC PRYHPKLHDG 

151 NQDGKRHGKL HDGAYPLFQR QSAG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 729>: 

m211.seq 

1 ATGTTGCGGG TTGCTGCTGC CAATCAGTTG GG CGGTCGG A ATGGTACGGC 

51 GGTGGGAAAC GGGGTCGATG AGTTTGGGCG TGGTGCTGAT AATCAGGTTG 

101 AGTTTTTGGA AGGAAACCTG ATTGTAGTCG GCGCGTCCGG GCGTGCCGCT 

151 GTAACGGTAG CCGTGGCGCA ATTCGAGCGT GCGTTTGTTG TCGTTCAGCG 

201 AGAAGTTACC TTCTTTGGCG AAGATGATGT TGTCGCCGCC GTTTTTGTCC 

251 TGTTCGCGCA GGAACAGGTT TTTCATGATG CCGGATTCGG TATCGAAGGT 

301 TTCGACAAAA TAAACCCTGC CGTTGCGCTT GCCCAAACTG TTGAACTCGC 

. 351 CTGCCTCCAC CAAAGACAAT TCCTGCTTCT GCTTCAGGAT TTCAGCGTAT 

4 01 TCGCGGCTGC GTAGCTCTGC CCACGGTATC ACCCAAAGCT GCATGACGGC 

4 51 AACCAAAACG GCAAACGGCA CGGCAAACTG CATCACCGGG CGTATCCATT 

501 GTTTCAATGC CAATCCGCAg GATAG 

This corresponds to the amino acid sequence <SEQ ID 730; ORF 21 1>: 

m211 .pep 

1 MLRVAAANQL GGRNGTAVGN GVDEFGRGAD NQVEFLEGNL IWGASGRA A 

51 VTVAVAQ FER AFVWQRE VT FFGEDDWAA VFVLFA QEQV FHDAGFGIEG 

101 FDKINPAVAL AQ TVELACLH QRQFLLLLQD FSVFAAAXLC PRYHPKLHDG 

151 NQNGKRHGKL HHRAYPL FQC QSAG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 2 1 1 shows 89. 1 % identity over a 1 74 aa overlap with a predicted ORF (ORF 2 1 1 .ng) 
from N. gonorrhoeae: 

m211/g211 

10 20 30 40 50 60 

MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNL IWGASGRAA VTVAVAQFER 
M|:| II III I II Ihl I III II: III I IIMIII II I Mill III III I I Nil II III 
MLR I AAANQLGGRNG AAVGNGVD KFGRGADNQ VE FLEGNL I WGASGRAAVTVAVAQFE R 
10 20 30 40 50 60 



m211.pep 
g211 



70 80 90 100 110 120 

m211 . pep AFVWQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVELACLH 
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g2ii 



I 1 1 h 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 I II I 1 1 1 1 I I h 1 1 1 I I I 1 1 h = 1 1 1 = I 

AFWLQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGVKGFDE I NPAVALAQVI ELAGFH 
70 80 90 100 110 120 



130 140 150 160 170 

m2 1 1 . pep QRQFLLLLODFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

. I I I I I t I I I = I = : I I I !* I I I I I I I I I I I I I I : II I I I I I I HUM Mil 
g2 11 QRQFLLLLQNFGI FAAAQLCPRYHPKLHDGNODGKRHGKLHDGAYPLFQRQSAG 

130 140 150 160 170 . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 73 1>: 



a211 . seq 








1 


ATGTTGCGGG 


TTGCTGCTGC 


CAATCAGTTG 


51 


GGTGGGAAAC 


GGGGTCGATG 


AGTTTGGGCG 


101 


AGTTTTTGGA 


AGGAAACCTG 


ATTGTAGTCG 


151 


GTAACGGTAG 


CCGTGGCGCA 


ATTCGAGCGT 


201 


AGAAGTTACT 


TTCTTTGGCG 


AAGATGATGT 


251 


TGTTCGCGCA 


GGAACAGGTT 


TTTCATGATG 


301 


TTCGACAAAA 


TAAACCCTGC 


CGTTGCGCTT 


351 


CTGCCTCCAC 


CAAAGACAAT 


TCCTGCTTCT 


401 


TCGCGGCTGC 


GTAGCTCTGC 


CCACGGTATC 


451 


AACCAAAACG 


GCAAACGGCA 


CGGCAAACTG 


501 


GTTTCAATGC 


CAATCCGCAG 


GATAG 



GGCGGTCGGA 
TGGTGCTGAT 
GCGCGTCCGG 
GCGTTTGTTG 
TGTCGCCGCC 
CCGGATTCGG 
GCCCAAACTG 
GCTTCAGGAT 
ACCCAAAGCT 
CATCACCGGG 



ATGGTACGGC 
AATCAGGTTG 
GCGTGCCGCT 
TCGTTCAGCG 
GTTTTTGTCC 
TATCGAAGGT 
TTGAACCCGC 
TTCAGCGTAT 
GCATGACGGC 
CGTATCCATT 



This corresponds to the amino acid sequence <SEQ ID 732; ORF 21 l.a>: 

a211.pep 

1 MLRVAAANQL GGRNGTAVGN GVDEFGRGAD NQVEFLEGN L IWGASGRAA 

SI VTVAVA QFER AFVWQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGIEG 

101 FDKINPAVAL AQTVEPACLH QRQ FLLLLQD F5VFAAA*LC PRYHPKLHDG 

151 NQNGKRHGKL HHRAYPLFQC QSAG* 



m21 l/a21 1 99.4% identity in 174 aa overlap 

10 20 

m211 .pep 



50 



60 



a211 



30 40 

MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLI WGASGRAAVTVAVAQFER 
I I I I I I I I I I ! I t I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I 
MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLI WGASGRAAVTVAVAQFER 
10 20 30 40 50 60 



70 80 90 100 110 120 

m211 . pep AFVWQREVT FFGEDDVVAAVFVLFAQEQVFH DAG FG I EGFDKIN PA VALAQTVELACLH 

I I I I I I I I I I I I I I II I 1 I I I I I I I I I I I I I II I I 1 I I I I I I I I I I I I I I I I I I I I I I I 
a211 AFWVQRE VT FFG ED D WAAV FV LFAQE Q V FH DAG FGIEGFDKIN PAV ALAQTVE PAC L H 

70 80 90 100 110 120 



130 140 150 160 170 

m211 . pep QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

I III III llllllllill III IIMIillMIKIIMII IIMIMIIIIIMI 
a211 QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 
130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 733>: 

g212.seq (partial) 

1 atggacaatc tcgtatggga cggcattccc gacatccgca cactcgacca 

51 aaccatccgc aaacacgcac acccgctcaa cctgattgtc tgcctccccg 

101 ataatcagat tcccgatttt caaaccgcac aagatgcttc ggactcggaa 

151 tgccgcctga agcaccgttt ggatcaggca acccagtgcc tccagttcga 

201 cagcatcaac ctcatcgaac acatcctgcc cgatgtccgc ttctggctgg 

251 ttcccccttc acgcacccgc cgcctgcacg aacacttcca ccacatttcc 

301 tggcagaccg aagccatccc gcaaaccgaa agcaagtccg acaaaccctg 

351 gtttgcactt ccacaaacat ccgaacggaa aaaaccggaa cacgtcctcg 
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4 01 tcatcggtgc aggcattgcc ggcgcatcga ccgcccacgc cttagcatca 

451 cacggcattt ccgttaccgt attggaagcc cgaaaagccg ctcaagccgc 

501 cagcggcaac cggcaagggc tgctttacgc caaaatctcg ccgcacgaca 

551 ccggacagac cgaactgctg cttgccggct acggctacac caaacgcctg 

601 ctcggacaca tcctgcccga ctccgacact tggggcggca acggcatcat 

651 ccacctcaat tacagccgca ccgaacaaca acgcaatcac gaattgggtt 

701 tgcaaaaaca ccataaccac ctctaccgca gcatcacgtc tgcagaagcc 

751 gaaaaaatcg ccggcatccc gctgaacacg ccctacgccg aaccattatg 

801 cggactctac tggcaacacg gcgtatggct caatccgccc gcattcgtcc 

851 gcaccctcct cagccatccg ctgatcgaac tatatgaaaa cacaacgtta 

901 accggcattt cccacgacgg agaaaagtgg attgcaagca cgccaaacgg 

951 cacatttacc gccacacaca tcatctactg caccggcgcg cacagcccct 

1001 gcctgcccga aaccaacctc gccgccctac ccctcaggca aatacgcgga 

1051 caaaccggcc tcacaccgtc caccccgttt tccgaacaac tgcgttgcgc 

1101 cgtttcaggc gaaagctaca tcagcccgtc gtggcacgga ctgcactgct 

1151 acggcgcgag ttttattccc aacagcagca ataccggatg gaacgaagcc 

1201 gaagaagcct caaaccgcca agcattggca caccttaacc ccgcccttgc 

1251 cgaatcattg ttt... 

This corresponds to the amino acid sequence <SEQ ID 734; ORF 212.ng>: 

g212.pep (partial) 

1 MDNLVWDGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPDF QTAQDASDSE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAIPQTE SKSDKPWFAL PQTSERKKPE H VLVIGAGIA GASTAKALAS 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTGQTELL LAGYGYTKRL 

201 LGHILPDSDT WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITSAEA 

251 EKIAGIPLNT PYAEPLCGLY WQHGVWLNPP AFVRTLLSHP LIELYENTTL 

301 TGISHDGEKW IASTPNGTFT ATHI IYCTGA HSPCLPETNL AALPLRQIRG 

3 51 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSNTGWNEA 

4 01 EEASNRQALA HLNPALAESL F. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 735>: 

m212 . seq 

1 ATGGACAATC TCGTATGGGA CGGCATTCCC GACATCCGCA CACTCGACCA 

51 AGCCATCCGC AAACACGCAC CCCCGCTCAA CCTGATTATC TGCCTCCCCG 

1C1 ATAATCAGAT TCCCGATTTT CAAACCGCAC AAGATGCTTC GGACGCGGAA 

151 TGCCGTCTGA AGCACCGTTT GGATCAGGCA ATGCAGTGCC TCCAGTTCGA 

2 CI CAGCATCAAC CTCATCGAAC ACATCCTGCC CGATGTCCGC TTCTGGCTGG 

251 TTCCCCCTTC ACGCACCCAC CACCTGCACG AACATTTCCA CCACATTTCC 

3 01 TGGCAGACCG AAGCCATCCC GCAAACCGAA AGCAAGCCCG ACAAACCCTG 
351 GTTTGCACTT CCACAAACAT CCGAACGGCA AAAACCGGAA CACATCCTCG 
401 TTATCGGCGC GGGCATATCC GGCGCGGCAA CCGCCCACGC CTTAGCATCA 

4 51 CACGGCATTT CCGTTACCGT ATTGGAAGCC CGAAAAGCCG CCCAAGCCGC 
501 CAGCGGCAAC CGCCAAGGGC TGCTCTACGC CAAAATCTCG CCGCACGACA 
551 CCGAACAGAC CGAACTTTTG CTTGCCGGCT ACGGCTACAC CAAACGCCTG 
601 CTCGGACACA TCCTGCCCGA ATCCGAAACC TGGGGCGGCA ACGGCATCAT 
651 CCACCTCAAT TACAGCCGCA CCGAACAACA ACGCAATCAC GAATTGGGTT 
701 TGCAAAAACA CCATAACCAC CTCTACCGCA GCATCACATC TGCAGAAGCC 
751 GAAAAAATCG CCGGTATCCC ACTGTCCGTC CCATACGACC ACCCTTCATG 
801 CGGACTCTAC TGGCAACACG GCGTATGGCT CAATCCACCC GCATTCGTCC 
8 51 GCACCCTCCT CAACCATCCG CTCATTGGAC TACACGAAGA CACACCCTTG 
901 ACCGACATTT CCCACGACGG GGaAAAGTGG ATTGCAAGCA CGCCAAACGG 
951 CACATTTACC GCCACACACA TCATCTACTG CACCGGTGCG AACAGCCCCT 

1001 ACCTACCCGA AACCAACCTC GCCGCCCTGC CTCTCAGGCA AATACGCGGA 

1051 CAAACCGGCC TCACACCGTC CACCCCGTTT TCCGAACAAC TGCGTTGCGC 

1101 CGTTTCAGGC GAAAGCTACA TCAGCCCGTC GTGGCACGGA CTGCACTGCT 

1151 ACGGCGCGAG TTTTATTCCC AACAGCAGCC ATACCGGATG GAACGAAGCC 

1201 GAAGAAGCCT CAAACCGCCA AGCATTGGCA CACCTTAACC CCGCCCTTTC 

1251 CGAATCATTG TTTGCCGCCA ACCCAAACCC CCAAAAACAC CAAGGGCACG 

1301 CCGCCATACG CTGCGACAGC CCCGACCACC TTCCCCTAGT CGGCGCACTC 

1351 GGCGACATTG CCGCCATGCG GCAGACCTAC ACCAAACTCG CGCTGGACAA 

1401 AAACTACCGC ATCGACACCC CATGCCCATA CCTGCCTAAT GCCTACGTCA 

1451 ACACCGCGCA CGGCACCCGC GGACTCGCCA CCGCCCCCAT CTGCGCCGCC 
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1501 GmCAwTG C AG CCCAAATCsT AGGCyTGCCC CATCCCTTTT yAcAAcGCCT 
1551 gCGCCACGCC CTACACCCCA ACCGCACCAT CATCCGCGCC ATCGTCAGAA 
1601 GGAAGGATCT AACCCCTTAA 

This corresponds to the amino acid sequence <SEQ ID 736; ORF 212>: 
m212 .pep 

1 MDNLVWDGIP DIRTLDQAIR KHAPPLNLII CLPDNQIPDF QTAQDASDAE 

51 CRLKHRLDQA MQCLQFDSIN LIEHILPDVR FWLVPPSRTH HLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE H ILVIGAGIS GAATAHALA S 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGI IHLN YSRTEQQRNH ELGLOKHHNH LYRSITSAEA 

251 EKIAGIPLSV PYDHPSCGLY WQHGVWLNPP AFVRTLLNHP LIGLHEDTPL 

301 TDISHDGEKW IASTPNGTFT ATHIIYCTGA NSPYLPETNL AALPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

401 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 

451 GDIAAMRQTY TKLALDKNYR IDTPCPYLPN AYVNTAHGTR GLATAPICAA 

501 XXAAQIXGLP HPFXQRLRHA LHPNRTIIRA IVRRKDLTP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 212 shows 92.9% identity over a 421 aa overlap with a predicted ORF (ORF 21 2.ng) 
from N. gonorrhoeae: 
m212/g212 

10 20 30 40 50 60 

MDNLVWDGI PDIRTLDQAIRKHAPPLNLI I CLPDNQI PDFQTAQDASDAECRLiKHRIjDQA 

1 1 II II II I III III I hi II II I Ml hi I III Nil III I II I hi 1 II I III III 

h4DNLVWDGI PDIRTLDQTIRKHAHPLNLI VCLPDNQI PDFQTAQDASDS ECRLKHRIiDQA 
10 20 30 40 50 60 

70 80 90 100 110 120 

MQCLQFDS I NL I E HI L PDVR FWLV PPSRTHHLHEH FHH I S WQ TE AI PQTES KPDKPWFAL 

I II IIIH II U Mill III MIIUM- MINI I IN II Mill Mi I lllllll 

TQCLQFDSINLI EHILPDVRFWLVPPSRTRRLHEHFHHISWQTEAI PQTES KSDKPWFAL 
70 80 90 100 110 120 

130 140 150 160 170 180 

PQTS ERQKPEHI LVI GAGI S GAATAHALAS HG I SVTVLEARKAAQAASGNRQGLLYAKI S 

I Mllhlllhill llihlhlllllllillllll III III IIIIIIMIII III III 

PQTSERKKPEHVLVI GAGI AGASTAHALASHGI SVTVLEARKAAQAASGNRQGLLYAKI S 
130 140 150 160 170 180 

190 200 210 220 230 240 

PHDTEQTELLLAGYGYTKIUjLGHILPESETWGGNGIIH 

Mil III! Ml III III III II llhhll III III III I INN II III II I III III 

PHDTGQTELLLAGYGYTKRLLGH I LPDSDTWGGNG 1 1 HLN YSRTEQQRNHELGLQKHHNH 
190 200 210 220 230 240 

250 260 270 280 290 300 

LYRS I TSAEAEKI AG I PLSVPYDHPS CG L YWQHGVWLNP PAFVRTLLNH PL I GLHEDT P L 

I Ml I M II II I III II I- II • I II I II Ml I II III MM II hill I hh- 1 I 

LYRSI TSAEAEKI AGI PLNTPYAEPLCGLYWQHGVWLNPPAF^/RTLLSHPLI EL YENTTL 
250 260 270 280 290 300 

310 320 330 340 350 360 

TDI SHDGEKWI AS TPNGTFTATHI I YCTGANS P YLPETNLAALPLRQI RGQTGLTPSTP F 

i iimimimiiiiiimiiiiihii iiiiiiiiiiiiiiiiiiiiiiini 

TGI SHDGEKWI ASTPNGTFTATH 1 1 YCTGAHS PCLPETNLAALPLRQI RGQTGLTPSTP F 
310 320 330 340 350 360 

370 380 390 400 410 420 

SEQLRCAVSGESYIS PSWHGLHCYGASFI PNSSHTGWNEAEEASNRQALAHLNPALSESL 



m212 .pep 
g212 

m212 -pep 
g212 

m212 .pep 
g212 

m212.pep 
g212 

ra212 .pep 
g212 

m212 .pep 
g212 

m212.pep 
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g212 



I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I : I I I I I I I I I I I I I | M | I | | | I : | I ! 
SEQLRCAVSGESYISPSWHGLHCYGASFIPKSSNTGWNEAEEASNROALAHLNPALAESL 
370 380 390 400 410 420 



430 440 450 460 470 480 

m2 12 . pep FAAN PNPQ KHQGHAAI RCDS PDHLP LVGALGD I AAMRQTY T KLALDKNY RIDT PC P YLPN 

I 

g212 F 



The following partial DNA sequence was 

a212 . saq 

1 ATGGACAATC TCGCATGGAA CGGCATTCCC 

51 AACCATCCGC AAACACGCAC ACCCGCTCAA 

101 ATAATCAGAT TCCCAATTTT CAAACCGCAC 

151 TGCCGTCTGA AGCACCGTTT GGATCAGGCA 

201 CAGCATCAAC CTGATTGAAC ACATCCTGCC 

251 TTCCCCCTTC ACGCACCCGC CGCCTGCACG 

301 TGGCAGACCG AAGCCATCCC GCAAACCGAA 

351 GTTTGCACTT CCACAAACAT CCGAACGGCA 

401 TTATCGGAGC GGGCATATCC GGCGCGGCAA 

451 TACGGCATTT CCGTTACCGT ATTGGAAGCC 

501 CAGCGGCAAC CGCCAAGGGC TGCTCTACGC 

551 CCGAACAAAC CGAACTGCTG CTTGCCGGCT 

601 CTCGGACATA TCCTGCCCGA ATCCGAAACC 

651 CCACCTCAAT TACAGCCGCA CCGAACAACA 

701 TGCAAAAACA CCATAACCAC CTCTACCGCA 

751 GAAAAAATCG CCGGCATCCC TCTGAACACG 

801 CGGACTGTTT TGGCAGTACG GCGTATGGCT 

851 GCGCCCTCCT CAGCCATCCG CTCATTGGAC 

901 ACCGACATTT CCCACGACGG GGAAAAGTGG 

951 CACATTTACC GCCACACACA TCATCTACTG 

1001 ACCTACCCGA AACCAACCTC GCCACCCTGC 

1051 CAAACCGGCC TCACACCGTC CACCCCGTTT 

1101 CGTTTCAGGC GAAAGCTACA TCAGCCCGTC 

1151 ACGGCGCGAG TTTTATTCCC AACAGCAGCC 

1201 GAAGAAGCCT CAAACCGCCA AGCATTGGCA 

1251 CGAATCATTG TTTGCCGCCA ACCCAAACCC 

1301 CCGCCATACG CTGCGACAGC CCCGACCACC 

1351 GGCGACATTG CCGCTATGCA ACAAACTTAC 

1401 AAACTATCGC ATCGATGCCC CCTGCCCGTA 

14 51 ACACCGCCCA CGGCACACGC GGGCTTGCCA 

1501 GCCGTTGCAG CCGAAATCCT AGGCTTGCCC 

1551 GCGCCACGCC CTACACCCCA ACCGCGCCAT 

1601 GGAAGGATCT AACCCCTTAA 



identified inN. meningitidis <SEQ ID 737>: 



GACATCCGCA 
CCTGATTGTC 
AAGATGCTTC 
ACCCAGTGCC 
CGATGTCCGC 
AACACTTCCA 
AGTAAGCCCG 
AAAACCGGAA 
CCGCCCACGC 
CGAAAAGCCG 
CAAAATCTCG 
ACGGCTACAC 
TGGGGCGGCA 
ACGCAATCAC 
GCATCACGCA 
CCCTACGCCG 
CAATCCTCCC 
TACACGAAGA 
ATTGCAAGCA 
CACCGGTGCG 
CCCTCAGGCA 
TCCGAACAAC 
GTGGCACGGA 
ATACCGGATG 
CACCTTAACC 
C CAAAAAC AC 
TTCCCCTAGT 
GCCAAACTCG 
CCTGCCCAAT 
CCGCCCCCAT 
CATCCCCTCT 
CATCCGCGCC 



CACTCGACCA 
TGCCTCCCCG 
GGACGCGGAA 
TCCAGTTCGA 
TTCTGGCT GG 
CCACATTTCC 
ACAAACCCTG 
CACATCCTCG 
CTTAGCATCA 
CCCAAGCCGC 
CCGCACGACA 
CAAACGCCTG 
ACGGCATCAT 
GAATTGGGTT 
GGCAGAAGCC 
AACCATTATG 
ACATTCGTCC 
CACACCGTTA 
CGCCAAACGG 
AACAGCCCCT 
AATACGCGGA 
TGCGTTGCGC 
CTGCACTGCT 
GAACGAAGCC 
CCGCCCTTTC 
CAAGGGCACG 
CGGCGCACTC 
CGCTGGACAA 
GCCTACGCCA 
CTGCGCCGCC 
CAAAACGCCT 
ATCGTCAGAA 



This corresponds to the amino acid sequence <SEQ ID 738; ORF 212.a>: 

a212 .pep 

1 MDNLAWNGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPNF QTAQDASDAE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAI PQTE SKPDKPWFAL PQTSERQKPE H ILVIGAGIS GAATAHAL AS 

151 YGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITQAEA 

251 EKIAGIPLNT PYAEPLCGLF WQYGVWLNPP TFVRALLSHP LIGLHEDTPL 

301 TDISHDGEKW IASTPNGTFT ATHIIYCTGA NSPYLPETNL ATLPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

401 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 

451 GDIAAMQQTY AKLALDKNYR IDAPCPYLPN AYANTAHGTR G LATAPICAA 

501 AVAAEILGLP HPLSKRLRHA LHPNRAIIRA IVRRKDLTP* 



m212/a212 93.7% identity in 539 aa overlap 

10 20 30 40 50 60 

m512 .pep MDNLVWDGIPDIRTLDQAIRKHAPPLNLIICLPDNQIPDFQTAQDASDAECRLKHRLDQA 
II 11:1:1 llllil II I : I I I I I I I I i : I I I I I I I I : I I I i I I i I I I I I I I i I I I I I I 
a2 12 MDN LAWNG I PDIRTLDQTI RKHAK PLNL I VCL PDNQI PN FQT AQD AS D AECRLKHR I*DQA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m212. pep MQCLQFDSINLIEHILPDVRFWLVPPSRTHHLHEHFHHISWQTEAIPQTESKPDKPWFAL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I i I I I I I II I I I I I I I I I I I I I I I 1 I I I 
a212 TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLHEHFHHISWQTEAIPQTESKPDKPWFAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m212 . pep PQTSERQKPEHILVIGAGISGAATAHALASHGISVTVLEARKAAQAASGNRQGLLYAKI S 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I ! I I I I I I I I I I I I I I M 
a21 2 P QT S ERQKP EH I LVT GAG I S G AAT ARALAS YGI S VTV LE ARKAAQAAS GN RQG LL Y AKI S 

130 140 150 160 170 180 

190 200 210 220 230 240 

m212 .pep PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 
III I II I II il II III II II Ml I. MINI III III II Mlill Mill I III II I III! 
a212 PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

190 200 210 220 230 240 

250 260 270 280 290 300 

m212.pep LYRSITSAEAEKIAGIPLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNHPLIGLHEDTPL 
I I I I I I : M I I 1 1 I I I I I I I : I I I I : I I : II I I I I ! : I I I : | I : | | I I I II I I II I 
a212 LYRSITQAEAEKIAGIPLNTPYAEPLCGLFWQYGVWLNPPTFVRALLSHPLIGLHEDTPL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m212.pep T D I S H DGEKW I AST PNGT FT ATH 1 1 YCTG ANS PY L P ETNL AAL P LRQI RG QT G LT FS T P F 

I II I M I I I I I I I I I I II II I I I I II I I I II I I II I I I I I I : II II I II I I M I II I I I I 
a212 TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLATLPLRQI RGQTGLTPSTPF 

310 320 330 340 350 360 

370 380 390 400 410 420 

m212 . pep SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 
I I II 1 1 I I I I I I I I I I M II I I I It I I M I 1 1 I I I I II I 1 1 I I I I II M I M I I I M I I I 
a212 SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 
370 380 390 400 410 420 

430 440 450 460 470 480 

m212 .pep FAAN PN PQKHQGHAA I RC DS PDHL PLVGALGDI AAMRQTYTKLAL DKN YR I DT PC P Y L PN 

I f I I II I I I I I I I II I I I I I II I II I I I I [ I II I I I : I I I: I I I I I I I I I I I : II I I I I I 
a212 FAAN PN PQKHQGHAAI RC DS PDHL PLVGALG DI AAMQQT YAKLALDKN YR IDAPCPYLPN 

430 440 450 460 470 480 

490 500 510 520 530 540 

m212.pep AYVNTAHGTRGLATAPICAAXXAAQIXGLPHPFXQRLRHALHPNRTIIRAIVRRKDLTPX 

II : 1 1 I I II II 1 1 I I I I I M ICI Hill: : I I M 1 1 1 1 II : I I I I 1 1 1 ! I I I II I 
a212 AY ANTAHGTRGLAT AP I CAAAV AA2I LGLPH PLS KRLRHALH PN RA 1 1 RA I VRRK DLT PX 

490 500 510 520 530 540 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 739>: 

g214 .seq 

1 atgatacaaa agatatgtaa gctatttgtt ttaattgtaa tttttgcaac 

51 ttctcccgct tttgcccttc aaagcgacag cagacggccc atccaaatcg 

101 aagccgacca aggttcgctc gatcaagcca accaaaggac cacatttagc 

151 ggcaatgtca tcatcagaca gggtacgctc aacatttccg cctcgtgtgt 

201 caacgtcaca cgcggcaggc aaaggcggcg aatccgtgag ggcggaaggt 

251 tcgcccgtcc gcttcagcca aacgttggac gggggcaaag ggacggtgcg 

301 cggtcaggca aacaacgtta cctattcctc cgcaggaagc actgtcgttc 

351 tgaccggcaa tgccaaagtg cagcgcggcg gcgacgttgc cgaaggtgcg 

401 gtcattacct acaacaccaa aaccgaagtc tataccatca acggcagcac 

451 gaaatcgggt gcgaaatccg cttccaaaac cggcagggtc agcgtcgtca 

501 tccagccttc aagcacacaa aaaaccgaat aaccccgatg ccgtctgaaa 

551 cggaaacgca gttcagacgg catttgccga ccgaaafcgcc gagaagagat 

601 tattga 

This corresponds to the amino acid sequence <SEQ ID 740; ORF 214.ng>: 
g2i4.pep 
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1 MIQKICKLFV LIVIFATSPA FAL QSDSRRP IQI EADQGSL DQANQRTTFS 
51 GNVI IRQGITj NISASCVNVT RGRQRRRIRE GGRFARPLQP NVGRGQRDGA 
101 RSGKQRYLFL RRKHCRSDRQ CQSAARRRRC RRCGHYLQHQ NRSLYHQRQH 
151 EIGCEIRFQN RQGQRRHPAF KHTKNRITPM PSETETQFRR HLPTEMPRRD 
201 Y 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 74 1>: 

m214.seq (partial) 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

301 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

351 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

401 TGATTACATA CAACACCAAA ACCGAAGTCT ATACCATCAG CGGCAGCACA 

451 AAATT. . . 

This corresponds to the amino acid sequence <SEQ ID 742; ORF 214>: 

m214.pep (partial) 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQI EADQGSL DQANQSTTFS 
51 GNWIRQGTL NISAARVNVT RGRQRRRIRE GGRFASPLQP DIGRRQRHGA 
101 RTGKQRCIjFT CRQHRSLNR* CQSTARRRCR RRCGDYIQHQ NRSLYHQRQH 
151 KI. . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 214 shows 80.3% identity over a 152 aa overlap with a predicted ORF (ORF 214.ng) 
from K gonorrhoeae: 

m214/g214 



10 20 30 40 50 60 

m214 .pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGKTVVIRQGTL 

llllllllllll-l-IIIIIMIIIhllilllllllllllll I I I I I I I = I I I I I I 
g214 MIQKICKLFVLIVIFATSPAFALQSDSRRPIQIEADQGSLDQANQRTTFSGNVI IRQGTL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m214 . pep NI SAARVNVTRGRQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFI CRQHRSLNRX 

lllh Mlllllllllllllllll ||||::|| II Ilhllll ||. hi :| 
g214 NI SAS CVNVTRGRQRRRI REGGRFARPLQPNVGRGQRDGARSGKQRYLFLRRKHCRSDRQ 

70 80 90 100 110 120 



130 140 150 

m2 1 4 . pep CQST ARRRCRRRCGDY I QHQNRS L YHQRQHK I 

Ilhllll 1 1 1 1 hIMIIIIIIIIIhl 

9214 CQSAARRRRCRRCGHYLQHQNRSLYHQRQHEIGCE I RFQNRQGQRRHPAFKHTKNRI TPM 

130 140 150 160 170 180 



g 2 1 4 PS ETETQFRRHLPTEMPRRDY 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 743>: 

a214 . seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

2 CI CAATGTTACA CGCGGC.GGC AAAGGCGGCG AATCCGTGAG GGCGG AAGGT 

251 TCGCCAGTCC GCTTCAGCCA GACATTGGAC GGCGGCAAAG GCACGGTGCG 

301 CGGACAGGCA AACAACG7TG CTTATTCATC TGCAGGCAGC ACCGTAGTCT 
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351 
401 
451 
501 
551 
601 



TAACCGGTAA 
GTGATTACAT 
AAAATCCGGC 
TCCAGCCTTC 
CATAAACCTG 
TTATGA 



TGCCAAAGTA 
ACAACACCAA 
GCAAAATCCG 
GAGTACGCAA 
GTTCGGACGG 



CAGCGCGGCG 
AACCGAAGTC 
CTTCCAAATC 
AAATCCGAAT 
CATTTGCCGA 



GCGATGTCGC 
TATACCATCA 
CGGCAGGGTC 
AATCCCAATG 
CCGAAATATT 



CGAAGGTGCG 
GCGGCAGCAC 
AGCGTCGTTA 
CCGTCTGAAA 
GAAGAGATAT 



This corresponds to the amino acid sequence <SEQ ID 744; ORF 214.a>: 

a214 .pep 

1 MIQKICKLFV LIAFFSASPA FA LCjSDSRQP IQIEADQGSL DQANQSTTFS 

51 GNWIRQGTL NISAARVNVT RGXQRRRIRE GGRFA3 PLQP DIGRRQRHGA 

101 RTGKQRCLFI CRQHRSLNR* CQSTARRRCR RRCGDYIQHQ NRSLYHQRQH 

151 KIRRKIRFQI RQGQRRYPAF EYAKIRIIPM PSET*TWFGR HLPTEILKRY 

201 L* 



m214/a214 99.3% identity in 152 aa overlap 

10 20 



30 



40 



50 



60 



m214.pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 
I I 1 1 I I I I I I I I I I I I 1 1 I 1 1 I I I M I 1 M I I 1 1 i I 1 1 I I I I I I I I I I I 1 1 1 1 I I I I I II 
a214 MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 
10 20 30 40 50 60 

70 80 90 100 110 120 

m214 . pep NISAARVNVTRGRQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFICRQHRSLNRX 
I II I I II I I I II I I I I I II I I I I I II I I I I I I ! I I I I I I I I M I I I I I I I I I I I I I II I 
a2 1 4 N I S AAR VNVT RGX QRRR I RE GGR FAS PLQ PD I GRRQRHGARTG KQRCL F I CRQHR S LN RX 

70 80 90 100 110 120 

130 140 150 

m214 . pep CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKI 
I I 1 1 I I I I I I M I I I I I I I I I I I I I I 1 1 I 1 1 I 
a2l4 CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKIRRKIRFQIRQGQRRYPAFEYAKIRIIPM 

130 140 150 160 170 180 



a214 



P S ETXTW FGRHLPTE I LKRYLX 
190 200 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 745>: 

9214-1 .seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT 7TAATTGTAA TTTTTGCAAC 

51 TTCTCCCGCT TTTGCCCTTC AAAGCGACAG CAGACGGCCC ATCCAAATCG 

101 AAGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGTAC CACATTTAGC 

151 GGCAATGTCA TCATCAGACA GGGTACGCTC AACATTTCCG CCTCGCGCGT 

201 CAACGTCACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCCGTCCG CTTCAGCCAA ACGTTGGACG GGGGCAAAGG GACGGTGCGC 

301 GGTCAGGCAA ACAACGTTAC CTATTCCTCC GCAGGAAGCA CCGTCGTTCT 

351 GACCGGCAAT GCCAAAGTGC AGCGCGGCGG CGACGTTGCC GAAGGTGCGG 

401 TCATTACCTA CAACACCAAA ACCGAAGTCT ATACCATCAA CGGCAGCACG 

451 AAATCGGGTG CGAAATCCGC TTCCAAAACC GGCAGGGTCA GCGTCGTCAT 

501 CCAGCCTTCA AGCACACAAA AAACCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 746; ORF 214-l.ng>: 

g214-l .pep 

1 MIQKICKLFV LIVIFATSPA FA LOSDSRRP IOIEADQGSL DQANQSTTFS 
51 GNVI IRQGTL NISASRVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVTYSS AGSTWLTGN AKVQRGGDVA EGAVITYNTK TEVYTINGST 
151 KSGAKSASKT GRVSWIQPS STQKTE* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 747>: 

m2l4-l .soq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 
51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 
101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 
151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 
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201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

301 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

351 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

401 TGATTACATA CAACACCAAA ACCGAAGTCT AT AC CATC AG CGGCAGCACA 

451 AAATCCGGCG CAAAATCCGC TTCCAAATCC GGCAGGGTCA GCGTCGTTAT 

501 CCAGCCTTCG AGTACGCAAA AATCCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 748; ORF 214-1>: 

m214-l.pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DOANQSTTFS 

51 GNWIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 

101 GQANNVAYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 

151 KSGAKSASKS GRVSWIQPS STQKSE* 



jn214-l/g214-l 93.8% identity in 176 aa overlap 

10 20 3C 40 50 6C 

m214-l .pep MIQKI CKLFVLI AFFS AS PAFALQSDSRQP I QT EADQGSLDQANQSTTFS GNWIRQGTL 
I I It I I I t i I M :: I :: I t I I I i I i I I 1 : I I i 1 I I t I I t I I I I I i I I I I [ I t I: I I I I i I 
g214-l MIQKI CKLFVLI VI FAT SPAFALQSDSRRP I QIEADQGSLDQANQSTTFSGNVI I RQGTL 

10 20 30 40 50 6C 

70 80 90 100 110 120 

ni214-l .pep NISAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGSTWLTGN 
I I II :H I I I I I IN I I I I I II I I I II I I I I I I I III I II I I | | I |:| | | 1 | | | | t | | | 
g214-l NISASRVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVTYSSAGSTVVLTGN 

70 80 90 100 110 120 

130 140 150 160 170 

in21 4- 1 . pep AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSWIQPSSTQKSEX 
I I I I I I I I I I ! I I I I I I I I I I I I I t I : t I I I I I I I I I I !: I I I I I I I I I I I I I I : I | 
g214- 1 AKVQRGGDV AEG AVI T YNTKTE V YT I NGS TK S G AKS AS KTGRV S VV I QPS STQKT EX 

130 140 150 160 170 

g214-l/p38685 

sp|P38685|YHBN_ECOLI 17.3 KD PROTEIN IN MURA-RPON INTERGENIC REGION PRECURSOR (ORF185) 
>gi 1 551336 (U12684) orfl85 {Escherichia coli] >gi| 606139 (018997) ORF_ol85 [Escherichia coli) 
>gi | 1789592 (AE000399) orf, hypothetical protein [Escherichia coli] Length = 185 
Score =97.1 bits (238), Expect = 6e-20 

Identities = 57/126 (45%), Positives = 74/126 (58%), Gaps = 3/126 (2%) 

Query: 19 PAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTLNISAARVNVTR — GGKGG 76 

PAFA+ D+ QPI IE+DQ SLD TF+GNV++ QGT+ I+A +V VTR G +G 

Sbjct: 24 PAFAVTGDTDQPIHIESDQOSLDMQGNVVTFTGNVIVTQGTIKINADKWVTRPGGEQGK 83 

Query: 77 ES VRAEG S P VRFS QT LDGGKGTVRGQANN V AYS S AGS T WLT GN AKVQRGGD VAEGAV I T 136 

E+ GP FQ DGK VGA+ + Y A WLTGNA +Q+ +G IT 

Sbjct: 84 EV1DGYGKPATFYQMQDNGK-PVEGHASQMHYELAKDFWLTGNAYLQQVDSNIKGDKIT 142 

Query: 137 YNTKTE 142 

Y K + 
Sbjct: 14 3 YLVKEQ 148 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 749>: 

a214-l.soq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

301 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

351 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

401. TGATTACATA CAACACCAAA ACCGAAGTCT ATACCATCAG CGGCAGCACA 

451 AAATCCGGCG CAAAATCCGC TTCCAAATCC GGCAGGGTCA GCGTCGTTAT 

501 CCAGCCTTCG AGTACGCAAA AATCCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 750; ORF 214-1 .a>: 

a214-l.pop 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
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51 GNWIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVAYSS AGSTWLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 
151 KSG&KSASKS GRVSVVIQPS STQKSE* 

a214-l/m214-l 100.0% identity in 176 aa overlap 

10 20 30 40 50 60 

a214-l.pep M I QKI CKLFVL I A FFS A S P AFALOS D S RQ F I QI E ADQGS L DQANQST TFSGNVVIRQGTL 
I I I i I I I 1 I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 
m214-l MIQKICKLFVLIAFFSASPAFALQSDSRQFIQIEAEX)GSLDQANQSTTF3GNWIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

a214-l . pep NISAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGSTVVLTGN 
I It II! II It III lllll II II 11 Kill lllli I I I I I ! I I I I I 1 I I I !! I I I I I I I I I 
m214-l NISAARVNVTRGGKGGESVRAEGSPVRFSCTLDGGKGTVRGQANNVAYSSAGSTVVLTGN 

70 80 90 100 110 120 

130 140 150 160 170 

a2 1 4 - 1 . pep AKVQRGGDVAEGAVI TYNTKTEVYTI SGSTKSGAKSASKSGRVSVVI QPSSTQKSEX 

1 1 1 1 1 \ 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 it m i n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

m21 4 - 1 AKVQRGGDVAEGAVI TYNTKTEVYTI SGSTKSGAKSASKSGRVSVVI QPSSTQKSEX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 75 1>: 



g215 .seq 








1 


atgaaagtaa 


gatggcggta 


cggaattgcg 


51 


tgccttgggc 


agcctgtcgg 


catggttggg 


101 


tcgaggaagt 


caggctcaat 


cccgacgaac 


151 


ttggacggaa 


ggcggtttga 


cgaacaggga 


201 


cgcgaaaggt 


gcgaaacagt 


ttcccgaaaa 


251 


cgccgcatct 


cgtgttcttc 


caagaaggca 


301 


agcgatgaag 


ccgtttacca 


taccgaaaac 


351 


caacgttgtg 


ctgaccaaaa 


ccgccgacgg 


401 


tcgaaaccga 


aaaactgcac 


gtcgataccg 


451 


gatacgcctg 


tcagtttcca 


atatggcgcg 


501 


tatgacctac 


aaccacaaaa 


caggcatgtt 


551 


aagccgcgat 


ttatgataca 


aaagatatgt 



This corresponds to the amino acid sequence <SEQ ID 752; ORF 215.ng>: 
g215 .pep 

1 MKVRWRYGIA FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSAKG AKQFPENSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNW LTKTADGRRQ AGKVETEKLH VDTESQYAQT 

151 DTPVSPQYGA SHGQAGGMTY NHKTGMLNFS SKVKAAIYDT KDM* 

The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 753>: 

m215.seq (partial) 

1 . .AGCCTGTCGG CATGGTTGGG TCGTATCAGC GAAGTCGAGA TTGAAGAAGT 

51 CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACAGC TTGGACGGCA 

101 GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG CGCGAAGGGC 

151 GCGAAACAGT TTCCGGAAAG CAGCGACATC CATTTTGATT CGCCGCATCT 

201 CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC AGCGACGAAG 

251 CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA CAACGTTGTG 

301 CTGACCAAAA CCGCCGACGG CAAACGGCAG GCGGGTAAAG TTGAAGCCGA 

351 AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC GATACGCCTG 

401 CAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG CATGACTTAC 

451 GACCACAwwA CAGGCATGTT GAACTTCTCA TCTAAAGTGA AAGCCACGAT 

501 TTATGATACA AAAGATATGT AA 

This corresponds to the amino acid sequence <SEQ ID 754; ORF 215>: 

m215.pep (partial) 

1 . . SLSAWLGRIS EVEIEEVRLN PDEPQYTMDS LDGRRFDEQG YLKEHLSAKG 

51 AKQFPESSDI HFDSPHLVFF QEGRLLYEVG SDEAVYHTEN KQVLFKNNW 

101 LTKTADGKRQ AGKVEAEKLH VDTESQYAQT DTPVSFQYGA SHGQAGGMTY 
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151 DHXTGMLNFS SKVKATIYDT KDM* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 215 shows 96.0% identity over a 173 aa overlap with a predicted ORF (ORF 215.ng) 
from N. gonorrhoeae: 

Ttl215/g215 

10 20 30 40 

SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

I Mill MM II Mil II I II II 1IMM -II III II 1 1 1 

MKVRWRYGIAFPLILAVAU3SLSAWIX5RISEVEIEEVRLNPDEPQYTMIX3LDGRRFDE 

10 20 30 40 50 60 

50 60 70 80 90 100 

YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRIiLYEVGSEEAVYHTENKQVLFKNNW 

IMIIM MIIIIMIMMMIIM I II i llllll III II 1 1 MINI Ml IIIMIM 

YLKEHLSAKGAKQFPENSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 
70 80 90 100 110 120 

110 120 130 140 150 160 

LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 

! M M 1 1 M M I M M II 1 1 M M M M 1 1 1 1 1 1 1 1 1 M M 1 1 M i M M M MM I 

LTKTADGRRQAGKVETEKLHVDTE SQYAQTDTPVS FQYGASHGQAGGMT YNHKTGMLNFS 
130 140 150 160 170 180 

170 

S KVKAT I YDTKDMX 

1 1 M M I II M I 

SKVKAAIYDTKDM 
190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 755>: 

a215.seq 

1 ATGAAAGTAA GATGGCGGTA CGGAATTGCG TTCCCATTGA TATTGGCGGT 

51 TGCCTTGGGC AGCCTGTCGG CATGGTTGGG ACGCATCAGC GAAGTCGAGA 

101 TTGAAGAAGT CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACGGA 

151 TTGGATGGCA GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG 

201 TTCGAAGGGC GCGAAACAGT TTCCCGAAAG CAGCGACATT CATTTCGACT 

251 CACCGCATCT CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC 

301 AGCGATGAAG CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA 

351 CAACGTTGTG CTGACCAAAA CCGCCGACGG CAAACGGCAG GCGGGTAAAG 

401 TTGAAGCCGA AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC 

451 GATACGCCTG TCAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG 

501 CATGACTTAC GACCACAAAA CAGGCATGTT GAACTTCTCA TCTAAAGTGA 

551 AAGCCACGAT TTATGATACA AAAGATATGT AA 

This corresponds to the amino acid sequence <SEQ ID 756; ORF 21 5.a>: . 

a215.pep 

1 MKVRWRYGIA FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSSKG AKQFPESSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNW LTKTADGKRQ AGKVEAEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY DHKTGMLNFS SKVKATIYDT KDM"* 

m215/a215 98.3% identity in 173 aa overlap 

10 20 30 40 

m215.pep SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

I I II I 1 I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I 
a215 MKVRWRYGIAFPLILAVALGSLSAWLGRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 
10 2C 30 40 50 60 

50 6C 70 80 90 100 

m2 1 5 . pep YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 



m2l5.pep 
g215 

m215.pep 
9 215 

m215 .pep 
9215 

m215 .pep 
g215 
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I I I I I I I : I 1 1 I I I I I I I I I I I I I I 1 1 I 1 1 I 1 1 I I I I I 1 1 I 1 1 I 1 1 i I I I 1 1 I 1 1 M I I I 
a215 YLKE HL S S KG AKQ FPESSDIHFDS PHLV F FQE G RL L YE V G S DEAV YHTEN KQ VL FKNN V V 

70 80 90 100 110 120 

110 120 130 140 150 160 

m215 . pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVS FQYGASHGQAGGMTYDHXTGMLNFS 

I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || II I I 
a2 1 5 LTKTADGKRQAGKVEAEKLHVDTESQYAQTDT PVS FQYGASHGQAGGMTYDHKTGMLNFS 

130 140 150 160 170 180 

170 

m215.pep SKVKAT I YDTKDMX 

I I I I I I I I I I I I I I 
a215 SKVKATI YDTKDMX 

190 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 757>: 

g216.seq (partial) 

1 . .atgatatcga tttcgagctc ggtacccagc gacgaaatca ccgccatcat 

51 ccccgcactc aaacgcaaag acattaccct cgtctgcatc accgcccgcc 

101 ccgattcaac catggcgcgc catgccgata tccacatcac cgcatcggtt 

151 tcgcaagaag cctgcccgtt ggggcttgcc ccgaccacca gcaccaccgc 

201 cgttatggct ttgggcgacg cgttggcggt cgtcctgctg cgcgcccgcg 

251 cgttcacgcc cgacgacttc gccttgatcc accctgccgg cagcctcggc 

301 aaacgcctgc ttttgcgcgt tgccgacatt atgcacaaag gcggcggcct 

351 gcccgccgtc cgactcggca cgcccttgaa aggagccatc gtcagcatga 

401 gcgagaaagg tttgggcatg tgggcgggaa cggacgggca aaggctgtct 

451 gaaaggcctt tttacnga 

This corresponds to the amino acid sequence <SEQ ED 758; ORF 216.ng>: 

g216.pep (partial) 

1 ..MISISSSVPS DEITAIIPAL KRKDITLVCI TARPD3TMAR HADIHITASV 

51 SQEACPLGLA PT TSTTAVMA LGDALAWLL RARAFTPDDF ALIHPAGSLG 

101 KRLLLRVADI MHKGGGLPAV RLGTPL.KGAI VSMSEKGLGM WAGTDGQRLS 

151 ERPFY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 759>: 

m216 . seq 

1 ATGGCAATGG CAGAAAACGG AAAATATCTC GACTGGGCAC GCGAAGTGTT 

51 GCACGCCGAA GCGGAAGGCT TGCGCGAAAT TGCAGCGGAA TTGsACAAAA 

101 ACTTCGTCCT TGCGGCAGAC GCGTTGTTGC ACTGCAAGGG CAGGGTCGTT 

151 ATCACGGGCA TGGTCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

201 TATGGCCTCG ACCGGCACGC CTGCGTTTTT CGTCCACCCT GCGGAAGCGG 

251 CACACGg CGA TTTGGGTATG ATTGTGGACA rCGACGTGGT CGTCGCGATT 

301 TCCAATTCCG GCGAAAGCGA CGAAATCGCC GCCATCATCC CCGCACTCAA 

351 ACGCAAAGAC ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

4 01 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 

451 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TCATGGCTTT 

501 GGGCGATGCG TTGGCGGTCG TCCtGCTGCG CgcACGCGCG TTCACGCCCG 

551 ACGATTTCGC CTTGAGCCAT CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

701 TGGGCATGTT GGCGGTAACG GACGGG CAAG GCCGTCTGAA AGGCGTATTC 

751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 

801 TCTTT CGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 

851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 

901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG OGCTGAATAT 

951 GCACGACCTG CTGGCGGCAC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 760; ORF 216>: 

m216 -pep 

1 MAMAENGKYL. DWAREVLHAE AEGLREIAAE LXKNFVLAAD ALLHCKGRW 
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51 ITGMVKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDXDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

2 51 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 

3 01 GLLVTDADGV LIGALNMHDL LAARIV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 216 shows 91.8% identity over a 147 aa overlap with a predicted ORF (ORF 216.ng) 
from N. gonorrhoeae: 
m2l6/g216 

70 80 90 100 110 120 

TMASTGT PAF FVH P AEAAHGDLGM I VDXDVWA IS NSGESDE I AAI I PALKRKDITLVCI 

— 1 1 I lllhllllllllllllllll 

MISISSSVPSDEITAIIPALKRKDITLVCI 
10 20 30 

130 140 150 160 170 180 

TARPDSTMARHAD I H I TASVS KEAC PLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 

MIlllMIIIIIIMIIIIhlllllllllllllllllllllllllll IIMIMIM! 

TARPDSTMARHADI H I TAS VSQEACPLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 
40 50 60 70 80 90 

190 200 210 220 230 240 

ALSHPAGS LGKRLLLRVAD IMHKGGGLPAVRLGTPIiKEAI VSMS EKGLGMLAVTDGQGRL 

II IIIIIIIIIIIIMIIIMIIIIIIIIIIIIIII IMII1IIIIII I 1 1 1 1 

ALIHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKGAIVSMSEKGLGMWAGTDGQRIjS 
100 110 120 130 140 150 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 761 >: 

a216.seq 

1 ATGGCGATGG caggaaacga aaaatatctt gattgggcac gcgaagtgtt 
51 gcacaccgaa gcggaaggct tgcgcgaaat tgcggcggat ttggacgaaa 

101 ACTTCGCCCT TGCGGCGGAC GCGTTGTTGC ACTGCAAAGG CAGGGTCGTT 

151 ATCACGGGCA TGGGCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

201 CATGGCCTCG ACCGGCACGC CCGCGTTTTT CGTCCACCCT GCGGAAGCGG 

251 CACACGGCGA TTTGGGCATG ATTGTGGACA ACGACGTGGT CGTCGCGATT 

301 TCCAATTCCG GTGAAAGCGA CGAAATCGCC GCCATCATCC CCGCGCTCAA 

351 ACGCAAAGAT ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

401 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 

451 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TTATGGCTTT 

501 GGGCGATGCG TTGGCGGTTG TCCTGCTGCG CGCCCGCGCG TTCACGCCCG 

551 ACGACTTCGC CTTGAGCCAC CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

701 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 

751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 

801 TCTTTCGATA GACGAAGTCA T3CATACGCA TCCTAAAACC ATCTCCGCCG 

851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 

901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 

951 GCACGACCTT TTGGCGGCGC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 762; ORF 216.a>: 

a216 .pep 

1 MAMAGNEKYL dwarevlhte aeglreiaad LDENFALAAD ALLHCKGRW 

51 ITGMGKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDNDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

251 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 



m216 .pep 
g216 

m216 .pep 
9216 

m216 -pep 
g216 
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301 GLLVTDADGV LIGALNMHDL LAARIV* 

m216/a216 97.2% identity in 326 aa overlap 

10 20 30 40 50 60 

m21 6 . pep MAMAENGKYLDWAREVLHAEAEGLREIAAELXKNFVLAADALLHCKGRWITGMVKSGHI 
MM I MIMIIIIMMMIIMIIIM :||:M II I II II I II I III II Mill 
a216 MAMAGNEKYLDWAREVLHTEAEGLREIAADLDENFALAADALLHCKGRVVITGMGKSGHI 
10 20 30 40 50 60 

70 80 90 100 110 120 

m216 . pep GRKMAATMASTGTPAFFVHPAEAAHGDLGMIVDXDWVAI SNSGES DE I AAI I PALKRKD 

I I M I I I M I II I I I! II II M I I I I I I I I I M M M M I M I I I I I I II ; M I M I I I 
a2l6 GRKMAATMASTGTPAFFVHPAEAARGDLGMIVDNDVWAISNSGESDEIAAII PALKRKD 

70 80 90 100 110 120 

130 140 150 160 170 180 

m21 6 . pep ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAVVLLRAK^ 
I I I I M I I I M I I I I II I I I I II 1 1 M M I I 1 1 I M I I I I I I I I I I I II I ; 1 1 M i I I I 
a216 ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m216.pep FTPDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVT 
I I M I I I I I I I I I I I I II I II I II II II I II I I I I I I I I M II I I I I I I II M I I I I I I I 
a216 FTPDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVT 

190 200 210 220 230 240 

250 260 270 280 290 300 

m21 6 . pep DGQGRLKGVFTIX3DLRRLFQECDNFTGLSIDEVMHTHPKTISAERLATEALKVM0ANHVN 
I I M I 1 I I I I M M I I II I II I I I M II I M I II I II I II II I I I I I I I I I I M I I I I I I 
a216 DGQGRLKGVFTDGDLRRLFQECDNFTGLSIDEVMHTHPKTISAERLATEALKVMQANHVN 

250 260 270 280 290 300 

310 320 
m21 6 . pep GLLVTDADGVLIGALNMHDLLAARIVX 
I I I I I I I I II II I I I I I I I I II II I I I 
a21 6 GLLVTDADGVLIGALNMHDLLAARIVX 

310 320 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 763>: 



g217.seq 




1 


atggcggatg 


51 


tgctctcttc 


101 


ttttggtcat 


151 


acgcgccacc 


201 


tatccgacag 


251 


tgattcacgt 


301 


gccttcgggc 


351 


aggtaaacag 


401 


tggacgttca 


451 


ggtcagtccg 


501 


ctttgccgac 


551 


gcataaaaaa 


601 


cggcaatgcc 


651 


aaggcgtata 


This corresponds to the 


g2l7.pep. 




l 


MADDGLLRQL 


51 


TRHPFADRRR 


101 


APGOHPAOYH 


151 


GOSETAOHGR 



ttcacgtccg aaacggaata 
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201 RQCLRAGLRL SEHQFHKRRI GFDIRG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 765>: 



m217. seq 








1 


ATGGCGGATG 


ACGGTGTGCG 


GCGGCAACTG 


51 


CGGTTTCCGC 


CTrCCATTTG 


ACCCATTCGT 


101 


TTTTGGTCAT 


CGGCTTCAGC 


TTGGAACAAT 


151 


ACGCGCCACC 


CATTCGCCGA 


CCGTTGCGGG 


201 


TATCCGTCAG 


GGTTTCGAGG 


AAGGCGGCAA 


251 


TGATTCACGT 


CGGCATACCA 


CGCGCTGACA 


301 


GCCTTTGGGC 


AGCATCCAGC 


CCAATATCAT 


351 


AGGTGAACAG 


CTGATCCGTG 


CCGCCGCGCA 


401 


TGGACGTTCA 


AATCGGCAAC 


CATGTCGTGC 


451 


AGTCAGTCCG 


AAACGGCGCA 


ACACGGGCGC 


501 


CTTTATCGAC 


TTCAAATCGG 


CTTTCCAACA 


551 


GCATGAAACA 


GCGGTTGGCG 


GCGGCTGATT 


601 


CGGCAATGCC 


TGCGCACCgG 


GctGCGCCTG 


651 


AAGGCGTATA 


GGATTCGATA 


TTCGGGGTTA 



TTGGACTGAC 



GCCTGCTTCC 
CAAACCCCAG 



This corresponds to the amino acid sequence <SEQ ID 766; ORF 217>: 

m217.pep 

1 MADDGVRRQL SGKLRQFGFR LPFDPFVFKV LDXLLVI GFS LEQCFKQIPA 

51 TRHPFADRCG LPPYPYNIRQ GFEEGGKTSB HGGLIHVGIP RADILPHRIA 

101 AFGQHPAOYH AFYRLLPGEQ LIRAAAHFSV QTPVDVQIGN HWQKRXIVL 

151 SQSETAQHGR GFXKHKHFID FKSAFQQVEQ AXQSMKQRLA AADFHVXHGI 

201 RQCLRTGLRL SEHGFDKRRI GFDIRG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 217 shows 80.5% identity over a 226 aa overlap with a predicted ORF (ORF 217.ng) 
from N. gonorrhoeae: 

m217/g2l7 



10 20 30 40 50 60 

m2 17 . pep MADDGVRRQLSGKLRQFGFRLPFDPFVFKVLDXLLVIGFSLEOCFKQI PATRHPFADRCG 

11111= UN I I : lllllllhlll Mill :|:|||||IMIIIIIIII 
g217 MADDGLLRQLSEKPSQSALFLPFDPFVFEVLDCLLVIGPGLKQCFKQIPATRHPFADRRR 
10 20 30 40 50 60 



70 80 90 100 110 120 

m217.pep LP ? YP YNI RQGFEEGGKTS EHGGLI HVGI PRAD I LPHRI AAFGQHPAQYHAFYRLLPGEQ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I i I I I I I h I I II I « I 
g217 LP P YPGNI RQGFEEGGKTS EHGGLiI HVG I PRAD I LPHRVAAFGQHPAQYHTVCRLLPGKQ 

70 80 90 100 110 120 



130 140 150 160 170 180 

m2 1 7 . pep LI RAAAHFSVQTPVDVQIGNHWQKRXI VLSQSETAQHGRGFXKHKHFIDFKSAFQQVEQ 

lllimtllhhllllllhllll II -11111111111 |::|| llhlllll : 
g217 LI RAAAHFSVQSPMDVQI GNHI VQKRQI VPGQSETAQHGRGFQKREHFADFKTAFQQVGH 

130 140 150 160 170 180 



190 200 210 220 

m217 . pep AXQSMKQRLAAADFHVXHGIRQCLRTGLRLSEHGFDKRRIGFDIRGX 

I I :hll llllll Hlllllhlllllllll llllllllll 
g 2 1 7 ALQRI KKRLPAADFHVRNGI RQCLRAGLRLSEHGFHKRR I GFDIRG 

190 200 210 220 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 767>: 

a217.seq 

1 GTGGCGGATG ACGGTGTGCA GCGGCAACTG TCCGGAAAAT TGCGCCAATT 

51 CGGTTTCCGC CTGCCATTTG ACCCATTCGT TTTCGAGGCT TTGGACTGCC 

101 TTTTGGTCAT CGCCTTCGAC TTGGAACAAT GTTTCAAGCA AATCCCGGCA 

151 ACGCGCCACC CATTCGTCAA CCGTCGCAGG TTGCCGCCAT ATCCGTACAA 
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201 TATCCGTCAG GGTTTCGAGG AAGGCGGCAA AACGTCCGAA CAGGGCGGTT 

251 TGGTTCACGT CGGCATACCA CGCGCTGACC CCCTGCCACA TCGGATTGCC 

301 GCCTTCGGGC AGCATCCAGC CCAATATCAT GCGTTCTACC GCCTGCTTCC 

351 AGGTGAACAG CTGATCCGTG CCGCCGCGCA TTTCTCCGTC CAAACCCCAG 

4 01 CGGACGTTCA AATCGGCAAC CATGTCGTGC AAAAGCGGCA AATCGTCCTC 

4 51 AGTCAGTCCG AAATGGCGCA ACACGGGCGC GGTTTCTAAA AGCACAAGCA 

501 CTTTATCGAC TTCAAATCGG CTTTCCAACA AGTCGAACAG GCATGACAAA 

551 GCATGAAACA GCGGTTGTCG GCGGCTGATT TTCACATCCG AAACGGAATA 

601 CGGCAATGCC TGCGCGCCGG GCTGCGCCTG TCCGAACACG GCTTCGATAA 

651 AAGGCGTATA GGATTCGATA TTCGGGGTTA A 

This corresponds to the amino acid sequence <SEQ ID 768; ORF 217.a>: 

a217 .pep 

1 VADDGVQRQL SGKLRQFGFR LPF DPFVFEA LDCLLVIAFD L EQCFKQIPA 

51 TRHPFVNRRR LPPYPYNIRQ GFEEGGKTSE QGGLVHVGIP RADPLPHRIA 

101 AFGQHPAQYH AFYRLLPGEQ LIRAAAHFSV QTPADVQIGN HWQKRQIVL 

151 SQSEMAQHGR GF*KHKHFID FKSAFQQVEQ A*QSMKQRLS AADFHIRNGI 
201 RQCLRAGLRL SEHGFDKRRI GFDIRG* 

m217/a217 90.3% identity in 226 aa overlap 

10 20 30 40 50 60 

m2i7 . pep maddgvrrqlsgklrqfgfrlpfdpfvfkvldxllvigfsleqcfkqi PATRHPFADRCG 

: I I I I I : I I I I I I I I I I I I I I I I I I I I I : : I I I I I I : I : I I I I I 1 1 I I I I I I I I : : I 
a2 17 VADDGVQRQLSGKLROFGFRLPFDPFVFEALDCLLVI AFDLEQCFKQI PATRHPFVNRRR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m217 . pop LPPYPYNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRIAAFGQHPAQYHAFYRLLPGEQ 
I I I I 1 I t i I I I I I i I I I I ] I : I I I : I I I I I I I I I I 1 I I I I 1 I I I M I 1 I 1 M I I i E I I 1 
a217 LPPYPYNIRQGFEEGGKTSEQGGLVHVGIPRADPLPHRIAAFGQHPAQYHAFYRLLPGEQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

m217 . pep LIRAAAHFSVQTPVDVQIGNHWCKRXIVLSQSETAQHGRGFXKHKHFIDFKSAFQQVEQ 

I I I I ! I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
a217 LI RAAAHFSVQTPADVQIGNHVVQKRQIVLSQSEMAQHGRGFXKHKHF ID FKSAFQQVEQ 

130 140 150 160 170 180 

190 200 210 220 

m217 . pep AXQSMKQRLAAADFHVXHGIRQCLRTGLRLSEHGFDKRRIGFDIRGX 

I I I I I I I I I : I I I I I : : I I I I I I I : ! I I I I I I I I I I I I 1 I I I I I I I 
a2 1 7 AXQSMKQRLSAADFHIRNGIRQCLRAGLRLSEHGFDKRRIGFDI RGX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 769>: 

g218 . seq 

1 atggttgcgg tggatcctta tacggcaaaa gtggtcaaca ccatgccgcg 

51 caatcagggt tggtatcaca ctatggatga aatccacggc gatatgatgc 

101 tcggtgcggc aggcgattat cttttggaaa cggcagcttc actgaccatt 

151 attatggttg tcagcggctt gtacctttgg tgggcgaaac agcgcggcat 

201 taaagcgatg ctgctgccgc caaaaagcag ggcgcgttct tggtggcgga 

251 atctgcacgg cgcgtttgga acttgggtgt cgttgatttt actgtcgttc 

301 tgcctgtcgg gtattgcttg ggcaggtatt tggggcggca aattcgtgca 

3 51 ggcttggaat cagttcccgg ccggcaaatg gggtgtcgaa ccgaaccccg 
401 tttcaatcgt gccgacccac ggcgaggtat tgaatgacgg caaggttaag 

4 51 gaagtgccgt ggattttgga gcttatgcct atgcctgtct cagggacgac 
501 tgtgggtgaa aacggcatta accccaccga gcccaaCaac attggaaacc 
551 gtcgaccgtt tcgcgcggga aatcggtttc aaagggcgtt atcagttgaa 
601 tttgcccaaa ggcgaggacg gggtatggac tttgtcgcag gattctatga 
6 51 gttatga 

This corresponds to the amino acid sequence <SEQ ID 770; ORF 218.ng>: 

g218 .pep 

1 MVAVDPYTAK WNTMPRNQG WYHTMDEIHG DMMLGAAGDY LLETAASLTI 
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51 IMWSGL YLW WAKQRGIKAM LLPPKSRARS WWRNLHGAFG TWV SLILLLF 

101 CLSGIAWAGI WGGKFVQAWN QFPAGKWGVE PNPVSIVPTH GEVLNDGKVK 

151 EVPWILELMP MPVSGTTVGE NGINPTEPNN IGNRRPFRAG NRFQRALSVE 

201 FAQRRGRGMD FVAGFYEL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 771 >: 

m218 .seq 

1 ATGGTCGCGG 

51 CAATCAGGGT 

101 TCGGTGCGGC 

151 ATTATGGTTG 

2 01 CAAGGCGATG 

251 ATCTGCACGG 

301 TGCCTGTCGG 

351 GGCTTGGAGT 

401 TTTCAGTCGT 

4 51 GAAGTGCCGT 

501 yGCgGGCAAA 

551 TCGACCGCTT 

601 TTTGCCCAAA 

651 GTTA 

This corresponds to the amino acid sequence <SEQ ED 772; ORF 2 1 8>: 

m218 .pep 

1 MVAVDPYTAK WSTMPRNQG WYYTMDEIHS DMMLGAAGDY LLETAASLTI 

51 IMWSGL YLW WVKRRGI KAM LLPSKGXARS WWRNLHGTFG TWV SLILLLF 

101 CLSGIAWAGI WGGKFVQAWS QFPAGKWGVE PNPVSWPTH GEVLNDGKVK 

151 EVPWVLELTP MPVSGTTVGK DGINPDEPMT LETVDRFARX NRFQRALSVE 

201 FAQRRGRRMD FVAGFYEL 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 218 shows 87.2% identity over a 218 aa overlap with a predicted ORF (ORF 218.ng) 
from N. gonorrhoeae: 

m218/g218 

10 20 30 40 50 60 

^AVDPYTAKWSTMPRNQGWYYTMDEIHSDMMLGAAGDYLI^TAASLTI IMWSGLYLW 

ii ii ii i ii 1 1 hill i ii ii hi ii iihii iii nun ii nun mi ii i ii ii i 

MVAVnPYTAKVWTMPRNQGWYHTNflDEIHGDMMLGAAGDYLlJSTAASL 

10 20 30 40 50 60 

80 90 100 110 120 

WVKRRGI KAMLLPS KGXARSWWRNLHGTFGTWVS LI LLLFCLSG I AWAGI WGGKFVQAWS 

hhlllllllM h 1 1 1 1 1 1 1 1 1 h I I I I I II 1 1 1 I I ! 1 1 1 1 1 1 1 1 1 1 1 II I II I h 

WAKQRGI KAMLLPPKSRARSWWRNLHGAFGTWVSLI LLLFCLSG I AWAGI WGGKFVQAWN 
70 80 90 100 110 120 

130 140 150 160 170 180 

QFPAGKWGVEPNPVSWPTHGEVLNDGKVKEVPWVLELTPMPVSGTTVGKDGINPDEPMT 

iiiiiiiiiiiiiihiiiiiiiiiiimiiihiii iimiiiihHiii ii : 

QFPAGKWGVEPNPVSIVPTHGEVLNDGKVKEVPWILELMPMPVSGTTVGENGINPTEPNN 
130 140 150 160 170 180 

190 200 210 

LE TVDRFARXNR FQRALSVE FAQRRGRRMD FVAG F YE 1 
' : I lllllllllllllllll MIIMIIII 
I GNRRPFRAGNRFQRALS VE FAQRRGRGMD FVAG F YE 1» 
190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 773>: 



TCGATCCTTA TACGGCAAAA 
TGGTATTACA CGATGGATGA 
AGGCGATTAT CTTTTGGAAA 
TCAGCGGCTT GTACCTTTGG 
CTGCTGCCGT CAAAAGGCAr 
CACGTTTGGA ACTTGGGTGT 
GTATTGCTTG GGCGGGTATT 
CAGTTCCCTG CCGGTAAATG 
GCCGACCCAC GGCGAGGTAT 
GGGTTTTGGA GCTTACGCCT 
GACGGCATTA ACCCTGACGA 
TGCGCGGnGA AATCX5GTTTC 
GGCGAGGACG GCGTATGGAC 



GTGGTCA3TA CCATGCCGCG 
AATCCACAGC GATATGATGC 
CGGCAGCTTC ACTGACCATT 
TGGGTGAAAC GGCGCGGCAT 
GGCGCGTTCT TGGTGGCGGA 
CGTTGATTTT GCTGTTGTTC 
TGGGGCGGCA AGTTCX5TACA 
GGGTGTCGAA CCGAACCCCG 
TGAATGACGG CAAGGTTAAG 
ATGCCTGTTT CAGGGACGaC 
GCCGATGACA TTGGAAACCG 
AAAGGGCGTT ATCAGTTGAA 
TTTGTCGCAG GATTCTATGA 



ra218 .pep 
g218 

70 

m218 .pep 
g2ie 

m218 .pep 
9218 

m218 .pep 
g218 
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ATGGTCGCGG TCGATCCTTA TACGGCAAAA GTGGTCAGTA CCATGCCGCG 
CAATCAGGGT TGGTATTACG CGATGGATGA AATCCACAGC GATATGATGC 
TCGGTTCGAC AGGTGATTAT CTTTTGGAAA CGGCTGCATC GCTGACGATT 
ATCATGATAA TCAGCGGTTT GTACCTTTGG TGGGTGAAAC GGCGCGGCAT 
CAAGGCGATG CTGCTGCCGC CAAAAGGCAG GGCGCGTTCT TGGTGGCGGA 
ATCTGCACGG CGCGTTTGGA ACTTGGGTGT CGTTGATTTT ACTGTTGTTC 
TGCCTGTCGG GTATTGCTTG GGCAGGTATT TGGGGCGGCA AGTTCGTGCA 
GGCTTGGAGT CAGTTCCCGG CAGGCAAATG GGGTGTCGAA CCGAACCCTG 
TTTCAGTCGT GCCGACCCAC GGCGAGGTAT TGAATGACGG CAAGGTTAAG 
GAAGTGCCGT GGGTTTTGGA GCTTACGCCT ATGCCTGTTT CAGGGACGAC 
TGTGGGCAAA GACGGTATTA ACCCTGACGA GCCGATGACA TTGGAAACCG 
TCGACCGTTT TGCGCGG.GA AATCGGTTTC AAAGGGCGTT ATCAGCTGAA 
TTTGCCCAAA GGCGAGGACG G CG TAT GG AC TTTGTCGCAG GATTCTATGA 
GTTA 

This corresponds to the amino acid sequence <SEQ ID 774; ORF 218.a>: 

&2X8 .pep 

1 MVAVDPYTAK WSTMPRNQG WYYAMDEIHS DMMLGSTGDY LLETAASLTI 

51 IMIISGL YLW WVKRRGIKAM LLPPKGRARS WWRNLHGAFG TW VSLILLLF 

101 CLSGIAWAGI WGGKFVQAWS OFPAGKWGVE PNPVSWPTH GEVLNDGKVK 

151 EVPWVLELTP MPVSGTTVGK DGINPDEPMT LETVDRFARX KRFQRALSAE 

201 FAQRRGRRMD FVAGFYEL 



a218 .seq 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



m21 8/a2 18 95.9% identity in 2 1 8 aa overlap 

10 20 30 40 50 60 

m21 8 .pep MV AV D P YT AKVVS TMPRNQG W Y YTMDE I H S DMMLG AAGD YLLET AAS LT 1 1 MWS GL YLW 

I I I I I I I I It I I I t I I I I I I i I I : I I I I 1 1 I I I I I :: I t I I I I I I I I I I I 1 I :: I I I 1 I | 
a218 MVAVDPYTAKWSTMPRNQGWYYAMDEIHSDMMLGSTGDYLLETAASLTI IMIISGLYLW 

10 20 30 40 50 60 



70 80 90 100 110 120 

m21 8 . pep WV KRRG I KAML LPS KGX ARSWWRNLHGT FGTWVSL ILLLFCLSGI AW AG I WGG KFVQAWS 

1 1 I I I I I I t I I I I II I I I I I I t I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I ! | U | I 
a21 8 WV KRRG I KAML L P PKGRA PS WW RNL HG AFGTWV SLILLLFCLSGI AW AG I WGG KFVQAWS 

70 80 90 1U0 110 120 



130 140 150 160 170 180 

m21 8 . pep QFPAGKWGVEPNPVSWPTHGEVLNDGKVKEVPWVLELTPMPVSGTTVGKDGINPDEPMT 
I I I I I I I I t I I I I I I I I I I I I I I I I [ I I I I I I I I I I I I [ I I I I I I I | | | | | | | | | | | | | | 
a218 QFPAGKWGVEPNPVSWPTHGEVLNDGKVKEVPWVLELTPMPVSGTTVGKDGINPDEPMT 

130 140 150 160 170 180 



190 200 210 

m2 1 8 . pep LETVDRFARXNRFQRALSVEFAQRRGRRMDFVAGFYEL 
I M I M It I II II II it 1:1 III 1 1 If ill II I { I I | I 
a218 LETV DRFARXN RFQRAL S AE FAQRRGRRMD FVAGFYEL 

190 200 210 
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The following partial DNA sequence was identified in AT. gonorrhoeae <SEQ ID 775>: 

g219.eeq 

1 atgacggcaa ggttaaggaa gtgccgtgga ttttggagct tatgcctatg 

51 cctgtctcag ggacgactgt gggtgaaaac ggcattaacc ccaccgagcc 

101 caataacatt ggaaaccgtc gaccgtttcg cgcgggaaat cggtttcaaa 

151 gggcgttatc agttgaattt gcccaaaggc gaggacgggg tatggacttt 

201 gtcgcaggat tctatgagtt atgacatgat cagcccgttt gccgaccgca 

251 cggtacatat cgaccagtac agcggcgaga ttcttgccga catccgtttt 

301 gacgattaca acccgttcgg caaatttatg gcggcaagca ttgcgctgca 

351 tatg gggact ttgggctggt ggagcgtgtt ggcgaacgtc gtgttctgcc 

401 ttgccgtgat ttttatcggc atcagcggct gcgtgatgtg gtggaaacgc 

4 51 cgtccgtccg gcgtggcggg cactgttcct ccggcgcaaa aaatcaaact 

501 gcccgtctgg tgggcgatgg cattgccgct gctgttgatt gcactgcttt 

551 tcccgaccgc gctgcttgcc attgccgtga tttggctgtt ggataccttg 

601 ctgctgtcgc ggattcczgt gttgaggaaa tggtttaaat ga 

This corresponds to the amino acid sequence <SEQ ID 776; ORF 219.ng>: 

g219 .pep 

1 MTARLRKCRG FWSLCLCLSQ GRLWVKTALT PPSPITLETV DRFAREIGFK 

51 GRYQLNLPKG EDGVWTLSQD SMSYDMISPF ADRTVHIDQY SGEILADIRF 

101 DDYNPFGKFM AASIALHMGT LGWWSVLANV VFCLAVIFIG ISGCVM WWKR 

151 RPSGVAGIVP PAQKIKLPVW WAMALP LLLI ALLFPTALIA IAVI WLLDTL 

201 LLSRI PVLRK WFK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 777>: 

ra219 .seq 

1 ATGACGGCAA GGTTAAGGAA GTGCCGTGGG TTTTGGAGCT TACGCCTATG 

51 CCTGTTTCAG GGACGaCyGt gGGCAAAGAC GGCATTAACC CTGACGAGCC 

101 GATGACATTG GAAACCGTCG ACCGCTTTGC GCGGnGAAAT CGGTTTCAAA 

151 GGGCGTTATC AGTTGAATTT GCCCAAAGGC GAGGACGGCG TATGGACTTT 

201 GTCGCAGGAT TCTATGAGTT ACGACATGAT CAGCCCGTTT GCCGACCGCA 

251 CGGTACATAT CGACCAGTAC AGCGGCAAAA TCCTTGCCGA CATCCGTTTT 

301 GACGATTACA ACCCGTTCGG CAAATTTATG GCGGCAAGCA TTGCGCTGCA 

351 TATGGGGACT CTGGGCTGGT GGAGCGTGTT GGCGAACGTC TTGTTCTGCC 

401 TTGCCGTCAT TTTTATCGGT ATCAGCGGCT GCGTGATGTG GTGGAAACGC 

451 CGTCCGACCG GAGCGGTGGG CATCGTTCCG CCGG CGCAGA AAGTCAAGCT 

501 GCCGGTTTGG TGGATGATGG CATTGCCGCT ATTGGCAATC GCACTGCTCT 

551 TCCCGACCTC ACTGCTTGCC ATTGCCGTGA TTTGGCTGTT GGATACGCTG 

601 OTGTTGTCGC GGATTCCTGT TTTGAGGAGA TGGTTTAAAT GA 

This corresponds to the amino acid sequence <SEQ ID 778; ORF 219>: 
m219 .pep 

1 MTARLRKCRG FWSLRLCLFQ GRXWAKTALT LTSRXHWKPS TALRGEIGFK 

51 GRYQLNLPKG EDGVWTLSQD SMSYDMISPF ADRTVHIDQY SGKILADIRF 

101 DDYNPFGKFM AASIALHMGT LGWWSVLA NV LFCLAVIFIG ISGCVM WWKR 

151 RPTGAVGIVP PAQKVKLPVW WMMALP LLAI ALLFPTSLLA IAVI WLLDTL 

201 LLSRI PVLRR WFK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 219 shows 86.9% identity over a 213 aa overlap with a predicted ORF (ORF 219.ng) 
from N. gonorrhoeae: 

m219/g219 



10 20 30 40 50 60 

m219.pep MTARLRKCRGFWSLRLCLFQGRXWAKTALTLTSRXHWKPSTALRGEIGFKGRYQLNLPKG 

II II II I IN III I III III hi I || I | : : llll Ml II INI I I 

g 21 9 MTARLRKCRGFWSLCLCLSQ3RLWVKTALTPPS PITLETVDRFAREI GFKGRYQLNLPKG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m219 .pep EDGVWTLS QDS M S YDMI S P FADRTVH I DQ YS GKI LAD I RFDDYN P FG KFMAAS I ALHMGT 

iiiimiiiiiiiiiimiiiiimiiihiiiiiiiiiiiiiiiimiiMiiii 

g219 BDGVWTLSQDSMSYDMISPFADRTVHIDQYSGEILADIRFDDYNPFGKFMAASIALHMGT 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m219 . pep LGWWS VLANVL FCLAV I F I G I S GCVMWWKRRP TGAVG I VP P AQ KVKLP VWWMMALPL LAI 

lllllllllhll IIIIIMIIIIIfllllihI-lllllllhllll II III Ml I 
g219 LGWWS VLANW FCLAV I F I GI S GCVMWW KRRP S GVAG I VP P AQ KI KL P VWWAMALPLLLI 

130 140 150 160 170 180 

190 200 210 

m21 9 - pep ALLFPTSLLAI AVIWLLDTLLLSRI PVLRRWFKX 

I I I I I h I I I I I I I I I I II I I I I I I I I I h I I I 
g219 ALLFPTALLAIAVIWLLDTLLLSRIPVLRKWFK 
190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 779>: 

a219.seq 

1 ATGACGGCAA GGTTAAGGAA GTGCCGTGGG TTTTGGAGCT TACGCCTATG 

51 CCTGTTTCAG GGACGACTGT GGGCAAAGAC GGTATTAACC CTGACGAGCC 

101 GATGACATTG GAAACCGTCG ACCGTTTTGC GCGG.GAAAT CGGTTTCAAA 

151 GGGCGTTATC AGCTGAATTT GCCCAAAGGC GAGGACGGCG TATGGACTTT 

201 GTCGCAGGAT TCTATGAGTT ACGACATGAT CAGCCCGTTT GCTGACCGCA 

251 CGGTGCATAT CGACCAGTAC AGCGGCAAGA TTCTTGCCGA CATCCGTTTT 

301 GACGATTACA ACCCGTTCGG CAAATTTATG GCGGCAAGCA TTGCGCTGCA 

351 TATGGGGACT TTGGGCTGGT GGAGCGTGTT GGCGAACGTT TTGTTCTGCC 

4 01 TTGCCGTGAT TTTTATCGGC ATCAGCGGCT GCGTGATGTG GTGGAAACGC 

4 51 CGTCCGTCCG GCGCGGTGGG CATGGTTCCG CCGGCGCAAA AAATCAAGCT 

501 GCCCGTCTGG TGGGCAATGG CGGTGCCGCT GCTGCTGATT GCATTGCTTT 

551 TCCCGACCGC GTTGCTTGCC ATTGCCGTGA TTTGGCTGTT GGATACGCTG 

601 CTGTTGTCGC GGATTCCTGT TTTGAGGAGA TGGTTTAAAT GA 

This corresponds to the amino acid sequence <SEQ ID 780; ORF 219.a>: 

a219.pep 

1 MTARLRKCRG FWSLRLCLFQ GRLWAKTVLT LTSR*HWKPS TVLRXEIGFK 

51 GRYQLNLPKG EDGVWTLSQD SMSYDMISPF ADRTVHIDQY SGKILADIRF 

101 DDYNPFGKFM AAS I ALHMGT LGWWSVLAN V LFCLAVI FIG ISGCVM WWKR 

151 RPSGAVGMVP PAQKrKLPVW WAMAVP LLLI ALLFPTALLA IAVI WLLDTL 

201 LLSRIPVLRR WFK* 

m219/a219 94.8% identity in 213 aa overlap 

10 20 30 40 50 60 

m21 9 . pep MTARLRKCRG FW S LRLC L FQGRXW AKT ALT LT S RX H W K P S T ALRGE I G FKGR YQLN L PKG 

I I I I I I II I I I I I I I I I II I I I i I I I : I I I I I I II II I I I : I I I I I I I II I I I I I I I I 
a219 MTARLRKCRGFWSLRLCLFQGRLWAKTVLTLTSRXHWXPSTVLRXEIGFKGRYQLNLPKG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m219.pep EDGVWTLSQDSMSYDMISPFADRTVHIDQYSGKILADIRFDDYNPFGKFMAASIALHMGT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a219 EDGVWTLSQDSMSYDMISPFADRTVHIDQYSGKILADIRFDDYNPFGKFMAASIALHMGT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m219.pep LG WW S VLANVL FC LAV I FI G I S GCVMWW KRR PTG AVG I V P P AQKVKL P VWWMMAL PL LAI 

I I I I I I I I I I I I I I I I I I i I I I t I I I I I I II I : M I I : 1 I I I I I : I I I I I I I I : I I I I 
a219 LGWWSVLANVLFCLAVIFIGISGCVMWWKRRPSGAVGMVPPAQKIKLPVWWAMAVPLLLI 
130 140 150 160 170 180 

190 200 210 

m219.pep ALLFPTSLLAIAVIWLLDTLLLSRIPVLRRWFKX 
I I ! I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a21 9 ALLFPTALLAI AVI WLLDTLLLSRI PVIRRWFKX 

190 200 210 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 78 1>: 

g221.seq 

1 atgcacgacc acggcgccat ggatcgccgc ctccccgctt tcggaagtct 

51 gatgcggcga gccgtaaatc adatcgacgc tgacggattt gaaccctgcc 

101 tcacgggcgg catcgatgac ttctttggtt tcttcgtagc tttggatgcg 

151 gttgactgcc gcctgcactt tggggtcgaa atcctgaatg ccgacgctca 

201 tgcggttgaa gccgagtctg ccgagcatga ggacggtgtc gcggctgact 

251 ttgcgcgggt cgatttcgat ggaatattcg ccggacggta tcagttcgaa 

301 atgtttgcgg atcatgcgga agacacgttc gatctgttcg tcgctcaaaa 

351 aggtcggcgc gccgccgccg aagtgcagtt gggcaagctg gtgccgtccg 

401 ttcagatgtg gagcgagcag ttccatttct ttttcaagat attcgatgta 

451 ggtatcggcg cggcttttgt ctttggtgat gattttgttg cagccgcagt 

501 agtagcagat ggtgttgcaa aacggaatgt gaatgtaaag ggaaagcggt 

551 ttgtttaa 

This corresponds to the amino acid sequence <SEQ ID 782; ORF 221.ng>: 

g221 .pep 

1 MHDHGAMDRR LPAFGSLMRR AVNXIDADGF EPCLTGGIDD FFGFFVALDA 

51 VDCRLHFGVE ILNADAHAVE AESAEHEDGV AADFARVDFD GIFAGRYQFE 

101 MFADHAEDTF DLFVAQKGRR AAAEVQLGKL VPSVQMWSEQ FHFFFKIFDV 

151 GIGAAFVFGD DFVAAAWAD GVAKRNVNVK GKRFV* 

The following partial DNA sequence was identified in A. meningitidis <SEQ ID 783>: 

m221 . seq 

1 ATGGyGGTTT TGATGcwcmg AAGTCTGGTG CGGCAGGCCG TAAATCAAAT 

51 CGACGCTGAC GGATTTGAAC CCCGCTTCGC GCGCCGCATC GATGACTTCT 

101 TTGGTTTCTT CGTAACTTTG GATGCGGTTG ACCGCCGCCT GCACTTTGGG 

151 GTCGAAATCC TGAATGCCGA TGCTCATGCG GTTGAAGCCG AGTCTGCCGA 

201 GCATGAGGAC GGTGTCGCGG CTGACTTTGC GOGGGTCGAT TTCGATGGAG 

251 TATTCGCCGG TGGGGATTAA CTCGAAATGT TTGCGTATCA TGCGGAAGAC 

301 ACGTTCGATC TGTTCGTCGC TCAAAAAGGt GCGTGCcCCG CCGAAGTGCA 

351 GTTGGGCAAG CTGGTGCCGT CCGTTCAGAT GTGGAGCGAG CAGTTCCATT 

401 TCTTTTTCAA GATATTCGAT GTAGGCATCG GCGCGGCTTT TGTCTTTGGT 

451 GATGATTTTG TTGCAGCCGC AGTAGTAGCA GATGGTGTTG CAGAACGGAA 

501 TGTGAATGTA AAGGGAAAGC GGTTTGTTTA A 

This corresponds to the amino acid sequence <SEQ ID 784; ORF 22 1>: 

tn221 .pep 

1 MXVLMXRSLV RQAWQIDAD GFEPRFARRI DDFFGFFVTL DAVDRRLHFG 

51 VEILNADAHA VEAESAEHED GVAADFARVD FDGVFAGGDX LEMFAYHAED 

101 TFDLFVAQKG ACPAEVQLGK LVPSVQMWSE QFHFFFKIFD VGIGAAFVFG 

151 DDFVAAAW A DGVAERNVNV KGKRFV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 221 shows 87.6% identity over a 170 aa overlap with a predicted ORF (ORF 221.ng) 
from Af. gonorrhoeae: 

m221/g221 



10 20 30 40 50 

m221 . pep MXVLMXRS LVRQAVNQ I DADGFEPRFARR IDDFFGFFVTLDAVDRRLH FGVE 

Ihhlll llllllll lllllll 
g2 2 1 MHDHGAMDRRLPAFGSLMRRAVNXIDADGFEPCLTGGIDDFFGFFVALDAVDCRLHFG\''E 
10 20 30 40 50 60 



60 70 80 90 100 110 

m2 2 1 . pep ILNADAHAVEAES AEHEDGVAADFARVDFDGVFAGGDXLEMFAYHAEDTFDLFVAQKGA- 

IIIIMIIIIIIMIIIIIIIIIIIIIMIhlll Hill llllllllllllll 

g2 2 1 I LNADAHAVE AES AEHEDGVAAD FARVDFDG I FAGRYQFEMFADHAEDTFDLFVAQ KGRR 

70 80 90 100 110 120 



120 130 140 150 160 170 
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m221.pep CP AE VQ LGKLVP S VQMWS EQFHFFFKI FDVG I GAAFV FGDD FV AAA WADGVAERNVNV K 

I Mill II Mil III Ml 111 I III III I llllll II III I III I! II III :|l llll 

g221 AAAEVQLGKLVPS VQMWS EQFHFFFK I FDVG I GAAFV FGDDFVAAAWADGVAKRNVNVK 

130 140 150 160 170 180 



m221.pep GKRFVX 
MINI 

g221 GKRFVX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 785>: 
a221.seq 

1 ATGGTGGTTT TGATGCTCCG AAGTCTGGTG CGGCAGGCCG TAAATCAAAT 

51 CGACGCTGAC GGATTTGAAC CCCGCTTCGC GCGCCGCATC GATGACTTCT 

101 TTGGTTTCTT CGTAACTTTG GATGCGGTTG ACCGCCGCCT GCACTTTGGG 

151 GTCGAAATCC TGAATGCCGA TGCTCATGCG GTTGAAGCCG AGTCTGCCGA 

201 GCATGAGGAC GGTGTCGCGG CTGACTTTGC GCGGGTCGAT TTCGATGGAG 

251 TATTCGCCGG TGGGGATTAA CTCGAAATGT TTGCGTATCA TGCGGAAGAC 

301 ACGTTCGATT TGGTCGTCGC TCAAAAAGGT CGGCGTGCCG CCGCCGAAGT 

351 GCAGTTGGGC AAGCTGGTGC CGTCCGTTCA GATGTGGAGC GAGCAGTTCC 

401 ATTTCTTTTT CAAGAAATTC GATGTAGGCA TCGGCGCGGC TTTTGTCTTT 

451 GGTGATGATT TTGTTGCAGC CGCAGTAGTA GCAGATGGTG TTGCAGAACG 

501 GAATGTGAAT GTAAAGGGAA AGCGGTTTGT TTAA 

This corresponds to the amino acid sequence <SEQ ID 786; ORF 221.a>: 

a221 .pep 

1 MVVLMLRSLV RQAVNQI DAD GFEPRFARRI DDFFGFFVTL DAVDRRLHFG 

51 VEILNADAHA VEAESAEHED GVAADFARVD FDGVFAGGD* LEMFAYHAED 

101 TFDLWAQKG RRAAAEVQLG KLVPSVQMWS EQFHFFFKKF D VGIGAAFVF 

151 GDDFVAAAW ADGVAERNVN VKGKRFV* 

m221/a221 95.5% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m221 . pep MXVLMXRSLVRQAVNQ I DADGFEPRFARRI DDFFGFFVTLDAVDRRLH FGVEI LNADAHA 

I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 1 I I I 1 I I I I I I I I I I I I I I II 
a221 MVVLMLRSLVRQAVNQ I DADGFEPRFARRI DDFFGFFVTLDAVDRRLH FGVEI LNADAHA 

10 20 30 40 50 60 

70 80 90 100 110 119 

m221 . pep VEAESAEHEDGVAADFARVDFDGVFAGGDXLEMFAYHAEDTFDLFVAQKGA-CPAEVQLG 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I II I I I I I M II I I I I II I 
a221 VEAESAEHEDGVAADFARVDFDGVFAGGDXLEMFAYHAEDTFDLVVAQKGRRAAAEVQLG 

70 80 90 100 110 120 

120 130 140 150 160 170 

m221 . pep KLVPSVQMWSEQFHFFFKIFDVGIGAAFVFGDDFVAAAWADGVAERNVNVKGKRFVX 
I I I I I I t I M I ! I I I I I f ! I I I I I I I I I I I I i I I I I M I I I I I I I I I I II I I I I I I I 
a221 KLVPSVQMWSEQFHFFFKKFDVGIGAAFVFGDDFVAAAVVADGVAERKVNVKGKRFVX 
130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 787>: 
g223.seq 

1 atggaattca ggcaccaggt agtggtagtt ggtgtcgaac catttggtca 
51 tttcgatggc gaattggtct ttgttgccgc gcgccagttg gaagaattgt 
101 tccaaaggca ggttttggct atcgaagccg aaacgggcgg gaatcgcgcc 
151 cgtggatact tgcaggtcga ggatgtgatg gtagaaegtg aaatcacgta 
201 cagcaacgta atcagcgtta ggagcagctt ggtgtttcca gtttttctcg 
251 cgcaggtctt tggcaacgtc gagcagctct tgttcactga tctctttgcg 
301 ccagtatttt tcttgggcga atttcaattc acggaaggcg ccgacacgcg 
351 ggaagcctga 

This corresponds to the amino acid sequence <SEQ ID 788; ORF 223.ng>: 

g223.pep. . 
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1 MEFRHQWW GVEPFGHFDG ELVF\/AARQL EELFQRQVLA IEAETGGNRA 
51 RGYLQVEDVM VESEITYSN V ISVRSSLVFP VFLAQV FGNV EQLLFTDLFA 
101 PVFFLGEFQF TEGADTREA* " " 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 789>: 

m223.seq 

1 GTGGAATTCA GGCACCAAGT AGTGGTAGTT GGTGTCGAAC CATTTGGTCA 

51 TTTCGATAGC GAATTGGTCT TTGTTACCGC GCG CCAGTTG GAAGAATTGT 

101 TCCAAAGACA GGTTTTGGCT GTCGAAGCCG AAGCGGGCGG GAATCGCGCC 

151 GGTGGCGACT TGCAGGTCGA GGATGTGGTC GTAGAAAGTG AAATCsCTAC 

201 GGCAACGAAA TCGGCGTTGG CAGCGACCTG GTGTTTCCAG TTTTTCTCGC 

. 2 51 GCAAGTCTTT AGCAACAGCC AGCAATTCTT GCTCGCTGAT TTCTTTGCGC 

301 CAGTATTTTT CTTGTGCGAA TTTCAATTCG CGGAAGGCGC CGACACGCGG 

351 GAAGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 790; ORF 223>: 

m223.pep 

1 VEFRHQWW GVEPFGHFDS ELVFVTARQL EELFQRQVLA VEAEAGGNRA 

51 GGDLQVEDW VESEIXYGNE IGVGSDLVFP VFLAQV FSNS QQFLLADFFA 

101 PVFFLCEFQF AEGADTREA* ™ 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 223 shows 80.7% identity over a 1 19 aa overlap with a predicted ORF (ORF 223.ng) 
from N. gonorrhoeae: 

m223/g223 



10 2C 30 40 50 60 

m223.pep VE FRHQWWGVE PPGHFDSELVFVTARQLEELFQRQVLAVEAEAGGNRAGGDLQVEDW 

: I I I I I I I I II I I I I I I I I: I I II I :| I M I M I I! I I I I : I I I : I I I I I I I I I I I I : 
g223 ME FRHQWWGVE P FGH FDGE LVFVAARQLEE L FQRQV LA I EAETGGNRARG Y LQ VED VM 

10 2C 30 40 50 60 



70 80 90 100 110 120 

m223.pep VESEIXYGNEIGVGSDLVFPVFLAQVFSNSQQFLLADFFAPVFFLCEFQFAEGADTREAX 
I I I I I : I : I 1:1 I : I I I I I I I I I I I : I : I : I : : I : I I M I I I I I I I : I I I I II I I I 
g223 VESEITYSNVISVRSSLVFPVFLAQVFGNVEQLLFTDLFAPVFFLGEFQFTEGADTREAX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 791 >: 

a223.seq 

1 GTGGAATTCA GGCACCAAGT AGTGGTAGTT GGTGTCGAAC CATTTGGTCA 

51 TTTCGATAGC GAATTGGTCT TTGTTACCGC GCGCCAGTTG GAAGAATTGT 

.101 TCCAAAGATA GGTTTTGGCT GTCGAAGCCG AAGCGGGCGG GAATCGCGCC 

151 GGTGGCGACT TGCAGGTCGA GGATGTGGTC GTAGAAAGTG AAATCGCCTA 

201 CGGCAACGTA ATCGGCGTTG GCAGCGGCCT GGTGTTTCCA GTTTTTCTCG 

251 CGCAAGTCTT TAGCAACAGC CAGCAATTCT TGCTCGCTGA TTTCTTTGCG 

301 CCAGTATTTT TCTTGTGCGA ATTTCAATTC GCGGAAGGCA CCGACACGCG 

351 GGAAGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 792; ORF 223.a>: 
a223 .pep 

1 VEFRHQWW GVEPFGHFDS ELVFVTARQL EELFQR*VLA VEAEAGGNRA 

51 GGDI.QVEDW VESEIAYGN V IGVGSGLVFP VFLAQV FSNS QQFLLADFFA 

101 PVFFLCEFQF AEGTDTREA* 



m223/a223 95.8% identity in 1 1 9 aa overlap 

10 20 30 40 50 60 

m22 3 . pep VEFRHQVVWGVEPFGHFDSELVFVTARQLEELFQRQVLAVEAEAGGNRAGGDLQVEDVV*. 

> 1 I I I I I H I I I I I I I i I I M M I I I I I I I 1 I I 1 I I I I I 1 1 I I 1 I I 1 I I I I | I 

a 22 3 V E FRUQV V WGVE P FGHFD S EL V FVT ARQLEEL FQRX VLA VEAEAGGNRAGGDLQVEDW 

10 20 30 40 50 60 

70 80 90 100 110 120 

m22 3 . pep VESEIXYGNEIGVGSDLVFPVFLAQVFSNSQQFLIiADrFAPVFFLCEFQFAEGADTElEAX 
Hill HI Mill I I I I I I I I I I I I I I I I \ I | | I | | | | | | | | | | | | | ( | | : | | J [| 1 
a 2 2 3 VESE I A YGN V I GVG SG L VF P VFLAQVFS NS QQFLL AD FF AP V FFLC EFQFAEGTDT REAX 
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70 60 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 793>: 



g225 . seq 






1 


atggattctt 


ttttcaaacc 


51 


tgccgtccgc 


cccgcccttg 


101 


gcgagcagat 


tctcagacag 


151 


gtcaaccgag 


cccccgcccg 


201 


cggcgcgatg 


gggcttaacg 


251 


ccgcccggcg 


ggcgggcaat 


301 


cttttgggta 


ttgcctaccg 


351 


tgactgcagc 


ggattcargc 


401 


acctgccgcg 


cacgtcggcg 


451 


cgaagcgaat 


tgcagcccgg 


501 


cagccgcatt 


tcccatgtcg 


551 


acgcgccgcg 


cacggggaaa 


601 


tattggagcg 


gcaaatatgc 


651 


gtcacgcttt 


ctgaactga 



tttgccgaag acgaacagcc cgttttaccc 



ctaccggttt 



:atcgaaa tcaccagcict gagccacaaa 
icgcccgc cgggtcaaga aaaacgaccc 

This corresponds to the amino acid sequence <SEQ ID 794; ORF 225.ng>: 
g225.pep 

1 MDSFFKPAVW AVLWLMFAVR PALAD ELTNL LSSREQILRQ FAEDEQPVLP 

51 VNRAPARRAG NADELIGGAM GLNEQPWRV NRAXARRAGN ADKLIGSAMR 

101 LLGIAYRYGG TSVSTGFDCS GFMQHIFKRA MGINLPRTSA EQARMGAPVA 

151 RSELQPGDMV FFRTLGGSRI SHVGLYIGNN RFIHAPRTGK NIEITSLSHK 

201 YWSGKYAFAR RVKKNDPSRF LN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 795>: 

m225.Beq (partial) 

1 . . TTTTCAAACC CGGCAGTTTG GGCGGTTTTG TGGCTGAwGT TTGCCGTCCG 

51 CCCCGCCCTT GCCGACGAGT TGACCAACCT GCTCAGCAGC CGCGAGCAGA 

101 TTC TCAGACA GTTTGCCGAA GACGAACAGC CCGTTTTACC CATCAACCGA 

151 GCCCCCGCCC GGCGGGCGGG CAATGCCGAC GAACTCATCG GCAGCGCGAT 

201 GGGGCTTAAC GAACAGCCCG TTTTACCCGT CAACCGAGTC CCCGCCCGGC 

251 GGGCGGGCAA TGCCGACGAA CTCATCGGCA ACGCGATGGG GCTTAACGAA 

301 CAGCCCGTTT TACCCGTCAA CCGAGCCCCC GCCCGGCGGG CGGGCAATGC 

351 CGACGAACTC ATCGGCAACG CGATGGGACT TTTGGGTATT GCCTACCGCT 

401 ACGGCGGCAC ATCGGTTTCT ACCGGTTTTG ACTGCAGCGG CTTCATGCAG 

451 CACATCTTCA AACGCGCCAT GGGCATCAAC CTGCCGCGCA CGTCGGCAGA 

501 ACAGGCACGG ATGGGTACGC CGGTTGCCCG AAGCGAATTG CAGCCCGGAG 

551 ATATGGTGTT TTTCCGCACG CTCGGCGGCA GCCGCATTTC CCATGTCGGA 

601 CTTTATATCG GCAACAACCG CTTCATCCAC GCGCCGCGCA CGGGGAAAAA 

651 TATCGAAATC ACCAGCCTGA GCCACAAATA TTGGAGCGGC AAATACGCGT 

701 TCGCCCGCCG GGTCAAGAAA AACGACCCGT CCCGCTTTCT GAACTGA 

This corresponds to the amino acid sequence <SEQ ID 796; ORF 225>: 

m225.pep (partial) 

1 . . FSNPAVWAVL WLXFAVRPAL AD BLTNLLSS REQILRQFAE DEQPVLPINR 
51 APARRAGNAD BLIGSAMGLN EQPVLPVNRV PARRAGNADE LIGNAMGLNE 
101 QPVLPVNRAP ARRAGNADEL IGMAMGLLGI AYRYGGTSVS TGFDCSGFMQ 
151 HIFKRAMG1N LPRTSAEQAR MGTPVARSEL QPGDMVFFRT LGGSRISHVG 
201 LYIGNNRFIH APRTGKNIEI TSLSHKYWSG KYAFARRVKK NDPSRFLN* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from gonorrhoeae 

ORF 225 shows 83.5% identity over a 248 aa overlap with a predicted ORF (ORF 225 .ng) 
from N. gonorrhoeae: 
m22 5/g225 

10 20 30 40 50 

m225 .pep FSNPAVWAVLWLXFAVRPALADELTNLLSSREQ I LRQ FAEDEQPVLP I NRAPARRAG 

I IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIHIIIIIIII 
g225 MDSFFKPAWAVLWLMFAVRPALADELTNLLSSREQ I LRQ FAEDEQPVLP VNRAPARRAG 
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10 20 30 40 50 60 

60 70 80 90 100 110 

m225 .pep NADE LI GS AMGLNEQP VL PVNRVPARRAGNADE LI GNAMGLNEQ P V LPVNRAPARRAGNA 

IIIIIII HIIIIIMh mi IIIIIII 

g 2 2 5 NADE LI G GAMGLNEOPWRVNRAXARRAGKA 

70 80 90 

120 130 140 150 160 170 

m2 2 5 . pep DELIGNAMGLLGI AYRYGGTSVSTGFDCSGFMOHI FKRAMGINLPRTSAEQARMGTPVAR 

MINI II 111 II Ml III III II i II MUM IIIIIIIIIIMIIIIIIhllll 
g2 2 5 DKLIGSAMRLLGI AYRYGGTSVSTGFDCSGFMOHI F KRAMG I NL PRTS AEQARMGAP VAR 

100 110 120 130 140 150 

180 190 200 210 220 230 

m225 . pep SE LQ PGDMVF FRTLGGSRI SHVGL Y I GNNR FI HAPRTGKN I E I TS L SHK YWS GKYAFARR 

1 1 II MM M I Ml II IMIMM Ml III III II I Mill II MM II I II I II I M II 

g 2 2 5 SELQPGDMVFFRTLGGSRI SHVGLY I GNNRFI HAPRTGKN I E ITSLSHKYWSGKYAFARR 

160 170 180 190 200 210 

240 249 
m2 2 5 . pep VKKNDPSRFLNX 

I I II I I II Hi 
g225 VKKNDPSRFLN 
220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 797>: 

a225.seq 

1 ATGGATTCTT TT7TCAAACC GGCAGTTTGG GCGGTTTTGT GGCTGATGTT 

51 TGCCGTCCGC CCCGCCCTTG CCGACGAGTT GACCAACCTG CTCAGCAGCC 

101 GCGAGCAGAT TCTCAGACAG TTTGCCGAAG ACGAACAGCC CGTTTTACCC 

151 ATCAACCGAN CCCCCGCCCG GCGGGCGGGC AATGCCGACG AACTCATCGG 

201 CAGCGCGATG GGGCTTA^CG AACAGCCCGT TTTACCCGTC AACCGANTCC 

251 CCGCCCGGCG GGCGGGCAAT GCCGACNAAC TCATCGGCAA CGCGATGGGG 

301 CTTAACGAAC AGCCCGTTTT ACCCGTCAAC CGAGTCCCCG CCCGGCGGGC 

351 GGGCAATGCC GACGAACTCA TCGGCAACGC GATGGGGCTT AACGAACAGC 

401 CCGTTTTACC CGTCAACCGA GCCCCCGCCC GGCGGGCGGG CAATGCCGAC 

451 GAACTCATCG GCAACGCGAT GGGACTTTTG GGTATTGCCT ACCGCTACGG 

501 CGGCACATCG AT7TCTACCG GTTTTGACTG CAGCGGCTTC ATGCAGCACA 

551 TCTTCAAACG CGCCATGGGC ATCAACCTGC CGCGCACGTC GGCAGAACAG 

601 GCGCGGATGG GTACGCCGGT TGCCCGAAGC GAATTGCAGC CCGGGGATAT 

651 GGTGTNTTTC CGCACGCTCG GCGGCAGCCG CATTTCCCAT GTCGGACTTT 

701 ATATCGGCAA CAACCGCTTC ATCCACGCGC CGCGCACGGG GAAAAATATC 

751 GAAATCACCA GCCTGAGCCA CAAATATTGG AGCGGCAAAT ACGCGTTCGC 

801. CCGCCGGGTC AAGAAAA^CG ACCCGTCCCG CTTTCTGAAC TGA 

This corresponds to the amino acid sequence <SEQ ID 798; ORF 225. a>: 
a225.pep 

1 MDSFFKPAVW AVLWLMFAVR PALA DELTNL LSSREQILRQ FAEDEQPVLP 

51 INRXPARRAG NADELIGSAM GLNEQPVLPV NRXPARRAGN ADXLIGNAMG 

101 LNEQPVLPVN RVPARRAGNA CELIGNAMGL NEQPVLPVNR A PARRAGN AD 

151 ELIGNAMGLL GIAYRYGGTS ISTGFDCSGF MQHIFKRAMG INLFRTSAEQ 

201 ARMGTPVARS ELQPGDMVXF RTLGGSRISH VGLYIGNNRF I HAPRTGKN I 

251 EITSLSHKYW SGKYAFARRV KKNDPSRFLN * 

m225/a225 87.4% identity in 277 aa overlap 

10 20 30 40 50 

m225 . pep FSNPAVWAVLWLXFAVRPALADELTNLLSSREQILROFAEDEQPVLPINRAPARRAG 
I : I I I I I I I II I I I I i I I I I I I I I I I > I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a225 MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRXPARRAG 
10 20 30 40 50 60 

60 70 79 80 
m225. pep NADELIGSAMGLNEQPVLPVNR V PARRAGN A 
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Mill! IMIilMllllllil MM Mill 

a225 NADELIGSAMGLNECPVLPVNRXPAREIAGNADXLIGNAMGLNEQPVLPVNRVPARRAGNA 
70 80 90 100 110 120 

90 100 110 120 130 140 

m225 . pep DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I t I I M I I I I I I I I I | I : | I | | | | | | | 
a225 DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAKGLLGIAYRYGGTSISTGFDCSGF 
130 140 • 150 160 170 180 

150 160 170 180 190 200 

m225.pep MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 

inn ii m m 1 1 1 1 1 m m 1 1 m 1 1 i m 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 

a22S MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVXFRTLGGSRISHVGLYIGNNRF 
190 200 210 220 230 240 

210 220 230 240 249 

m225 . pep IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 
I I I I Ml I I I I I Ml I III I I I I I I I Ml Mi I III III I I 
a225 IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 
250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 799>: 

g225-l.seq 

1 atggattctt ttttcaaacc ggcagtttgg gcggttttgt ggctgatgtt 

51 tgccgtccgc cccgcccttg ccgacgagtt gaccaacctg ctcagcagcc 

101 gcgagcagat tctcagacag tttgccgaag acgaacagcc cgttttaccc 

151 gtcaaccgag cccccgcccg gcgggcgggc aatgccgacg aactcatcgg 

201 cggcgcgatg gggcttaacg aacagcccgt tgtacgcgtc aaccgagccn 

251 ccgcccggcg ggcgggcaat gccgacaaac tcatcggcag cgcgatgcgg 

301 cttttgggta ttgcctaccg ctacggcggc acatcggtgt ctaccggttt 

351 tgactgcagc ggattcatgc agcacatctt caaacgcgcc atgggcatca 

401 acctgccgcg cacgtcggcg gaacaggcgc ggatgggcgc acccgttgcc 

451 cgaagcgaat tgcagcccgg ggatatggtg tttttccgca cgctcggcgg 

501 cagccgcatt tcccatgtcg gactttatat cggcaacaac cgcttcatcc 

551 acgcgccgcg cacggggaaa aatatcgaaa tcaccagcct gagccacaaa 

601 tattggagcg gcaaatatgc gttcgcccgc cgggtcaaga aaaacgaccc 

651 gtcacgcttt ctgaactga 

This corresponds to the amino acid sequence <SEQ ID 800; ORF 225-1 .ng>: 

g225-l.pep 

1 MDSFFKPAVW AVLWLMFAVR PALAD BLTNL LSSREQILRQ FAEDEQPVLP 

51 VNRAPARRAG NADELIGGAM GLNEQPWRV NRAXARRAGN ADKLIGSAMR 

101 LLGIAYRYGG TSVSTGFDCS GFMQHIFKRA MGINLPRTSA EQARMGAPVA 

151 RSELQPGDMV FFRTLGGSRI SHVGLYIGNN RFIHAPRTGK NIEITSLSHK 

201 YWSGKYAFAR RVKKNDPSRF LN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 801 >: 

m225-l.seq 

1 ATGGATTCTT TTTTCAAACC GGCAGTTTGG GCGGTTTTGT GGCTGATGTT 

51 TGCCGTCCGC CCCGCCCTTG CCGACGAGTT GACCAACyTG CTCAGCAGCC 

101 GCGAGCAGAT TCTCAGACAG TTTGCCGAAG ACGAACAGCC CGTTTTACCC 

151 ATCAACCGAG CCCCCGCCCG GCGGGCGGGC AATGCCGACG AACTCATCGG 

201 CAGCGCGATG GGGCTTAACG AACAGCCCGT TTTACCCGTC AACCGAGTCC 

251 CCGCCCGGCG GGCGGGCAAT GCCGACGAAC TCATCGGCAA CGCGATGGGG 

301 CTTAACGAAC AGCCCGTTTT ACCCGTCAAC CGAGCCCCCG CCCGGCGGGC 

351 GGGCAATGCC GACGAACTCA TCGGCAACGC GATGGGACTT TTGGGTATTG 

401 CCTACCGCTA CGGCGGCACA TCGGTTTCTA CCGGTTTTGA CTGCAGCGGC 

451 TTCATGCAGC ACATCTTCAA ACGCGCCATG GGCATCAACC TGCCGCGCAC 

501 GTCGGCAGAA CAGGCACGGA TGGGTACGCC GGTTGCCCGA AGCGAATTGC 

551 AGCCCGGAGA TATGGTGTTT TTCCGCACGC TCGGCGGCAG CCGCATTTCC 

601 CATGTCGGAC TTTATATCGG CAACAACCGC TTCATCCACG CGCCGCGCAC 

651 GGGGAAAAAT ATCGAAATCA CCAGCCTGAG CCACAAATAT TGGAGCGGCA 

701 AATACGCGTT CGCCCGCCGG GTCAAGAAAA ACGACCCGTC CCGCTTTCTG 

751 AACTGA 
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This corresponds to the amino acid sequence <SEQ ID 802; ORF 21 7>: 

m225-l.pep 

1 MDSFFKPAVW AVLWLMFAVR PALADELTNL LSSREQILRQ FAEDEQPVLP 
51 INRAPARRAG NAD EL I GS AM GLNEQPVLPV NRVPARRAGN ADELIGNAMG 
101 LNEQPVLPVN RAPARRAGNA DELIGNANGL LGIAYRYGGT SVSTGFDCSG 
151 FMQHI FKRAM GINLPRTSAE QARMGTPVAR SELQPGDMVF FRTIjGGSRIS 
201 HVGLYIGNNR FIHAPRTGKN IEITSLSHKY WSGKYAFARR VKKNDPSRFL 
251 N* 

m225-l/g225-l 84.9% identity in 251 aa overlap 

10 20 30 40 50 60 

m225-l . pep MDSFFKPAWAVLWLMFAWPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

II llll IMii II II II I lllilllllllll III II II lllllllll I Ihl II llllll 

g2 2 5 - 1 MDS FFKPAVWAVLWLMF AVRPALADELTNLLS S REQ I LRQFAEDEQP VLPVNRAPARRAG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m225-l. pep NAD EL I G S AMG LN EQ P VL P VNR V P ARRAGNAD ELI GN AMGLNEQ P VL P VNRA P ARRAGNA 

lit! Ilhllllllllh llll 1 1 1 1 1 1 1 

g225-l NADE LIGGANGLNEQPVVRVNRAXARRAGNA 

70 80 90 

130 140 150 160 170 180 

m22 5 - 1 . pep DELIGNAMGLLGI AYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

hiihii i e i r J f 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 r t i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 - 1 1 1 1 

g2 2 5 - 1 D KLIG SAMRLLGI AYRYGGTS VSTGFDCSGFMQH I FKRAMG INLPRT 3 AEQARMG APVAR 

100 110 120 130 140 150 

190 200 210 220 230 240 

m22 5 - 1 . pep SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNI E I TS LS HKYWSG KYAFARR 

llllll llllllllllltlMIIIMIIIIIIIIIillll llllllllllll llilllll 

g225 - 1 SELQPGDMVTFRTIGGSRISIIVGLYIGNNRFIHAPRTGKNIEITSLSHK^/WSGKYAFARR 
ISO 170 180 190 200 210 

250 

m225-l.pep VKKNDPSRFLNX 

llllllllllll 
g225-l VKKNDPSRFLNX 
220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 803>: 

a225-l.seq 

1 ATGGATTCTT TTTTCAAACC GGCAGTTTGG GCGGTTTTGT GGCTGATGTT 

51 TGCCGTCCGC CCCGCCCTTG CCGACGAGTT GACCAACCTG CTCAGCAGCC 

101 GCGAGCAGAT TCTCAGACAG TTTGCCGAAG ACGAACAGCC CGTTTTACCC 

151 ATCAACCGAN CCCCCGCCCG GCGGGCGGGC AATGCCGACG AACTCATCGG 

201 CAGCGCGATG GGGCTTAACG AACAGCCCGT TTTACCCGTC AACCGANTCC 

251 CCGCCCGGCG GGCGGGCAAT GCCGACNAAC TCATCGGCAA CGCGATGGGG 

301 CTTAACGAAC AGCCCGTTTT ACCCGTCAAC CGAGTCCCCG CCCGGCGGGC 

351 GGGCAATGCC GACGAACTCA TCGGCAACGC GATGGGGCTT AACGAACAGC 

401 CCGTTTTACC CGTCAACCGA GCCCCCGCCC GGCGGGCGGG CAATGCCGAC 

451 GAACTCATCG GCAACGCGAT GGGACTTTTG GGTATTGCCT ACCGCTACGG 

501 CGGCACATCG ATTTCTACCG GTTTTGACTG CAGCGGCTTC ATGCAGCACA 

551 TCTTCAAACG CGCCATGGGC ATCAACCTGC CGCGCACGTC GGCAGAACAG 

601 GCGCGGATGG GTACGCCGGT TGCCCGAAGC GAATTGCAGC CCGGGGATAT 

651 GGTGTNTTTC CGCACGCTCG GCGGCAGCCG CATTTCCCAT GTCGGACTTT 

701 ATATCGGCAA CAACCGCTTC ATCCACGCGC CGCGCACGGG GAAAAATATC 

751 GAAATCACCA GCCTGAGCCA CAAATATTGG AGCGGCAAAT ACGCGTTCGC 

801 CCGCCGGGTC AAGAAAAACG ACCCG?CCCG CTTTCTGAAC TGA 

This corresponds to the amino acid sequence <SEQ ID 804; ORF 225-1. a>: 

a225-l.pep 

1 MDSFFKPAVW AVLWLMFAVR PALA DELTNL LSSREQILRQ FAEDEQPVLP 

51 INRXPARRAG NADELIGSAM GLNEQPVLPV NRXPARRAGN ADXLIGNAMG 

101 LNEQPVLPVN RVPARRAGNA DELIG11AMGL NEQPVLPVNR AP ARRAGNAD 

151 ELIGNAMGLL GIAYRYGGTS ISTGFDCSGF MQH I FKRAMG INLPRTSAEQ 

201 ARMGTPVARS ELQPGDMVXF RTLGGSRISH VGLYIGNNRF IHAPRTGKNI 

251 EITSLSHKYW SGKYAFARRV KKNDPSRFLN * 
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a225-l/m225-l 88.6% identity in 280 aa overlap 

10 20 30 40 50 60 

a22 5 - 1 . pep MD SF FKPAVWAVLWIWPAVRP AIADELTNLLS SREQ I LRQFAEDEQ PVL PINRX PARRAG 

I 1 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 i I ( 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 ! 1 1 1 1 i [ 1 1 illlll 

m225- 1 MD S F FKPAVWAVLWLM FAVRP ALADE LTNLLS S REQ I LRQFAEDEQP VI t P INR A PARRAG 

10 20 30 40 50 60 

70 80 90 100 110 120 

a2 2 5 - 1 . pep NADELI GS AMGLNEQP VLPVNRXP ARRAGNADXL I GNAMGLNEQP VTj PVNR VPARRAGNA 

llllllllllllllll lllllllllllllll 

m2 2 5 - 1 NADELI GSAMGLNEQP VLPVNTRVPARRAGNA 

70 80 90 

130 140 150 160 170 180 

a2 2 5 - 1 . pep D E L IG AMG LNEQP VL P VNRAP ARRAGNAD EL I GNAMG LLG I AYRYGGTS ISTGFDCSGF 

mill iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihiiniiiii 

m2 2 5 - 1 DEL IGNAMGLMEQPVLPVNRAP ARRAGNADE L I GNAMGLLG I AYR YGGTSVSTG FDCSG F 

100 110 120 130 140 150 

190 200 210 220 230 240 

a2 25 - 1 . pep MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVXFRTLGGSRISHVGLYIGNNRF 

Illlll 1 1 Nil 1 1 II 1 1 III I MM [MINI !l II llllllllillllllllllll 

m225 -1 MQHIFKRAMGINLPRTSAEOARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
160 170 180 190 200 210 

250 260 270 280 

a225-l . pep IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 

II II II II1MIM Mllllli IMMMIIMIIIIIIII 

m225 - 1 IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 
220 230 240 250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 805>: 

g226 . seg 

1 ATGAGCGAAA TCCTCAGGCA GCCCAGCGTT CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGTGCGCAC GCGCACGGGC AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAAT TCATTGATTT 

201 TCGGCTGAAA cccGccgtCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

251 GCCGTAAAAT CTTCAACCAG TGGCTGCCCG TCATCGTTTC GCAGCTTGCG 

301 GGCAGCGTTA cggGCATTGT tacggggATG TATTTTgccg cttggctcgg 

3 51 gccggatacc caattctcct tcccgcctcg tcttcaatat ctgttattta 

4 01 caccctctgg aatcccaatt cacaccctgt atgcgcgggt tctcccgcca 
4 51 tttctgttgc ctccgcctct cctgccgcgc ctcggcccgc atacattgcg 
501 ccggttcaca atacttccaa aaaaactacg gccgtttaag cccctcctcc 
551 cagttgtggt cctttctcct Ccgggcctcg cccctcccct cttataa 

This corresponds to the amino acid sequence <SEQ ID 806; ORF 226.ng>: 
g226 .pep 

1 MSEILRQPSV LLFLTLAVYA LA IIVRTRTG NIFCNP VLVS TIVLIAYLKI 

51 LGI DYAVYHN AAQFIDFRLK PAVWLAVPL YQNRRKIFNQ WLPVIVSQLA 

101 GSVTGIVTGM YFAAWLGPDT QFSFPPRLQY LLFTPSGIPI HTLYARVLPP 

151 FLLPPPLLPR LGPHTLRRFT ILPKKLRPFK P LLPWVLSP PGLAPPLL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 807>: 

m226.seq 

1 ATGAACGAAA TCCTCAGGCA GCCCAGCGTT CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGtGCGCAC GCGCACGGGC AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAAT TCATTGATTT 

2 01 TTGGCTGAAA CCCGCCGTCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

251 GCCGTAAAAT CTTCAACCAG TGGCTGCCCG TCATCGTTTC ACAGCTTGCG 

301 GGCAGCGTTA CGGGCATTGT TACAGGGATG TATTTTGCCA AATGGCTGGG 

351 CGCGGAACGC GAAGTCGTCC TCTCGCTCGC GTCCAAATCT GTTACCAACC 

4 01 CCATCG CT AT TGAAATCACC CGCTCCATCG GCGGCATTCC CGCCATTACC 

4 51 GCCGCCACCG TCATCATTGC CGGTCTGGTC GGACAGATTG CCGGTTACAA 
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501 AATGCTGAAG AACACGGTCG TCATGCCCTC GTCCGTGGGT ATGTCGCTCG 
551 GCACGGCTTC GCACGCGATG GGGATTGCCG CCTCGCTCGA ACGCAGCCGC 
601 CGTATGGCGG CATACGCGGG GCTGGGGCTG ACGTTCAACG GCGTACTGAC 
651 CGCGCTGATT GCGCCGCTGC TCATCCCCGT TTTGGGATTT TGA 

This corresponds to the amino acid sequence <SEQ ID 808; ORF 226>: 

m226 .pep 

1 MNEILRQPSV LLFLTLAVYA L AIIVRTRTG NIFCNP VLVS TIVLIAYLKI 

51 LGIDYAVYHN AAQFIDFWLK PAWVLAVPL YQNRRKIFNQ WLPVIVSQLA 

101 GSVTGIVTGM YFAKWLGAER EWLSLASKS VTNPIAIEIT RSIGGIPAIT 

151 AATVI I AGLV GQIAGYKMLK NTWMPSSVG MSLGTASHAM GIAASLERSR 

201 RMAAYAGLGL TFNGVLTALI APLLI PVLGF * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 226 shows 94.2% identity over a 121 aa overlap with a predicted ORF (ORF 226.ng) 
from N. gonorrhoeae: 

m226/g226 

10 20 30 40 50 60 

MNEILRQPSVLLFLTLAVYALAIIVRTRTGNIFCNPVLVSTIVLIAYLKI LGIDYAVYHN 

|:||||IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIi!llllll!llllll 
MSEILRQPSVLLFLTLAVYALAIIVRTRTGNI FCNPVLVST I VLIAYLKI LGIDYAVYHN 
10 20 30 40 50 60 

70 80 90 100 110 120 

AAQFI DFWLKPAVWLAVPLYQNRRKI FNQWLPVI VSQLAGS VTG I VTGMYFAKWLGAER 

lllllll HUM IIMII li M ill III Mill II lillillll HUM III : 
AAQ FI D FRLKP AVWLAVPL YQNRRKI FNQWL PVI VSQLAGS VTG I VTGMY FAAWLGPDT 
70 80 90 100 110 120 

130 140 150 160 170 180 

EWLSLASKSVTNPIAIEITRS IGGIPAITAATVI IAGLVGQIAGYKMLKNTWMPSSVG 

QFS FP PRLQ Y LLFT P S G I P IHTL Y ARVLP P FLL PP PLLPRLGPHTLRRFTI L PKKLRP FK 
130 140 150 160 170 180 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 809>: 

a226.seq 

1 ATGAACGAAA TCCTCAGGCA GCCGAGCATC CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGTGCGCAC GCGCACGGGT AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAGT TTATCGATTT 

201 CTGGCTCAAG CCCGCCGTCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

251 GCCGTAAAAT CTTCAACCAA TGGCTGCCCG TCATCGTTTC GCAGCTTGCG 

301 GGCAGCGTTA CGGGCATTGT TACGGGGATG TATTTTGCCA AATGGCTGGG 

351 CGCGGAACGC GAAGTCGTCC TCTCGCTCGC GTCCAAATCT GTTACCAATC 

4 01 CTATCGCCAT CGAAATCACC CGCTCCATCG GCGGCATTCC CGCCATTACC 

4 51 GCCGCCACCG TCATCATTGC CGGCCTGGTC GGACAGATTG CCGGTTACAA 

501 AATGTTGAAA AACACGGTCG TTATGCCCTC ATCTGTCGGA ATGTCGCTCG 

551 GCACGGCTTC GCACGCGATG GGCATTGCCG CCTCGCTCGA ACGCAGCCGC 

601 CGCATGGCCG CATACGCGGG GCTGGGGCTG ACGTTCAACG GCGTACTGAC 

651 CGCGCTGATT GCGCCGCTGC TTATCCCCGT TTTGGGATTT TGA 

This corresponds to the amino acid sequence <SEQ ED 810; ORF 226.a>: 

a226.pep 

1 MNEILRQPSI LLFLTLAVYA LA IIVRTRTG NIFCNP VLVS TIVLIAYLKI 

51 LGIDYAVYHN AA QFIDFWLK PAVWLAVPL YQNRRKIFNQ WLPVIVSQLA 

101 GSVTGIVTGM YFAKWLGAER EWLSLASKS VTNPIAIEIT RS IGGIPAIT 

151 AATVI I AGL V GQIAGYKMLK NTWMPSSVG MSLGTASHAM GIAASLERSR 

201 RMAAYAGLGL TFNGVLTALI APLLI PVLGF * 



m226 .pep 
g226 

m226 .pep 
g226 

m226 .pep 
g226 



m226/a226 99.6% identity in 230 aa overlap 

10 20 30 40 50 60 
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m226.pep MNEILRQPSVLLFLTLAVYALAIIVRTRTGNIFCN PVLVSTIVLIAYLKILGIDYAVYKN 

I I I I I [ I I I: ! I I II I I I I I I I I I 1 1 I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
a226 MNE I LRQPS I LLFLTLAVYALAI I VRTRTGN I FCN PVLVST I VLI AYLKIL3 1 DYAVYKN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m226.pep AAQFIDFWLKPAVWLAVPLYQNRRKIFNQWLPVIVSQLAGSVTGIVTGMYFAKWLGAER 
I I I I I 1 1 I I I ! 1 1 I I I 1 1 I I i II I I I I I I I I I I I I I I I I t I I I I I I 1 1 1 I 1 1 M 1 I I I I I 
a226 AAQFIDFWLKPAVWLAVPLYQNRRKIFNQWLPVIVSQLAGSVTGIVTGMYFAKWLGAER 

70 80 90 100 110 120 

130 140 150 160 170 180 

m226.pep EWLSLASKSVTNPIAIEITRSIGGIPAITAATVIIAGLVGQIAGYKMLKNTWMPSSVG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
a 2 2 6 EVVLS LASKSVTN PI AIE I TRS IGG I PAI TAATVI I AGL VGQI AG YKMLKNT WMPS SVG 

130 140 150 160 170 180 

190 200 210 220 230 

m22 6.pep MSLGTASHAMGIAASLERSRRMAAYAGLGLTFNGVLTALIAPLLIPVLGFX 
I ! I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I 
a226 MSLGTASH7^MGIAASLERSRRMAAYAGLGLTFNGVLTALIAPLLIPVLGFX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 81 1>: 

g227 . seq 

1 atgaacatca tccgcgcgct cctcatcatc ctcggctgcc tcgccgccgg 

51 cgaaaccgcc gttttcctag caggcatcaa actgcccggc agcatcgtcg 

101 gcatgggcgt gctgtttgcg cttttgcagg cgggttggct caaaacgtct 

151 tggctgcaac agcttaccga cgcgctgatg gcaaacctga cgctgttcct 

201 cgtgccgccc tgcgtggcgg tcatcagcta tttggatttg attgccgacg 

251 attggttttc gatactggtt tccgcctccg ccagcacttt gtgcgtactg 

301 ctggttacgg gcaaggttca ccgctggata cggagcatta tctga 

This corresponds to the amino acid sequence <SEQ ID 812; ORF 227.ng>: 

g227.pep 

1 MNIIRALLII LGCLAAGETA VFLAGIKLPG SIVGMGVLFA LLQAGWLKT S 

51 WLQQLTD ALM ANLTLFLVPP CVAVI SYLDL IADDW FSILV SASASTLCVL 

101 LVTGKVHRWI RSI I* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 813>: 

m227.seq (partial) 

1 ..ACGTCTTkGC TGCAACAGCT TACCGACGCG CTGATGTCGA ACCTGACGCT 

51 GTtCCTCGTG CCgCC.TGCG TGGCGGTCAT CAGCTATTTG GATTTGATTG 

101 CCGACGATTG GTTTTCGATA CTGGTTTCCG CCTCCGCCAG cACTTTGTGC 

151 GTACTGCTGG TTACGGGCAA AGTCCACCGG TGGATACGGG GTATTATCCG 

201 ATGA 

This corresponds to the amino acid sequence <SEQ ID 814; ORF 227>: 

m227.pep (partial) 

1 . . TSXLQQLT DA LMSNLTLFLV PPCVAVI SYL DLIADDW FSI LVSASASTLC 
51 VLLV TGKVHR WIRGIIR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 227 shows 95.5% identity over a 66 aa overlap with a predicted ORF (ORF 227.ng) 
from N. gonorrhoeae: 

m227/g227 

10 20 30 

m227 . pep TSXLQQLTDALMSNLTLFLVPPCVAVISYL 

II I I I I I I I I I : I I I II I II I I I I I I I I I 
g227 TAVFLAGIKLPGSIVGMGVLFALLQAGWLKTSWLQQLTDALMANLTLFLVPPCVAVISYL 
20 30 40 50 60 70 

40 50 60 
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m227 . pep DLI ADDWFS I LVSASASTLCVLLVTGKVHRWI RGI I RX 

I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I : I I I 
g2 2 7 DLI ADDWFS I LVSASASTLCVLLVTGKVHRWIRSI I X 

80 90 100 110 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 815>: 

a227.seq 

1 ATGAACATCA TCCGCGCGCT CCTCATCATC CTCGGCTGCC TCGCCACCGG 

51 CGAAACCGCC GTTTTCCTAG CAGGCATCAA ACTGCCCGGC AGCATCGTCG 

101 GCATGGGCGT ACTGTTTGCG CTTTTGCAGG CGGGTTGGGT CAAAACGTCT 

151 TGGCTGCAAC AGCTTACCGA CGCGCTGATG GCGAATCTGA CGTTGTTTCT 

20.1 CGTGCCGCCC TGCGTGGCGG TCATCAGCTA TTTGGATTTG ATTGCCGACG 

251 ATTGGTTTTC GATACTGGTT TCCGCCTCCG CCAGCACTTT GTGCGTACTG 

301 CTGGTTACAG GCAAGGTTCA CCGCTGGATA CGGAGCATTA TCTGA 

This corresponds to the amino acid sequence <SEQ ID 816; ORF 227.a>: 

a227.pep 

1 MNIIRALLII LGCLATGETA VFLAGIKLPG SIVGMGVLFA LLQAGWVKT S 

51 WLQQLTD ALM ANLTLFLVPP CVAVI SYLDL 1ADDW FS1LV SASA5TLCVL 

101 LVTGKVHRWI RSII* 

m227/a227 95.5% identity in 66 aa overlap 

10 20 30 

m227 . pep TSXLQQLTDALMSNLTLFLVPPCVAVISYL 

II I I i I I I 1 I I : I I I I I I I I I I I I I I I I I 
a227 TAVFLAGIKLPGSIVGMGVLFALLQAGWVKTSWLQQLTDALMANLTLFLVPPCVAVISYL 
20 30 40 50 60 70 

40 50 60 

m227 .pep DLIADDWFSILVSASASTLCVLLVTGKVHRWIRGIIRX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I 
a 2 2 7 DLTADDW FS I LVS AS ASTLCVLLVTGKVHRWI RSI IX 

80 90 100 110 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 817>: 

m228 . seq 

1 ATGAAAAAAT TATTGATTGC CGCAATGATG GCGGCTGCCT TGGCAGCTTG 

51 TTCGCAAGAA GCCAAACAGG AGGTTAAGGA AGCGGTTCAA GCCGTTGAGT 

101 CCGATGTTAA AGACACTGCG GCTTCTGCCG CCGAGTCTGC CGCTTC7TGCC 

151 GTCGAAGAAG CGAAAGACCA AGTCAAAGAT GCTGCGGCTG ATGCAAAGGC 

201 AAGTGCCGAG GAAGCTGTAA CTGAAGCCAA AGAAGCTGTA ACTGAAGCAG 

251 CTAAAGATAC TTTGAACAAA GCTGCCGACG CGACTCAGGA AGCGGCAGAC 

301 AAAATGAAAG ATGCCGCCAA ATAA 

This corresponds to the amino acid sequence <SEQ ID 818; ORF 228>: 

m228.pep 

1 MKKLLIAAMM AAALAACSQS AKQEVKEAVQ AVE SDVKDT A ASAAESAASA 
51 VEEAKDQVKD AAADAKASAE EAVTEAKEAV TEAAKDTLNK AADATQEAAD 
101 KMKDAAK* 

Computer analysis of this amino acid sequence gave the following results: 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 819>: 

a228.seq 

1 ATGAAAAAAT TATTGATTGC CGCAATGATG GCGGCTGCCT TGGCAGCTTG 

51 TTCGCAAGAA GCCAAACAGG AGGTTAAGGA AGCGGTTCAA GCCGTTGAGT 

101 CCGATGTTAA AGACACTGCG GCTTCTGCCG CCGAGTCTGC CGCTTCTGCC 

151 GTCGAAGAAG CGAAAGACCA AGTCAAAGAT GCTGCGGCTG ATGCAAAGGC 

201 AAGTGCCGAG GAAGCTGTAA CTGAAGCCAA AGAAGCTGTA ACTGAAGCAG 

251 CTAAAGATAC TTTGAACAAA GCTGCCGACG CGACTCAGGA AGCGGCAGAC 

301 AAAATGAAAG ATCCCGCCAA ATAA 

This corresponds to the amino acid sequence <SEQ ID 820; ORF 228.a>: 
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a228.pep 

1 MKKI.LIAAMM AAALAA CSQE AKQEVKEAVQ AVESDVKDTA ASAAESAASA 

51 VEEAKDQVKD AAADAKASAE EAVTEAKEAV TEAAKDTLNK AADATQEAAD 

101 KMKDAAK* 

m228/a228 100.0% identity in 107 aa overlap 

10 20 30 40 50 60 

m228 . pep MKK LL I AAMMAAALAAC S QE AKQEVKEAVQ AV E S D VK DT AAS AAE S AAS AVE EAK DQVK D 

II t I I I 1 1 I 1 1 1 1 I I I I 1 1 1 1 I 1 1 I M I 1 1 1 1 1 1 I I II I I I 1 1 I I I I I I ! I I I I I 1 1 I 1 1 
a228 MKKLLI AAMMAAALAAC SQE AKQE VKEAVQAVE S DVKDTAAS AAE S AAS AVEEAK DQVK D 

10 2C 30 40 50 60 

70 80 90 100 

m2 2 8 . pep AAADAKASAEEAVTEAKEAVTEAAKDTLNKAADATQEAADKMKDAAKX 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I ! I I I I 
a2 2 8 AAADAKASAEEAVTEAKEAVTEAAKDTLNKAADATQEAADKMKDAAKX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 82 1>: 

g229 . seq 

l atggctgccg tatcgggcgg cggtgcggtc ttcctgataa tgcttccaca 

51 tattgcccgc gttcagcgtc agccgccagc gttcgcccaa gcgtcgggag 

101 aaatcggcat tgaagccgcc ggcgaaattg tatcggctgc cgcccaagag 

151 gttttgcccg acaaacggca cggtgccgaa cgagcgcgtt accgaacggt 

201 tttgatggcc gaacgacagg cgcaggttct gttcgctgaa atctttgtta 

251 tcccaataat gcacgccgcg gctgatgccg ccgtagagga aatgatgccc 

301 gcccgcattg atttcgcgcg acacgcccaa gccgtagcgc aaaccgtgtg 

351 ccttttgcgg caggctgtcg gcggttttcg tccagcttct gcccgcaaat 

4 01 tcaatcgttt tttcggacga agcgttgttt atagcggatt aacaaaaatc 

4 51 aggacaaggc ggcgggccgc aggcagtacg gatggtacgg aaccggttcg 

501 cccggtgctt ggacgcctta gggaaccgtt ccctttgagc cggggcgggg 

551 caacccgtac cggtttttgt tcatccgcca tattgtgttg a 
This corresponds to the amino acid sequence <SEQ ID 822; ORF 229.ng>: 

g229 -pep 

1 MAAVSGGGAV FLIMLPHIAR VQRQPPAFAQ ASGEIGIEAA GEIVSAAAQ E 

51 VLPDKRHGAE RARYRTVLMA SR QAQVLFAE IFVIPIMHAA ADAAVEEMMP 

101 ARIDFARHAQ AVAQTVCLLR QAVGGFRPAS ARKFNRFFGR SWYSGLTKI 

151 RTRRRAAGST DGTEPVRPVL GRLREPFPLS RGGATRTGFC SSAILC* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 823>: 

m229.seq (partial) 

1 . . GCTCAAGCGT TGGGAGAAAT CGGCATTGAA GCCGCCGACG AAATTGTATC 

51 GGCTGCCGCC TAAGAGGTTT TGCTCGACAA ACGGCACGAT GCCGAACGAG 

101 CGCGTTACCG AACGGTTTTT ATAGCCGAAC GACAGGCGCA GGCTCTGTTC 

151 GCTGAAATCT TTGTTATCCC AATAATG C AC GCCGCCGCCG CTGATGCCGC 

201 CGTAGAGGAA ATGATGCCTG CCCGCATTGA TTTCGCGCGA CACGCCTAAG 

251 CCCTAGCGCA AACCGTGTGC CTTTTGCGGC AGGCTGTCGG CGGTTTTCGT 

301 CCAGCTTCTG CCCGCAAATT CAATCGTTTT TTCGGACGAA GCGTTGTTTA 

351 TAGCGGATTA ACAAAAATCA GGACAAGGCA ACGAAGCCGC AGACAGTACA 

4 01 AATAGTACGG AACCGATTCA CTTGGTGCTT CAGCACcTTA GAGAATCGTT 

451 CTCTTTTTTG TTCATCCGCT ATATTGTGTT GA 

This corresponds to the amino acid sequence <SEQ ID 824; ORF 229>: 

m229.pep (partial) 

1 . . AQALGEIGIE AADEIVSAAA XEVLLDKRHD AERARYRTVF IAER QAQALF 

51 AEIFVIPIMH AAA ADAAVEE MMPARIDFAR HAXALAQTVC LLRQAVGGFR 

101 PASARKFNRF FGRSWYSGL TKIRTRQRSA DSTNSTEPIK LVLQHLRESR 

151 SLFCSSAILC * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 229 shows 80.5% identity over a 169 aa overlap with a predicted ORF (ORF 229.ng) 
from N. gonorrhoeae: 

m229/g209 

10 20 30 

m2 2 9 . pep AQALG E I G IEAADE I VS AAAXEVLLDKRHDAE 

III llllllll lllllll III I I I I II 
g2 2 9 MAAVSGGGAVFLIMLPHI ARVQRQPPAFAQASGEIGIEAAGEI VSAAAQEVLPDKRHGAE 

• 10 20 30 40 50 60 

40 50 60 70 80 90 

m2 2 9 . pep RAR YRTVF I AERQ AQ ALFAB I FV I PIMHAAAADAAVEEMMPARIDFARHAXALAQTVCLL 

I I 1 1 1 1 1 : : 1 1 1 II I • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 hlllllll 

g229 RAR YRTVLMAERQAQVLFAE I FVI P IMHAAA - DAAVEEMMPARI DFARHAQAVAQTVCLL 

70 8G 90 100 110 

100 110 120 130 140 
m2 2 9 . pep RQAVGGFRPASARKFNRFFGRS WYSGLTKI RTRQRSADSTNSTEP IHLVLQHLRE 

llllllllllimillllllllllllllllllhhl ||>i|||>< II :||| 
g2 2 9 RQ AVGG FRP AS AR KFNRFFGRS W YSGLT K I RTRRRAAG STDGTE PVRP YLGRLRE PFPL 

120 130 140 150 160 170 

150 160 
m229.pep SRSLFCSS AILCX 

:|= llllllll 
g2 2 9 SRGGATRTGFCSSAILC 

180 190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 825>: 

a229.seq (partial) 

1 ATGGCTGTCG TATCGGGCGG CGGTGCGGTC TTCCTGATAA CGCTTCCACA 

51 TATTGCCCAC GTTCAGCGTC AGCCGCCA. . GTTCGCTCAA GCGTCGGGAG 

101 AAATCGGCAT TGAAGCCGCC GACGAAATTG TATCGGCTGC CGCCTAAGAG 

151 GTTTTGCTCG ATAAACGGCA CGATGCCGAA TGAGCGCGTT ACTGAACGGT 

201 TTTTATAGCC GAGCGACAGG CGCAGGCTCT GTTCGCTGAA ATCTTTGTTA 

251 TCCTAATAGT GCACGCCGCC GCCGCTGATG TCTCCGTAGA GGAAATGATG 

301 CCCGCCCGCA TTGATTTCGC GCGACACGCC CAAGCCGTAG CGCAAACCGT 

351 GTGCCTTTTG CGGCAGGCTG TCGGCGGTTT TCGTCCAGCT TCTGCCTGCA 

4 01 AATTCAATCG TTTTTTCGGA CGAAGCGTTG TTTATAGCGG ATTAACAAAA 

4 51 ATCAGGACAA GGCGACGAAG CGCAGACAGT ACAGATAGTA CGGAACCGAT 

501 TCACTTGGTG CTTCAGCACC TTAGAGAATC GTCTCTTTGA GCTAAGGCGA 

551 GGCAACGCCG TACTGGTTTT TGTTCATCCA CTATA 

This corresponds to the amino acid sequence <SEQ ID 826; ORF 229.a>: 

a229.pep (partial) 

1 MAWSGGGAV FLITLPHIAH VQRQPPXFAQ ASGEIGIEAA DEIVSAAA *E 

51 VLLDKRHDAE *ARY*TVFIA ERQ AQALFAE IFVILIVHAA AADVSVEEMM 

101 PARIDFARHA QAVAQTVCLL RQAVGGFRPA SACKFNRFFG RSWYSGLTK 

151 IRTRRRSADS TDSTEPIHLV LQHLRESSL* AKARQRRTGF CSSTI 

m229/a229 85.6% identity in 1 67 aa overlap 

10 20 30 

m229 .pep AQALGEIG IEAADE I VS AAAXEVLLDKRHDAE 

III I I II I I I I I I I I I I I I I I I I I I I I I I I I 
a229 MAWSGGGAVFLITLPHIAHVQRQPPXFAQASGEIGIEAADEIVSAAAXEVLLDKRHDAE 
10 20 30 40 50 60 

40 50 60 70 80 90 

m229 .pep RARYRTVFIAERQAQALFAEXFVIPIMHAAAADAAVEEMMPARIDFARHAXALAQTVCLL 
Ml I II II Ml E I I II I I I I II I : I I I I I I :: II I M I I II I II I II hlllllll 
a2 29 XAR YXT V FI AERQAQAL FAE I FV I L I VHAAAAD VS VEE MM PAR I D FARHAQ AVAQT VC LL 

10 80 90 100 110 120 
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100 110 120 130 140 149 

m229.pep RQAVGGFRPASARKFNRFFGRSWYSGLTKIRTRQRSADSTNSTEPIHLVLQHLRES 

I I I I I I I I I I I I I I I II M I I I I I I II I I I I I I : I I I I I I : I I I I I I I I I I I I I I I 
a229 RQAVGGFRPASACKFNRFFGRSWYSGLTKIRTRRRSADSTDSTEPIHLVLQHLRESSLX 
130 140 150 160 170 180 

150 160 

m229 .pep RSLFCSSAILCX 

1:. 1111:1 

a229 AKARQRRTGFCSSTI 
190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 827>: 
g230.seq 

1 atgttccatt ccatcgaaaa 'atacagaaca cccgcccaag tcttattagg 

51 cctgattgca ttaacttttg tcggcttcgg cgtcagcacg gtttcccatc 

101 cgggcgccga ctacatcgtc caagtgggcg acgaaaaaat cagcgagcac 

151 tcaatcaaca acgccatgca gaacgagcag gcggacggcg gcagcccttg 

201 gcgcgacgcg gtgttccaat ccctgctgca acgcgcctac ctgaaacagg 

251 gcgcgaagct gatgggcatt tcggtttctt ccgaacaaat caagcagatg 

301 attgtggacg atcccaattt ccacgacgca aacggcaaat tcagtcacgc 

3 51 gcttttgagt caatacctgt cgcaacgcca tatgtctgaa gaccagtttg 

401 ccgaagaaat ccgcgatcag tttgccttgc agaatttggt aagccccgtc 

451 caaaacggcg tattggtcgg cgacgcgcag gcggaacagc tgatcaggct 

5 01 gacgcaggtc aaccgcacca tccgttcgca cactttcaac cccgacgagt 

551 tcatcgccca agtcaaagcg tctgaagccg atttgcagaa attttataat 

601 gcgaacaaaa aagactatct gctgccgcag gcggtcaaat tggaatatgt 

651 cgccttgaat ctgaaggatt ttgcagacaa gcagaccgtc agtgaaacgg 

701 aagtgaaaaa tgcgtttgaa gagcgcgtgg cgcgtttgcc ggcacatgaa 

751 gccaaacctt ctttcgagca ggaaaaagcc gccgtcgaaa acgaactgaa 

801 aatgaaaaag gcggttgccg acttcaacaa ggcaaaagaa aagctgggcg 

851 acgatgcgtt caatcatccc tcctcgcttg ccgaagccgc caaaaacagc 

901 ggtttgaaag tggaaaccca agaaacttgg ctgagcaggc aggacgcaca 

951 aatgtccggc atgcccgaaa acctaatcaa tgccgtattc agcgacgacg 

1001 tattgaagaa aaaacacaat tccgaagtgc tgaccatcaa cagcgaaacc 

1051 gcgtgggtcg tccgcgccaa agaagtccgc gaagaaaaaa acctactgtt 

1101 tgaagaagcc aaagatgcgg tgcgtcaggc ctatatccgt accgaagccg 

1151 ccaaactttt gaaaacaatg taa 

This corresponds to the amino acid sequence <SEQ ID 828; ORF 230.ng>: 

g230.pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISEH 

51 SINNAMQNEQ ADGGSPWRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQM 

101 IVDDPNFHDA NGKFSHALLS QYLSQRHMSE DQFVEEIRDQ FALQNLVSLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKA SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPAHE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWWRAKEVR EEXNLLFEEA KDAVRQAYIR TEAAKLLKTM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 829>: 

m230.seq (partial) 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAaT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCc.GACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAAATT 

301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

351 GCTTTTAAAC CGCTACCTTT CCCAACGCCA TATGTCTGAA GACCAGTTTG 

401 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 
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451 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACACAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAGT 

551 TCATCGCCCA AGTCAAAGTG TCTGAAGCCG ATTTGCAGAA ATTTTATAAT 

601 GCGAACAAAA AAGACTATCT GCTGCCGCAG gCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAGGATT TTGCAGACAA GCAGACCGTC AGTGAAACGg 

701 AAGTGAAAAA TGCATTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAACAA GGCAAAAGAA AAATTGGGCG 

851 ACGATGC.GT cAACCATCCT TCyTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGTAGGC AGGACGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTT 

This corresponds to the amino acid sequence <SEQ ID 830; ORF 230>: 

m230.pep (partial) 

X MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSPDA VFQSLLORAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADL0KFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKI4KK AVADFNKAKE KLGDDAVNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWWRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKL . . ; . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 230 shows 95.9% identity over a 386 aa overlap with a predicted ORF (ORF 230ng) 
from N. gonorrhoeae: 

m230/g230 

10 20 30 40 50 60 

m23 0 . pep MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKI SDHSINNAIQNEQ 

1 1 1 II II II I III II I III II 1 1 1 1 1 1 1 ! Ml II Ml M I M I II IIMI MM M I II 

g23 0 MFHSIEKYRTPAOVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISEHSINNAMQNEQ 
10 20 30 40 50 60 

70 80 90 100 110 120 ' 

m23 0 - pep ADGGGPS PDAVFQSLLQRAYLKQGAKLMGI SVSSEQI KQI I VDD PN FHDANGK FDHALLN 

M MM II III III IMMMI III Mill IM MIIMMMMM MIMMMh 

g23 0 ADGGS PWRDAVFQSLLQRAYLKQGAKLMGI SVS S EQI KQMIVDDPNFHDANGKFSHALLS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m23 0 . pep RYLSQRHMSEDQFVEEIRDQFALQNLVNLVONGVLVGDAQAEQLIRLTQVNRTIRSHTFN 

MIMIMMMIIMIMMMIIIhMIIIMIIIIIIIIIIIIMIIIIIIIIIM 

g23 0 Q YLS QRHMS EDQFVEE I RDQFALQNLVSLVQNGVLVGDAQAEQL I RLTQVNRTI RSHTFN 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 3 0 . pep PDEFI AQVKVS EADLQKFYNANKKDYLLPQAVKLEYVALNLKD FAD KQTVS ETE VKNAFE 

II MMMIMM III IMMMI Ml II III III III IMMMI II MM III I II II 

g23 0 PDEF I AQVKAS EADLQKFYNANKKD YLLPQ AVKLEYVALNLKD FAD KQTVS ETEVKNAFE 

190 200 210 220 230 240 

250 260 270 280 290 300 

m23 0 . pep ERVARl^ANEAKPSFEQEKAAVENELKMKKAVADFNKAK^ 

i 1 i 1 1 1 1 1 = 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 } 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 IIMMMIMM 

g23 0 ERVAR LP AHE AK P S F EQE KAAVENELKMKKAVAD FNKAKE KLGDDAFNH P S S LAE AAKNS 

250 260 270 280 290 300 
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310 320 330 340 350 360 

m2 3 0 . pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKiCKHNSEVLTINSETAWVVRAKEVR 

II I II MM M II II II llll III Ml MM II 11 II! Ill III II Mill Mil III II 

g23 0 GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 
310 320 330 340 350 360 

370 380 
m23 0 . pep EEKTLPFAEAKDAVRQAYI RTEAAKL 

I M - I I I I I I I I I I I I I I I I I I M 
g230 EEKNLLFEEAKDAVRQAY I RTEAAKLLKTM 

370 380 390 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 83 1>: 

a230.seq (partial) 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTACA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATG3GCATT TCGGTTTCTT CCGAACAAAT CAAGCAGATT 

301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

351 GCTTTTAAAC CGCTACCTTT CCCAACGTCA TATGTCTGAA GACCAGTTTG 

4 01 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 

4 51 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACGCAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAAT 

551 TCATCGCCCA AGTCAAAGTG TCTGAAGCCG ATTTGCAGAA GTTTTATAAC 

601 GCAAACAAAA AAGACTACCT GCTTCCCAAA GCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAAGACT TTGCAGACAA ACAGACCGTC AGCGAAACAG 

7 01 AAGTGAAAAA TGCGTTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAATAA GGCAAAAGAA AAGCTGGGCG 

851 ATGACGCGTT CAACCATCCT TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGCAGGC AGGATGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTT 

This corresponds to the amino acid sequence <SEQ ID 832; ORF 230.a>: 

a230.pep (partial) 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPK AVKL2YVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWWRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKL 

m230/a230 99.2% identity in 386 aa overlap 

10 20 30 40 50 60 

m230.pep MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHP3ADYIVQVGDEKISDHSINNAIQNEQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I 
a230 MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHP3ADYIVQVGDEKISDHSI^5NAIQNEQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

m230 . pep ADGGGPSPDAVFQSLLQRAYLKQGAKLMGISVSSEQIKQIIVDDPNFHDANGKFDHALLN 
I I I I I I I I I I I I I I I I I I I | I I I I | | | | I | | | | | I I I || I I I I I | I | | 1 | | I I | | | I | | 
a230 ADGGGPSRDAVFQSLLQRAYLKQGAKLMGISVSSEQIKCIIVDDPNFHDANGKFDHALLN 

70 80 90 100 110 120 

130 140 150 160 170 180 
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m230 . pep RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 
1 1 I 1 1 I 1 1 I 1 1 I I I 1 1 1 1 1 I I 1 1 I 1 1 I 1 1 I I I I I I 1 1 I M I M I 1 1 I I I I M I II 1 1 I I 
a230 RYLSQRHMSEDQEVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 
130 140 150 160 170 180 

190 200 210 220 230 240 

m230 . pep PDEFIAQVKVSEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 
I I I I I I I I I II I I I I II I I I I I I I Ml I I: I I I I I I I I I I I I I I I I I I I I II II I I I II I 
a230 PDEFIAQVKVSEADLQKFYNANKKDYLLPKAVKLEYVALNLKDFADKQTVSETEVKNAFE 

190 200 210 220 230 240 

250 260 270 2B0 290 300 

m2 30 . pep ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAVNHPSSLAEAAKNS 
I II I I I I I I I I I I M I I I I I II I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I 
a230 ERVARLPANEAKPSFEQEKAAVENELKhJKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 

250 260 270 230 290 300 

310 320 330 340 350 360 

m230 . pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 
I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I t I I I I I I I I I II I I I I I I I I I I II I I I 
a230 GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 

310 320 330 340 350 360 

370 380 
m2 30 . pep E E KT L P FAE AKD AVRQAY I RTE AAK L 

I I I II I I II I I I I I I I I I I I I I I I I I 
a230 EEKTLP FAE AKD AVRQAY I RTE7VAKL 

370 380 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 833>: 

g230-l.oeq 

1 ATGTTCCATT CCATCGAAAA ATACAGAACA CCCGCCCAAG TCTTATTAGG 

51 CCTGATTGCA TTAACTTTTG TCGGCTTCGG CGTCAGCACG GTTTCCCATC 

101 CGGGCGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGAGCAC 

151 TCAATCAACA ACGCCATGCA GAACGAGCAG GCGGACGGCG GCAGCCCTTG 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAGATG 

301 ATTGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCAGTCACGC 

351 GCTTTTGAGT CAATACCTGT CGCAACGCCA TATGTCTGAA GACCAGTTTG 

401 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAGCCTCGTC 

451 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACGCAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAGT 

551 TCATCGCCCA AGTCAAAGCG TCTGAAGCCG ATTTGCAGAA ATTTTATAAT 

601 GCGAACAAAA AAGACTATCT GCTGCCGCAG GCGGTCAAAT TGGAATATG7 

651 CGCCTTGAAT CTGAAGGATT TTGCAGACAA GCAGACCGTC AGTGAAACGG 

701 AAGTGAAAAA TGCGTTTGAA GAGCGCGTGG CGCGTTTGCC GGCACATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAACAA GGCAAAAGAA AAGCTGGGCG 

851 ACGATGCGTT CAATCATCCC TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TGGAAACCCA AGAAACTTGG CTGAGCAGGC AGGACGCACA 

951 AATGTCCGGC ATGCCCGAAA ACCTAATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAAAAAA ACCTACTGT7 

1101 TGAAGAAGCC AAAGATGCGG TGCGTCAGGC CTATATCCGT ACCGAAGCCG 

1151 CCAAACTTGC CGAAAACAAG GCAAAAGAAG TGCTTACCCA ACTGAACGGC 

1201 GGCAAGGCAG TTGACGTGAA ATGGTCGGAA GTGTCCGTTT TGGGCGCGCA 

1251 GCAGG CAAGG CAGTCCATGC CGCCCGAGGC TTATGCGGAA CTGCTGAAAG 

1301 CAAAACCGGC AAACGGCAAA CCCGCCTATG TCAGACTGAC CGGTCTGCCG 

1351 GCACCCGTGA TTGTCGAGGC GCAGGCAGTC ACGCCTCCGG AGGATATTGC 

1401 CGCACAGCTT CCTCCTGCGA AACAGGCTTT GGCGCAACAG CAGTCTGCCA 

1451 ATACTTTCGA CCTGCTGATC CGCTATTTCA ACGGAAAAAT CAAACAGACT 

1501 AAAGGAGCAC AATCGGT7GA CAACGGCGAT GGTCAGTAA 

This corresponds to the amino acid sequence <SEQ ID 834; ORF 230-l.ng>: 

9230-1. pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV OVGDEKISEK 

51 SINNAMQNEQ ADGGS PWRDA VFQSLLQRAY LKQGAKLMGT SVSSEQIKQN 

101 IVDDPNFHDA NGKFS HALLS QYLSQRHMSE DQFVEE1RDQ FALQHLVSLV 
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151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKA SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPAHE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWWRAKEVR EEKNLLFEEA KDAVRQAYIR TEAAKLAENK AKEVLTQLNG 

401 GKAVDVKWSE VSVLGAOQAR OSMPPEAYAE LLKAKPANGK PAYVRLTGLP 

451 APVIVEAQAV TPPEDIAAQL PPAKQALAQQ QSANTFDLLI RYFNGKIKQT 

501 KGAQSVDNGD GQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 835>: 

m230-l.seq 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAAATT 

301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

351 GCTTTTAAAC CGCTACCTTT CCCAACGCCA TATGTCTGAA GACCAGTTTG 

401 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 

451 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACACAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAGT 

551 TCATCGCCCA AGTCAAAGTG TCTGAAGCCG ATTTGCAGAA ATTTTATAAT 

601 GCGAACAAAA AAGACTATCT GCTGCCGCAG GCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAGGATT TTGCAGACAA GCAGACCGTC AGTGAAACGG 

701 AAGTGAAAAA TGCATTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAACAA GGCAAAAGAA AAAT?GGGCG 

851 ACGATGCGTT CAACCATCCT TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGTAGGC AGGACGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTTGC CGAAAACAAG GCAAAAGACG TGCTTACCCA ACTGAACGGC 

1201 GGCAAGGCTG TTGACGTGAA ATGGTCGGAA GTGTCCGTTT TGGGCGCACA 

1251 GCAGGCAAGG CAGTCCATGC CGCCCGAGGC TTATGCGGAA CTGCTGAAAG 

1301 CAAAACCGGC AAACGGCAAA CCCGCCTACG TCAGGCTGAT CGGTCTGCCG 

1351 GCACCCGTGA TTGTCGAAGT ACAGGCTGTA ACCCCGCCGG ATGATATCGC 

1401 CGCACAGCTT CCGCTTGCAA AACAGGCTTT GGCGCAACAG CAGTCTGCCA 

1451 ATACTTTCGA CTTGTTGATA CGTTATTTCA ACGGCAAAAT CAAACAGACC 

1501 AAAGGAGCGC AATCGGTCGA CAACGGCGAC GGTCAGTAA 

This corresponds to the amino acid sequence <SEQ ID 836; ORF 230-l>: 

m230-l.pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWWRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKLAENK AKDVLTQLNG 

401 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLIGLP 

451 APVIVEVQAV TPPDDIAAQL PLAKQALAQQ QSANTFDLLI RYFNGKIKQT 

501 KGAQSVDNGD GQ* 

m230-l/g230-l 96.3% identity in 512 aa overlap 



10 20 30 40 50 60 

MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 

I I I I I I I I II I I I I t I I I I I I IHM1 III Mill Mil II j:||| |||:||| I 

MFHSIEKYRT P AQVLLGL I ALT FVG FG VS T VS H PGA DY 1 V QVGDEK I S EH S I NN AMQNEQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

A DGGG PS RDA V FQS L LQRA Y LKQG AKLMG ISVSSEQIKQII VDD PN FH DANGK FDHALLN 
IMI:l llllflllflllMlilllllillllilllll:||||lliittlltl:|M|: 
ADGGSPWRDAVFQSLLQRAYLKQGAKLMGISVSSEQIKQMIVDDPNFHDANGKFSHALLS 

70 80 90 100 110 120 

130 140 150 160 170 180 

RYLSQRHMSEDQFVEEI RDQ FALQN L V NLVQNGVLVGD AQAEQL I RLTQVNRTI RSHTFN 



it)230-l .pep 
g230-l 

m230-I.pep 
g230-l 

m230-l.pep 
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: I I 1 1 1 1 1 I II I I I I I 1 1 1 II I I I 1 1 I : I 1 1 I! M l ! I 1 1 t 1 1 I i I I I I II I I I 

g23 0- 1 QYLSQRHMSEDQFVEEI RDQFALQNLVSLVQNGVLVGDAQAEQLI RLTQVNRTI RSHTFN 

130 140 150 160 170 180 

190 200 210 220 230 240 

m230- 1 . pep PDEFIAQVKVSEADLOKFYNANKKDYLLPQAVKLEYVALNLKDFACKQTVSETEVKNAFE 

I I I I I I I I I : I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 1 I I I I I I I I I I I I [ I I I I I I 
g230-l PDEFIAQVKASEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 

190 200 210 220 230 240 

250 260 270 280 290 300 

m230-l . pep E RV ARLP ANEAK P S FEQEKAA VENELKMKKA V AD FN KAKE K LGD DAFN HPS S LAEAAKN S 

I I I 1 1 1 1 1 s 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

g230- 1 ERVARLPAHEAKPS FEQEKAA VENELKMKKA VADFNKAKEKLGDDAFNHPSSLAEAAKN3 

250 260 270 280 290 300 

310 320 330 340 350 360 

m230-l . pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 
i 1 1 t I I 1 1 1 1 1 1 1 1 i I 1 1 I I 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 f I I I t I 1 1 ! I I I 1 1 1 I 
g230-l GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVRAKEVR 
310 320 330 340 350 360 

370 380 390 400 410 420 

m230- 1 . pep E EKTL P FAEAKDAVRQA Y I RTEAAKLAENKAKDVLTQLNGGKAVD VKWS EVSVLGAQQAR 
111:1 I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I M I I I I I I I I I I I I I I I I I I I I I 
g230-l EEKRLLFEEAKDAVRQAYIRTEAAKLAENKAKEVLTOLNGGKAVDVKWSEVSVLGAOQAR 

370 380 390 400 410 420 

430 440 450 460 470 480 

IU230-1 . pep QSMPPEAYAELLKAKPANGKPAYVRLIGLPAPVIVEVQAVTPPDDIAAQLPLAKQALAQQ 
I I I I I I I I 1 I I I I It I I I I U I I I II illll|lil:|||tl|:|lil||| 1 I I I I I I | 
g230- 1 QSMPPEAYAELLRAKPANGKPAYVRLTGLPAPVIVEAQAVTPPEDIAAQLPPAKQALAQQ 

430 440 450 460 470 480 

490 500 510 

m230-l .pep QSANTFDLLI RYFNGKI KQTKGAQSVDNGDGQX 
I II I I I II II I I I II i I I I I I I I ill M i I II I 
g230-l QSANTFDLLI RYFNGKI KQTKGAQSVDNGDGQX 

490 500 510 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 837>: 

a230-l . eeq 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTACA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAGATT 

301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

351 GCTTTTAAAC CGCTACCTTT CCCAACGTCA TATGTCTGAA GACCAGTTTG 

401 TCGAAGAAAT CCGCGATCAG TT7GCCTTGC AGAATTTGGT AAACCTCGTC 

451 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACGCAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAAT 

551 TCATCGCCCA AGTCAAAGTG TC?GAAGCCG ATTTGCAGAA GTTTTATAAC 

601 GCAAACAAAA AAGACTACCT GCTTCCCAAA GCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAAGACT TTGCAGACAA ACAGACCGTC AGCGAAACAC 

701 AAGTGAAAAA TGCGTTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

751 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

801 AATGAAAAAG GCGGTTGCCG ACTTCAATAA GGCAAAAGAA AAGCT3GGCG 

851 ATGACGCGTT CAACCATCCT TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGCAGGC AGGATGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTTGC CGAAAACAAG GCAAAAGACG TGCTTACCCA ACTGAACGGC 

1201 GGCAAGGCTG TTGACGTGAA ATGGTCGGAA GTGTCCGTTT TGGGC3CACA 

1251 GCAGGCAAGG CAGTCCATGC CGCCCGAGGC TTATGCGGAA CTGCTSAAAG 

1301 CAAAACCGGC AAACGGCAAA CCCGCCTACG TCAGGCTGAT CGGTCTGCCG 

1351 GCACCCGTGA TTGTCGAAGT ACAGGCTGTA ACCCCGCCGG ATGATATCGC 

1401 CGCACAGCTT CCGCTTGCAA AACAGGCTTT GGCGCAACAG CAGTCTGCCA 

1451 ATACTTTCGA CTTGTTGATA CGTTATTTCA ACGGCAAAAT CAAACAGACC 

1501 AAAGGAGCGC AATCGGTCGA CAACGGCGAC GGTCAGTAA 
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This corresponds to the amino acid sequence <SEQ ID 838; ORF 230-La>: 

a230-l.pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEI RDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPK AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWVVRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKLAENK AKDVLTQLNG 

401 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLIGLP 

451 APVIVEVQAV TPPDDIAAQL PLAKQALAQQ QSANTFDLLI RYFNGKIKQT 
501 KGAQSVDNGD GQ* 



a230-l/m230-l 99.8% identity in 512 aa overlap 

10 20 . 30 40 50 60 

a230- 1 . pep MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYI VQVGDEKISDHSINNAIQNEQ 
I I I I I ! i I I I I I I I I I I I I I I i I I I I I I I t I I I I I I I I I I I I I I I I II I I I I t I I II I I I 
m230-l MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

a23 0- 1 . pep ADGGGPSRDAVFQSLLQRAY LKQGAKLMGI S VSSEQI KQI I VDDPNFHDANCKFDHALLN 
I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I 1 I I I I I I I I I I 
m230-l ADGGGPSRDAVFQSLLQRAYLKQGAKLMGISVSSEQIKQIIVDDPNFHDANGKFDHALLN 

70 80 90 100 110 120 

130 140 150 160 170 180 

a230- 1 . pep RYLSQPxHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
m230- 1 RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFW 

130 140 150 160 170 180 

190 200 210 220 230 240 

a230- 1 . pep PDEFIAQVKVSEADLQKFYNANKKDYLLPKAVKLEYVALNLKDFADKQTVSETEVKNAFE 
I I I I I I I I I I I I I I I I I I I I I I I f I I I I I : I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I 
m230-l PDEFIAQVKVSEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 

190 200 210 220 230 240 

250 260 270 280 290 300 

a230-l.pep ERVARL PANEAKP S FEQEKAA VEN ELKMK KA VADFNKAKEKLG DDAFNH PS S L AE AAKN S 
i II I I I I I I I I I I I I I I I I I I I i I I I I I M I I I I I I I I I I I I I I I I I I 1 I I I I i I I I I II 
m23 0- 1 ERVARL PANEAKP S FEQEKAA V EN E LKMK KAVADFNKAKEKLG DDAFNHPS S L AE AAKN S 

250 260 270 280 290 300 

310 320 330 340- 350 360 

a230-l . pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 

I I I I I I I I I I I I I I t I I I I I I I I I I I I II I I I I II I I I I I I I I II I I I I I I I I I I I I I 1 i 
ni230-l GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 
310 320 330 340 350 360 

370 380 390 400 410 420 

a230-l.pep EEKTLPFAEAKDAVRQAYIRTEAAKLAENKAKDVLTQLNGGKAVDVKTJSEVSVLGAQQAR 
I I I I I I I I I I I I I I II I M I I I I I I I I I I I I i I I I I I I II I I I I I I I I I I I I I I I I I I I I 
m2 3 0- 1 EEKTLPFAEAKDAVRQAYI RTEAAKLAENKAKDVLTQLNGGKAVDVKWSEVSVLGAQQAR 

370 380 390 400 410 420 

430 440 450 460 470 480 

a230-l . pep QSMPPEAYAELLKAKPAWGKPAYVRLIGLPAPVIVEVQAVTPPDDIAAQLPLAKQALAQQ 
I I I I 1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I I I I M I I I I I I 

m230- 1 QSMPPEAYAELLKAKPANGKPAYVRLIGLPAPVIVEVQAVTPPDDIAAQLPLAKQALAQQ 
430 440 450 460 470 480 

490 500 510 

a230-l .pep QSANTFDLLI RYFNGKIKQTKGAQSVDNGDGQX 
lllllillllllflllllllll llllllllll I 
m230- 1 QSANTFDLLI RYFNGKIKQTKGAQSVDNGDGQX 

490 500 510 
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The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 839>: 



ttcagacggc atttgccgtt caaggccgtg cggtgtcttt 
caaccattcg cccacggaat ccatccaatc cttactgccc 
tgcctgcccg gcggtacgcc cacggcgctt gcggattttt 
aatcctttgc gttccctttc cgcctgaatt tgagcgtcgg 
aaaatccgcc ttatcctgct gttctttagc ataactttta 



y * ut-vj 




i 


atgtcaaaac 


51 


y '••^'3 »»wy 


101 


acttttfctat 

a t^. l_- L- l_ U, L* * — d V— 


151 


yuy<-y<-yycc 


201 


a w\« ci a a l. y ^ ^ 


251 




JUi 




351 


p af- anh cane* 


401 


taatgccacg 


451 


geeggeggat 


501 


cacgtacact 


551 


gaacgtgtac 


601 


ccgaatttta 


651 


cgtcacagac 


701 


gatgcccgtc 


751 


gagtaegteg 


801 


ccgtataacc 


851 


atgatttttt 


901 


ctctga 



This corresponds to the amino acid sequence <SEQ ID 840; ORF 231 .ng>: 

g231.pep 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 

51 ARGFQTAFAV QGRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 

101 SFPQSFAFPF RLNLSV GIVG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 

151 AGGYLRHFAL IAVGFKHTYT DFPTLRRRAQ WARTCTV3L FHLRCVDIRH 

201 PNFMFRAVAV DDVKGIAVID FGHRACVAVA GFRRCPSANG RVETHVPCSA 

251 EYVEYGNRRP HRLTVISRIT QRAQKRQGDG KPFHDFFNLH IFQMPMPSEH 

301 L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 841 >: 

m231.seq (partial) 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG GC 

This corresponds to the amino acid sequence <SEQ ID 842; ORF 23 1>: 

m231.pep (partial) 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 
51 ARGFQTAFAV QSRAVSLPNA QPFG 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 231 shows 98.6% identity over a 73 aa overlap with a predicted ORF (ORF 231.ng) 
from N. gonorrhoeae: 

m231/g231 

10 20 30 40 50 60 

m231 . pep MSKRiCSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 

III llllllll Ml ll! MIIIIIIIIMI llllillll I Mill II III II I III II I 
g231 MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 
10 20 30 40 50 60 

70 

ra231 .pep OSRAVSLPNAQPFG 
hlllilllllll: 

9231 QGRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRIFSFPQSFAFPFRLNLSVGIVG 
70 80 90 100 110 120 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 843>: 
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a231.seq (partial) 

. 1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 

101 ACTTTTTTAT CGNGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGCAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCCCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAATCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

451 GCCGACAGAA ACCTGCGCCA CTTCGCGCTG GTAGCGGTCG GTGTCGAACA 

501 CGCGGACGCT GACTTTCCTG CCTTCCGCCG CCGCGCGCAG GTTGTCGCGC 

551 GAACGCGTGC CGTAAGCCTG TTTCATCTCC GGCGCGTCGA TATACGCCAT 

601 CCGGATTTTG TGTTTCGCGC CGTCGCCGTC GATAACGTGA AGGGTGTCGC 

651 CGTCATAGAC TTTGGACACC GTGCCTGTGT AGCGGTGGCC GGATTTCGCC 

701 GATGCTCGGC GGCGGGCGGG CGCGTCGGAA CCCGCGTCCC CTGCCGCGCC 

751 GAGTACGTCG AGTACGGCAA CCGCCGTCCG CACCGCCTCG CTGCCGTACC 

801 CCGTATAACC CAACGCACCC AAAAGCGACA AGGCGACGGG AAGCCATTTC 

851 ATGATTTTTT . TAATCTGCAT ATTTTTCAAA TGCCGATGCC GTCTGAACAT 

901 ATC 

This corresponds to the amino acid sequence <SEQ ID 844; ORF 217.a>: 

a231.pep (partial) 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLN FFIXAD GGCASPQKCR 

51 ARGFQTAFAV QSRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 

101 SFPQSFAFPF RLNLSVG IIG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 

151 ADRNLRHFAL VAVGVEHADA DFPAFRRRAQ WARTRAVSL FHLRRVDIRH 

201 PDFVFRAVAV DNVKGVAVID FGHRACVAVA GFRRCSAAGG RVGTRVPCRA 

251 EYVEYGNRRP HRLAAVPRIT QRTQKRQGDG KPFKDFFNLH IFQMPMPSEH 

301 I 

m231/a231 98.6% identity in 73 aa overlap 

10 20 30 40 50 60 

m231.pep MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLKFFIAADGGCASPQKCRARGFQTAFAV 
I I I I I I I i I II I II I I I I I I I I I I I I II I I I I I I I I I i I I I I I I I I I I I I I I I I I I M I 
a231 MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLKFFIXADGGCASPQKCRARGFQTAFAV 

10 20 30 40 50 60 

70 

m231 .pep QSRAVSLPNAQPFG 
I I II I I I I I I 1 I I : 

a231 QSRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRIFSFPQSFAFPFRLNLSVGIIG 
70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 845>: 

g231-l. seq 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGcCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAGGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGAAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCCCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAGTCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

4 51 GCCGGCGGAT ACCTGCGCCA CTTCGCGCTG ATAGCGGTCG GTTTCAAACa 

501 CaCgTaCaat gagtttcgtA ccctccGCCG ccgcgcgCAG GTTGtcgcGC 

551 GAACgTGTAC CGTAagcgtg TTtcatctcc GGTGCgtcGA TATACGCCaT 

601 cCgAATTTta tGTttcgcgc cgtcgcCgtc gATGACGTGA AGGGtatcGC 

651 CgtcATAGAC TTTGGACACC Gtgcctgcgt AGctGTGGCC GGATttcgc 

This corresponds to the amino acid sequence <SEQ ID 846; ORF 231 -1 .ng>: 

g2 31-1. pep 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 
51 ARGFQTAFAV QGRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRI F 
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101 SFPQS FAFPF RLNLSV GIVG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 
151 AGGYLRHFAL IAVGFKHTYN EFRTLRRRAQ WARTCTVSV FHLRCVD1RH 
201 PNFMFRAVAV DDVKGIAVID FGHRACVAVA GFR 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 847>: 

m231-l.aeq 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGCAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCTCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAATCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

451 GCCGACAGAA ACCTGCGCCA CTTCGCGCTG GTAGCGGTCG GTATCGAACA 

501 CGCGCACGCT GACTTTCCTG CCTTCCGCCG CCGCGCGCAG GTTGTCGCGC 

551 GAACGCGTGC CGTAAGCCTG TTTCATCTCC GGCGCGTCGA TATACGCCAT 

601 CCGGATTTTG TGTTTCGCGC CGTCGCCGTC GATAACGTGA AGGGTGTCGC 

651 CGTCATAGAC TTTGGACACC GTGCCTGTGT AGCGGTGGCC GGATTTCGCC 

701 GATGCTCGGC GGCGGGCGGG CGCGTCGGAA CCCGCGTCCC CTGCCGCGCC 

7 51 GAGTACGTCG AGTACGGCAA CCGCCGTCCG CACCGCCTCG CTGCCGTACC 

801 CCGTATAACC CAACGCACCC AAAAGCGACA GGGCGACGGG AAGCCATTTC 

851 ATGATTTTTT TAATCTGCAT ATTTTTCAAA TGCCGATGCC GTCTGAACAT 

901 ATCGGAATCG GATTTCAGAC GGCATCTTAA 

This corresponds to the amino acid sequence <SEQ ID 848; ORF 231-1>: 

m231-l .pep 

1 MSKRKSINRP YQRPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 

51 ARGFQTAFAV QSRAVSLPNA QPFAHGIHPI LIAPAAPACS AVRPRRLRIF 

101 SFPQS FAFPF RLNLSVG IIG KIRLILLFFS ITFI MPRRPV LHLHQVQIGF 

151 ADRNLRHFAL VAVGIEHAHA DFPAFRRRAQ VVARTRAVSL FHLRRVDIRH 

201 PDFVFRAVAV DNVKGVAVID FGHRACVAVA GFRRCSAAGG RVGTRVPCRA 

251 EYVEYGNRRP HRLAAVPRIT QRTQKRQGDG KPFHDFFNLH I FQMPMPSEH 

301 IGIGFQTAS* 

g231-l/m231-l 87.0% identity in 262 aa overlap 

10 20 30 40 50 60 

g231-l . pep MSKRKSINRPYQKPAELPPLO^NPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 

m231-l MSKRKSINRPYQKPAELPPLQMNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 
10 20 30 40 50 60 

70 80 90 100 110 120 

g231-l . pep QGRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRIFSFPQSFAFPFRLNLSVGIVG 
I : I I I I I I I I I I I I I I I I I I I I 1 I I I I I I i ] I I I I I I I II I I I I I I I I I I I I | | | 1 I: | 
m231- 1 QSRAVSLPNAQPFAHGIHPILIAPAAPACSAVRPRRLRI FSFPQSFAFPFRLNLSVGIIG 

70 80 90 100 110 120 

130 140 150 160 170 180 

g231-l - pep KIRLILLFFSITFIMPRRPVLHLHQVQIGFAGGYLRHFALIAVGFKHTYNEFRTLRRRAQ 
I I I I I I 1 1 I I I f I I I I I I I ! 1 1 I I I I I I I I I I I I I 1 I : I I I : : t : : : I : : I i 1 1 I 
m231-l KIRLILLFFS I TFIMPRRPVLHLHQVQIGFADRNLRHFALVAVGIEHAHADFPAFRRRAQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

g231-l . pep WARTCTVSVFHLRCVDIRHPNFMFRAVAVDDVKGIAVIDFGHRACVAVAGFRXCPSANG 
llllt : I I : 1 1 I I I I I I I I : I : I I 1 I I I I : I I I : I I I \ I I I I I I I I I II I I I : I : I 
m23 1- 1 VVARTRAVSLFHLRRVDIRHPDFVFRAVAVDNVKGVAVIDFGHRACVAVAGFRRCSAAGG 

190 200 210 220 230 240 

250 260 
g231- 1 . pep CVETHVPCSAEYVVXGNRRPHR 
I llll lllllll 

m231- 1 RVGTRVPCRAEYVEYGNRRPHRLAAVPRITQRTQKRQGDGKPFHDFFNLH I FQMPMPSEH 

250 260 270 280 290 300 

The following partial DNA sequence was identified in N. meningitidis <SEQ LD 849>: 

a231-l . seq 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGCCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 
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151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAAGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGCAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCCCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAATCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

451 GCCGACAGAA ACCTGCGCCA CTTCGCGCTG GTAGCGGTCG GTGTCGAACA 

501 CGCGGACGCT GACTTTCCTG CCTTCCGCCG CCGCGCGCAG GTTGTCGCGC 

551 GAACGCGTGC CGTAAGCCTG TTTCATCTCC GGCGCGTCGA TATACGCCAT 

601 CCGGATTTTG TGTTTCGCGC CGTCGCCGTC GATAACGTGA AGGGTGTCGC 

651 CGTCATAGAC TTTGGACACC GTGCCTGTGT AGCGGTGGCC GGATTTCGCC 

701 GATGCTCGGC GGCGGGCGGG CGCGTCGGAA CCCGCGTCCC CTGCCGCGCC 

751 GAGTACGTCG AGTACGGCAA CCGCCGTCCG CACCGCCTCG CTGCCGTACC 

801 CCGTATAACC CAACGCACCC AAAAGCGACA AGGCGACGGG AAGCCATTTC 

851 ATGATTTTTT TAATCTGCAT ATTTTTCAAA TGCCGATGCC GTCTGAACAT 

901 ATCGGAATCG GATTTCAGAC GGCATCTTAA 

This corresponds to the amino acid sequence <SEQ ID 850; ORF 231-l.a>: 

a231-l.pep 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 

51 ARGFQTAFAV QSRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 

101 SFPQSFAFPF RLNLSVG IIG KIRLILLFFS ITFIM PRRPV LHLHQVQIGF 

151 ADRNLRHFAL VAVGVEHADA DFPAFRRRAQ WARTRAVSL FHLRRVDIRH 

201 PDFVFRAVAV DNVKGVAVID FGHRACVAVA GFRRCSAAGG RVGTRVPCRA 

251 EYVEYGNRRP HRLAAVPRIT QRTQKRQGDG KPFHDFFNLH IFQMPMPSEH 

301 IGIGFQTAS* 



a231-l/m231-l 99.0% identity in 309 aa overlap 



10 20 30 40 50 60 

a2 31-1. pep MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 
I I I 1 I I 1 1 I I I I I I I I i I I I I II I I 1 1 I I I I 1 1 I I I I II I II I I I II I I I I ! I I { I I I I I 

m231-l MSKRKSINRPYQKPAELPPLQNNPPFYRKKRRLNFFIAADGGCASPQKCRARGFQTAFAV 
10 20 30 40 50 60 



70 80 90 100 110 120 

a2 3 1-1. pep QSRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRIFSFPQSFAFPFRLKLSVGIIG 
1 I I M I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m231-l QS RA V S LPNAQP FAHG T H P I LI A P AAP ACS AVRPRRLRI FS FPQS FAF P FRLNLS VGI I G 

70 80 90 100 110 120 



130 140 150 160 170 180 

a231-l.pep KIRLILLFFSITFIMPRRPVLHLHQVQIGFADRNLRHFALVAVGVEHADADFPAFRRRAQ 
I I I M I i M 11 I I I I I I i I I ! I I I I I I I I I I I I I I I I I I I ! ! II : I I 1 I 1 I I t I I I I t I 
m2 3 1 - 1 KIRLI LLFFS ITFIMPRRP VLHLHQVQI GFADRNLRHFALVAVGI EHAHADFPAFRRRAQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

a2 3 1 - 1 . pep WARTRAVSL FHLRRVDIRH PDFV FRAVAVDNVKGVAV I DFGHRAC VAVAGFRRCS AAGG 
I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I II I I I I I I I I I I I I I II I I I I 
m2 3 1 - 1 WART RAVS LFHLRRVDI RH PDFV FRAVAVDNVKGVAV I DFGHRAC VAVAGFRRCS AAGG 

190 200 210 220 230 240 



250 260 270 280 290 300 

a231-l .pep RVGT RV PC RAE Y VE YGN RRP HRLAAV PRI TQRTQKRQGDGKP FHDFFN LH I FQMPM PS EH 
i I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
m231-l RVGTRVPC RAEYVEYGNRRPHRLAAVPRITQRTQKRQGDGKP FHDFFN LHIFQMPMPSEH 

250 260 270 280 290 300 



310 

a231-l .pep IGIGFQTASX 
I I I I I I I I I I 
m231-l IGIGFQTASX 
310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 85 1>: 

g232 .seq 

1 atgatgggca acagcctgat tgaatccggt acgtttgtcg ccatcctgtt 

51 tggtcagatt ttgggaacgg cggttgccgg cgcgccgcct tatattgtcg 

101 ggatactggt tttgctggtc gccgtcggag gaacggccgg cagcctgttt 
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151 atgccgtccg tacccgccaa ggctgccgat acccaaatcg agtggaatat 

201 tgtccgtggfc acaaaatccc tgctgcgtga aacggtgcgg cacaatcccg 

251 tttttaccgc cattatcggc atctcgtggt tttggtttgt cggcgcggtt 

301 tataccacgc aactgccgac ctttacccaa atccatttgg gcggcaacga 

351 taatgttttt aacctgatgc ttgctttgtt ttccatcggt attgccgccg 

401 gttcggtact gtgtgccaag ttcggcaggg aacggctgat gttggcttgg 

451 gtaacggttg gfcgcgttggg ttcgacggtt tgcggcctgg ttttggtgtg 

501 gctgacgcac ggacaccgtt ttgaagggct gaacggcatt ctctggtttt 

551 tatcgcaagg atgggcatac cccgtgatgg cggtgatgac gctgatcggc 

601 tttttcggcg gatttttctc cgttccgctc tatacctggc tgcaaaccgc 

651 cagcagcgag actttccgcg cccgcgccgt tgccgccaac aatatcgtta 

701 acggcatctt tatggtttcc gccgccgttL tgagcgcggt attgctgttt 

751 ttgtttgaca gcatttccct gctgtatctg attgtcgcct tgggcaatat 

801 tccgttggcg gtatttttga ttaagcgcga aaggcggttt ttaggcgcgg 

851 cggcaatcag gaaaaaacct tga 

This corresponds to the amino acid sequence <SEQ ID 852; ORF 232.ng>: 

g232 .pep 

1 MMGNSLIESG TFVAILFGQI LGTAVAGAPP YIVGILVLLV AVGGTAGSLF 

51 MPSVPAKAAD TQIEWNIVRG TKSLLRETVR HNP VFTAIIG I 5WFWFVGAV 

101 YTTQLPTFTQ IHLGGNDNVF N LMLALFSIG IAAGSVL CAK FGRERLMLAW 

151 VTVGALGSTV CGLVLVW LTH GHRFEGLNGI FWFLSQGWAY P VMAVMTLIG 

201 FFGGFFSV PL YTWLQTASSE TFRARAVAAN NIVN GIFMVS AAVLSAVLLF 

251 LFDSIS LLYL I VALGNI PLA VFLI KRERRF LGAAAIRKKP * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 853>: 



m232 . seq 












1 


ATGATGGGCA 


ACAGCCTGAT 


TGAATCGGGT 


ACGTTTGTCG 


CCATCCTGTT 


51 


CGGTCAGATT 


TTGGGAACGG 


CGGTGGCAGG 


TGTACCGCCT 


TATATTGTCG 


101 


GGATACTGGT 


TTTGCTGGTC 


GCCGTCGGAG 


GCACGGTCGG 


CAGCCTGTTT 


151 


ATGCCGTCCG 


TACCCGCCAA 


GGCTGCCGAT 


ACACAAATTG 


AGTGGAATAT 


201 


TGTCCGTGGC 


ACAAAATCCC 


TGCTGCGTGA 


AACGGTGCGG 


CACAAGCCCG 


251 


TTTTTACCGC 


CATTATCGGT 


ATTTCGTGGT 


TTTGGTTTGT 


CGGCGCGGTT 


301 


TATACCACGC 


AACTGCCGAC 


CTTTACCCAA 


ATCCATCTGG 


GCGGCAACGA 


351 


CAATGTTTTC 


AACCTGATGC 


TTGCTCTGTT 


TTCCATCGGT 


ATTGCCGCCG 


401 


GTTCGGTACT 


GTGTGCCAAG 


TTCAGCAkGG 


AACGC CTGAT 


GTTGGCTTGG 


451 


GTAACGGTTG 


GTGCGTTGGG 


TTTGACGGTT 


TGCGGCTTGG 


TTTTGGTGTG 


501 


GCTGACGCAC 


GGACACCGTT 


TTGAAGGGCT 


GAACGGCATT 


TTTTrGTTTT 


551 


TATCGCAAGG 


ATGGGCATAT 


CCCGTGATGG 


CGGTGATGAC 


GCTGATCGGC 


601 


TTTTTCGGCG 


GATTTTTCTC 


CGTTCCGCTC 


TATACCt (g) TG CAAACCGCCa 


651 


TAGCGAGaTT 


TCCGCGCCCg 


GCCGTTGCCG 


CCAACAATAT 


CGTTAACGGT 


701 


ATTTTTATGG 


TTTCCGCTGC 


CGTTTTGAGC 


GCGGTGTTGC 




751 


TGACAGCATT 


TCCTTGTTGT 


ATCTGATTGT 


CGCTTTGGGC 


AATATTCCGT 


801 


TGTCGGTATT 


TTTGATTAAG 


CGCGAAAGGC 


GGTTTTTAGG 


CGCGGCGGCA 


851 


ATCAGGAAAA 


AACCTTGA 









This corresponds to the amino acid sequence <SEQ ID 854; ORF 232>: 

m232 .pep 

1 MMGNSLIESG TFVAILFGQI LGTAVAGVPP YIVGILVLLV AVGGTVGSLF 

51 MPSVPAKA AD TQIEWNIVRG TKSLLRETVR HKP VFTAIIG ISWFWFVGAV 

101 YTTQLPTFTQ IHLGGNDNVF N LMLALFSIG IAAGSVL CAK FSXERLMLAW 

151 VTVGALGLTV CGLVLVW LTH GHRFEGLNGI FXFLSQGWAY P VMAVMTLIG 

201 FFGGFFSV PL YTVQTAI7VRF PRPAVAANNI VNG I FMVSAA VLSAVLLFLF 

251 DSIS LLYLIV ALGNIPLSVF LI KRERRFLG AAAIRKKP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 232 shows 94.1% identity over a 290 aa overlap with a predicted ORF (ORF 232.ng) 
from N. gonorrhoeae: 

m232/g232 



10 20 30 40 50 60 

m232.pep MMGNSLI ESGTFVAI LFGQILGTAVAGVPPYIVGI LVLLVAVGG WGSLFMPSVPAKAAD 

1 1 1 [ 1 1 1 1 1 [ i i I [ 1 1 1 1 1 1 1 1 f 1 1 1 ! 1 ! 1 1 f I f 1 1 1 M f 1 1 1 = 1 1 f 1 1 1 1 1 1 1 1 1 1 1 

g232 MMGNS LIES GT FVAI LFGQ I LGT AV AG APP Y I VG I LVLLVAVGGTAG S LFM PS VPAKAAD 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m232 . pep TQIEWNIVRGTKSLLRETVRHKPVFTAI IGISWFWFVGAVYTTQLPTFTQIHLGGNDNVF 

iii iiiiiiiiiiiii i ii i hi ii i ii 1 1 iii i ii iii iii i mi iii nun mi 

g232 TQIEWNIVRGTKSLLRETVRHNPVFTAI IGI SWFWFVGAVYTTQLPTPTQIHLGGKDNVF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m232 . pep NLMLALFS IGIAAGSVLCAKFSXERLMLAOTWGALGLTVCGLVLVWLTHGHRFEGLNGI 

llllllllllllllllllllh llllllllllllll I II I II II I II I Mill Ml I I 
g232 NLMLMjFSIGIAAGSVLCA1G?GRER1jMLAWVTVGALGSWCGLVLWLTH 

130 140 150 160 170 180 

190 200 210 220 230 

m232 . pep FXFLSQGWAYPVMAVMTLIGFFGGFFSVPLYT- VQTAIARFPRP-AVAANNIVNGIFMVS 

I Mllllllllllllllllllllllllllll MM s: I IIIIIIIIIIIIIM 
g232 FWFI^QGWAYPVl^VMTLIGFFGGFFSVPLYTWLQTASSETFRARAVAANNIVNGIFMVS 

190 200 210 220 230 240 

240 250 260 270 280 289 

m232 .pep AAVLSAVLLFLFDSISLLYLI VALGNI PLSVFL1 KRERRFLGAAAIRKKPX 

II MM II II I HUM II Ml III 1 1 Ihlll III III Mill Mill I 

g232 AAVLSAVLLFLFDS ISLLYLI VALGNI PLAVFLI KRERRFLGAAAI RKKP 

250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 855>: 

a232 . seq 

1 ATGTACGCTA AAAAAGGCGG TTTGGGACTG GTTAAAAGCC GCCGTTTCGC 

51 ACCTCTTTTC GCTACGCAGT TTCTCGGCGC GTTCAACGAC AATGTGTTCA 

101 AAACCGCGCT GTTTGTGATG ATTGGGTTTT ACGGTTTGGG GCAAAACGGC 

151 TTCCTGCCTG CCGGACAGAT GTTGAACTTG GGCGCGTTGC TGTTTATTTT 

201 GCCGTATTTC CTGTTTTCCT CGCTGTCGGG GCAGTTGGGT AACAAATTCG 

251 ACAAGGCCGT TTTGGCGCGT TGGGCCAAGG TGCTGGAAAT GATCATTATG 

301 GCGGTGGCGG CATACGGGTT TTATATCCGG TCTGCCCCGC TGCTTTTGGC 

351 GTGTCTGTTT TGCATGGGCG CGCAATCGAC GCTGTTCGGG CCGCTGAAAT 

401 ACGCCATCCT GCCCGATTAT CTCGACGACA AAGAGTTGAT GATGGGCAAC 

451 AGCCTGATTG AATCGGGTAC GTTTGTCGCC ATCCTGTTCG GTCAGATACT 

501 GGGGACTGCG GTGGCAGGTG TACCGCCTTA TATTGTCGGG ATACTGGTTT 

551 TGCTGGTCGC CGTAGGAGGC ACGGTCGGCA GCCTGTTTAT GCCGTCCGTA 

601 CCCGCCAAGG CTGCCGATAC. ACAAATTGAG TGGAATATTG TCCGGGGTAC 

651 AAAATCCCTG CTGCGTGAAA CGGTGCGGCA CAAGCCCGTT TTTACCGCCA 

701 TTATCGGTAT TTCGTGGTTT TGGTTTGTCG GCGCGGTT7A TACCACGCAA 

751 CTGCCGACCT TTACCCAAAT CCATCTAGGC GGCAACGACA ATGTTTTCAA 

801 CCTGATGCTT GCCCTGTTTT CCATCGGTAT TGCCGCCGGT TCGGTACTGT 

851 GTGCCAAGTT CAGCAGGGAA CGGCTGAGGT TGGCTTGGGT AACGGTTGGT 

901 GCGTTGGGTT TGACGGTTTG CGGCTTGGTT TTGGTGTGGC TGACGCACGG 

951 ACACCGTTTT GAAGGGCTGA ACGGCATTTT TTGGTTTTTA TCGCAAGGAT 

1001 GGGCATATCC CGTGATGGCG GTGATGACGC TGATCGGCTT TTTCGGCGGA 

1051 TTTTTCTCCG TTCCGCTCTA TACCTGGCTG CAAACCGCCA GTAGCGAGAC 

1101 TTTCCGCGCC CGCGCCGTTG CCGCCAACAA TATCGTTAAC GGTATTTTTA 

1151 TGGTTTCCGC TGCCGTTTTG AGCGCGGTGT TGCTGTTT7T GTTTGACAGC 

1201 ATTTCCTTGT TGTATCTGAT TGTCGCTTTG GGCAATAT7C CGTTGTCGGT 

1251 ATTTTTGATT AAGCGCGAAA GGCGGTTTTT AGGCGCGGCG GCAATCAGGA 

1301 AAAAACCTTG A 

This corresponds to the amino acid sequence <SEQ ID 856; ORF 232.a>: 

a232.pep 

1 MYAKKGGLGL VKSRRFAPLF ATQFLGAFND NVFKTALFVM IGFYGLGQNG 

51 FLPAGQ MLNL GALLFILPYF LFS SLSGQLG NKFD KAVLAR WAKVLEMIIM 

101 AVAAYGFYIR SAP LLLACLF CMGAQSTLFG PLKYAILPDY LDDKELMMGN 

151 SLIESGT FVA ILFGQILGTA VAGV PPYIVG ILVLLVAVGG TVG3LFM PSV 

201 PAKAADTQIE WNI VRGTKSL LRETVRHK PV FTAIIGISWF WFVGAVY TTQ 

251 LPTFTQIHLG GNDNVFNLML ALFSIGIAAG SVLCAKFSRE RLRLAWVTVG 
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301 ALGLTVCGLV LW LTHGHRF EGLNGIFWFL SQGWAY PVI4A VMTLIGFFGG 
351 FFSVPLYTWL QTASSETFRA RAVAANNIVN G IFNVSAAVL SAVLLFLF DS 
401 IS LLYLIVAL GNIPLSVFLI KRERRFLGAA AIRKKP* 

m232/a232 95.9% identity in 290 aa overlap 

10 20 30 

Itl232 .pep MMGNSLIESGTFVAILFGQILGTAVAGVPP 

I I I I I I I I I i I I 1 I I t I I I } I I I I II I II I 
a232 ACLFCMGAQSTLFGPLKYAILPDYLDDKELMMGNSLIESGTFVAILFGQILGTAVAGVPP 
120 130 140 150 160 170 

40 50 60 70 80 90 

ro232.pep YIVGILVLLVAVGGTVGSLFMPSVPAKAADTQIEWNIVRGTKSLLRETVRHKPVFTAIIG 
II i I I I I I II I I I I I i I 1 1 I M I I I I I I I I II I I I I 1 1 i 1 1 I II I I I I II I It I I I I I I I 

a232 YIVGILVLLVAVGGTVGSLFMPSVPAKAADTQIEWNIVRGTKSLLRETVRHKPVFTAIIG 
180 190 200 210 220 230 

100 110 120 130 140 150 

m232.pep I SWFW FVGAVYTTQLPT FTQI HLGGNDNVFNLMLALFS IGI AAG SVLCAKFSXERLMLAW 

II N I I I I I I I I I I I I I I I I I I I I I I I I | | I | | I I | I | | | | | | | | | | | | | | | IN | | | 
a232 ISWFWFVGAVYTTQLPTFTQIHLGGNDNVFNLMLALFSIGIAAGSVLCAKFSRERLRLAW 
240 250 260 270 280 290 

160 170 180 190 200 210 

m232.pep VTVGALGLTVCGLVLVWLTHGHRFEGLNGIFXFLSQGWAYPVMAVMTLIGFFGGFFSVPL 

I I I I I I I I I I I I I I I II I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I 1 I M I I 
a232 VT VG ALG LTVCGL V L VW LTHGHR FEGLNGIFWFLS QGWA Y P VMAVMT LI G FFGG FF S V P L 

300 310 320 330 340 350 

220 230 240 250 260 

m232 . pep YT-VQTAI ARFPRP-AVAANN IVNGI FMVSAAVLSAVLLFLFDS IS LLYLIVALGN I PLS 

II -III : : I I I 1 I I I I I I 1 I I I I I I I I I I 1 I I 1 I I I i I I I I I I I I I I I I I M I I 
a232 YTWLQTA5 SETFRARAVAANN IVNGI FMVS AAV LSAVLLFLFDS IS LLYLIVALGN I PLS 

360 370 380 390 400 410 



270 280 289 

m232 .pep VFLIKRERRFLGAAAIRKKPX 
I I I I I I I I I I I I I I I I I I I I I 
a232 VFLIKRERRFLGAAAIRKKPX 
420 430 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 857>: 

g233 .seq 

1 atgaaacgca aaaatatcgc gctgattccc gccgccggca tcggggtgcg 

51 tttcggtgcg gacaaaccca agcaatatgt cgaaatcgga agcaaaaccg 

101 ttttagaaca tgtacttggg atttttgaac ggcatgaggc cgtcgatttg 

151 accgtcgtcg tcgtctcgcc cgaagacacg tttgccgata aggttcagac 

201 ggcatttcca caggttcggg tgtggaaaaa cggtggacag acccgcgccg 

251 aaactgtccg caacggtgtg gcaaaactgt tggaaaccgg tttggcggcg 

3 01 gaaaccgaca atattctggt acacgatgcc gcccgctgct gcctgccgtc 

351 tgaagctctg gcgcggttga tagaacaggc gggcaacgcc gccgaaggcg 

401 ggactttggc agttcccgtt gccgatacgc tcaagcgcgc agaaagcgga 

451 caaatcagtg caactgtcga ccgttcgggg ctttggcagg cgcaaacgcc 

501 gcagcttttt caagcgggtt tgctgcaccg cgcattggct gcggaaaact 

551 tgggcggcat taccgatgaa gcgtccgccg tggaaaaact gggtgtgcgt 

601 ccgctactga tacagggcga cgcgcgcaat ttgaaactga cgcagccgca 

651 ggacgcatac atcgtcagcc tgctgctcaa tgccgtctga 
This corresponds to the amino acid sequence <SEQ ID 858; ORF 233.ng>: 

g233 -pep 

1 MKRKNIALIP AAGIGVRFGA DKPKQYVEIG SKTVLBHVLG IFERHEAVDL 

51 TWWSPEDT FADKVQTAPP QVRVWKNGGQ TRAETVRNGV AKLLETGLAA 

101 ETDNILVHDA ARCCLPSEAL ARLIEQAGNA AEGGI LAVPV ADTLKRAESG 
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151 QISATVDRSG LWQAQTPQLF QAGLLHRALA AENLGGITDE ASAVEKLGVR 
201 PLLIOGDARN LKLTQPQDAY IVRLLLNAV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 85 9>: 

m233.seq (partial) 

1 ATGAAGCGCA AAAATATCGC GCTGATTCCC GCCGCCGGCA TCGGGGCGCG 

51 TTTCGGTGCG GACAAACCCA AGCAATATGT CGAAATCGGA AGCAAAACCG 

101 TTTTAGAACA TACGATTGGG ATTTTTGAAC GGCATGAGGC CGTCGATTTG 

151 ACCGTCGTTG TCGTCTCGCC CGAAGACACG TTTGCCGATA AGGTTCAGAC 

201 GGCATTTCCA CAGGTTCGGG TGTGGAAAAA CGGCGGACAG ACCCGCGCCG 

251 AAACCGTCCG CAACGGTGTG GCAAAACTGT TGGAAACCGG TTTGGCGGCG 

301 GAAACCGACA ATATTCTGGT ACACGATGCC GCGCGTTGCT GCCTGCCGTC 

351 TGAAGCTTTG ACGCGGTTGA TAGAACAGGC GGGCAACGCC GCCGAAGGCG 

401 GGATTTTGGC AATTCCCATT GCCGATACGC TCAAGTGCGC GGACGGTGGG 

451 AACATT 

This corresponds to the amino acid sequence <SEQ ID 860; ORF 233>: 

m233.pep (partial) 

1 MKRKNIALIP AAGIGARFG A DKPKQYVEIG SKTVLEHTIG I FERHEAVDL 
51 TWWSPEDT FADKVQTAFP QVRVWKNGGQ TRAETVRNGV AKLLETGLAA 
101 ETDNILVHDA ARCCLPSEAL TRLIEQAGNA AEGGILAIPI ADTLKCADGG 
151 NI 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 233 shows 93.4% identity over a 1 52 aa overlap with a predicted ORF (ORF 233.ng) 
from N. gonorrhoeae: 

rr\233/g233 

10 20 30 40 50 60 

m233 .pep MKRKNIALIPAAGIGARFGADKPKQYVEIGSKTVLEHTIGIFERHEAVDLTVVVVSPEIDT 

ii i ii i ii ill ii ihi ii mi i iii nun in h: ii i iii mi i ii ii i in n 

g233 MKRKNIALIPAAGIGVRFGADKPKQYVEIGSKTVLEHVLGIFERHEAVDLTVVVVSPEDT 
10 20 30 40 50 60 



70 80 90 100 110 120 

m23 3 . pep FADKVQTAFPQWVWKNGGQTRAETVRNGVAKLLETGLAAETDNIIjVHDAARCCLPSEAL 

lllllllllll Mill III ill III I IN I III III Mi llllimillll Ml Mill 

g23 3 FADKVOTAFPQVRVWKNGGOTRAETVRNGVAKLLETGLAAETDNILWIDAARCCLPSEAL 
70 80 90 100 110 120 



130 140 150 

m23 3 . pep TRLI EQAGNAAEGGILAI PI ADTLKCADGGNI 

: I I I I II I I I I I I II I I : I : I I I I I 1-1 = 1 
g23 3 ARLIEQAGNAAEGGIIAVPVADTLKRAESGQISATVDRSGLWQAQTPQLFQAGLLHRAIiA 
130 140 150 160 170 180 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 86 1>: 

a233.seq 

1 ATGAAGCGCA AAAATATCGC GCTGATTCCC GCCGCCGGCA TCGGGGCGCG 

51 TTTCGGTGCG GACAAACCCA AGCAATATGT CGAAATCGGA AGCAAAACCG 

101 TTTTAGAACA TACGATTGGG ATTTTTGAAC GGCATGAGGC CGTCGATTTG 

151 ACCGTCGTTG TCGTCTCGCC CGAAGACACG TTTGCCGATA AGGTTCAGAC 

201 GGCATTTCCA CAGGTTCGGG TGTGGAAAAA CGGCGGACAG ACCCGCGCCG 

251 AAACTGTCCG CAACGGTGTG GCAAAATTGT TGGAAACCGG TTTGGCGGCG 

301 GAAACCGACA ATATTCTGGT ACACGATGCC GCGCGTTGCT GCCTGCCGTC 

351 TGAAGCTTTG ACGCGGTTGA TAGAACAGGC GGGCAACGCT GCCG7AAGGTG 

4 01 GGATTTTGGC AATTCCCGTT GCCGATACGC TCAAGTGCGC GGACGGTGGG 

4 51 AACATTAGTG CAACCGTCGA GCGGACGAGC CTTTGGCAGG CGCAAACGCC 

501 GCAGCTTTTC CGCGCCGGGC TGCTGCACCG CGCATTGGCT GCGGAAAACT 

551 TGGACGGCAT TACCGATGAA GCGTCCGCCG TGGAAAAATT GGGCATCCGC 

601 CCTTTGCTGG TGCAGGGCGA CGCGCGCAAT TTGAAACTGA CGCAGCCGCA 

651 GGACGCATAC ATCGT CAGGC TGCTGCTCGA TGCCGTCTGA 
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This corresponds to the amino acid sequence <SEQ ID 862; ORF 233.a>: 

a233 . pep 

1 MKRKNIALIP AAGIGARFG A DKPKQYVEIG SKTVLEHTIG I FERHEAVDL 

51 TVWVSPEDT FADKVQTAFP QVRVWKNGGQ TRAETVRNGV AKLLETGLAA 

101 ETDNILVHDA ARCCLPSEAL TRLIEQAGNA AEGGILAIPV ADTLKCADGG 

151 N I SAT VERTS LWQAQTPQLF RAGLLHRALA AENLDGITDE ASAVEKLGIR 

201 PLLVQGDARN LKLTQPQDAY IVRLLLDAV* 

m233/a233 99.3% identity in 1 52 aa overlap 

10 20 30 40 50 60 

MKRKN I ALIPAAGIGARFGADKPKQYVE I G SKTVLEHTIG I FERHEAVDLTVVVVSPEDT 
I I I I I II M 1 1 I I I I 1 1 I I I I 1 1 I I 1 1 II I I I I I I I I I I I II I I I II 1 1 I II I I I I I I I 
MKRKN I AL I PAAG I G AR FG ADK PKQ YVE I G S KTVLE HT I G I FERHE AV DLT VWV S PE DT 
10 20 30 40 50 60 

70 • 80 90 100 110 . 120 

FADKVQTAFPQVRVWKNGGQTRAETVRNGVAKLLETGLAAETDNILVHDAARCCLPSEAL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I 
FADKVQTAFPQVRVWKNGGQTRAETVRNGVAKLLETGLAAETDNILVHDAARCCLPSEAL 
70 80 90 100 110 120 

130 140 150 

TRL IEQAGN AAEGGI LAI P I ADTLKCADGGN I 
I I 1 1 I I I I I M 1 1 I M I I I : I I I I I I I I I I II 

TRLIEQAGN AAEGGI LAIPVADTLKCADGGNISATVERTSLWQAQTPQLFRAGLLHRA LA 
130 140 150 150 170 180 

AENLDGITDEASAVEKLGIRPLLVQGDARNLKLTQPQDAYIVRLLLDAVX 
190 200 210 220 230 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 863>: 
g234 .seq 

1 atgaaaaccg tttccgccgc catcgctttt 

51 gaccggctgt gcgaccgagt cctcacgcag 
101 cctcctgcaa tacgcaatat cacggtgttc 

151 acattcgaca accgctccag cttccaaaaa 

201 agaccgtctg ggcagccagg caaaaaccat 

251 aaaccaaccg cttcaacgta ctgaaccgca 
301 caggaatccg gcatttccgg caaagcgcag 
351 tgtcgttacc ggcgatgtaa ccgaattcgg 

401 atcagctctt cggcattttg ggtcgcggca 

451 aaagtggctc tgaatatcgt caacgtcaat 

501 cacacagggc gcgggcgaat acgcactttc 
551 tcggcggcac ttccggctac gatgcgactt 
601 ttggcaatcc gcgaagccgt cgacaacttg 

651 cgcatggcaa tccaaccgtt aa 

This corresponds to the amino acid sequence <SEQ ID 864; ORF 234.ng>: 

g234 .pep 

1 MKTVSAAIAF AAAA VSLTGC ATESSRSLEV AKVASCNTQY HGVRTPISVG 

51 TFDNRSSFQK GIFSDSEDRL GSOAKTILVT HLQOTNRFNV LNRTNLSALK 

101 QESGISGKAQ NLKGADYWT GDVTE FGRRD VGDHQLFGIL GRGKSQIAYA 

151 KVALNIVNVN TSEIVYSTQG AGEYALSNRE I IGFGGTSGY DAT LNG KVLD 

201 LAIREAVDNL VQAVDNGAWQ SNR* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 865>: 

m234.seq (partial) 

1 . . .GGCGCGGGCG AATACGCACT TTCCAACCGt GAAATCATCG GTTTCGGCGG 
51 CACTTCCGGC TACGATGCGA CTTTGAACGG CAAAGTTTTA GACTTGGCAA 

101 TCCGCGAAGC . gTCAACAGC CTGGTTCAGG CTGTTGACAA CGGCG CAT GG 

151 CAACCCAACC GTTAA 

This corresponds to the amino acid sequence <SEQ ID 866; ORF 234>: 



m233.pep 
a233 

m233.pep 
a233 

m233 .pep 
a233 

a233 



gccgccgccg ccgtttcact 
cctcgaggnt gcaaaagtcg 
gcaccccgat ttccgtcgga 
ggcattttct ccgacagtga 
cctggtaaca cacctgcaac 
ccaaccttag cgcattgaaa 
aacctgaaag gcgcagatta 
acgcagagat gtcggcgatc 
aatcgcaaat cgcctatgca 
acttccgaaa tcgtctattc 
caaccgcgaa atcatcggtt 
tgaacggcaa agttttagac 
gttcaggctg tcgacaacgg 
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m234.pep (partial) 

1 . . GAGEYALSNR EIIGFGGTSG YDATLNGKVL DLAIREAVNS LVQAVDNGAW 
51 QPNR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 234 shows 94.4% identity over a 54 aa overlap with a predicted ORF (ORF 234.ng) 
from N. gonorrhoeae: 

m234/g234 



10 20 30 

m234.pep GAGEYALSNRE I I GFGGTSG YDATLNGKVL 

IMMIIIMMMMIlim MINI II 

9234 LGRGKSQI AYAKVALNI VNVNTS E I VYSTQGAGBYALSNRE 1 1 GFGGTSGYDATLNGKVL 

140 150 160 170 180 190 

40 50 
m234 . pep DLAI REAVNSLVQAVDNGAWQPNRX 

III llllh'lllllllllll III 
g2 3 4 DLAI RE AVDN LVQAVDNGAW Q SNRX 

200 210 220 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 867>: 

a234 . seq (partial ) 

1 AACCGCACCT ATTTGAACGC ATTAAAACAG GAATCCGGCA TTTCCGGCAA 

51 AGCGCATAAC CTGAAAGGCG CAAATTATGT CGNNACCGGC GATGTAACCG 

101 AATTCGGACG CANAGATGTC GGCGATCATC AGCTCTTCGG CATTTTGGGT 

151 CGCGGCAAAT CGCAAATCGC CTATGCAAAA GTGGCTCTGA ATATCGTCAA 

201 CGTCAATACT TCCGAAATCG TCTATTCCGC ACAGGGCGCG GGCGAATACG 

251 CACTTTCCAA CCGTGAAATC ATCGGTTTCG GCGGCACTTC CGGCTACGAT 

301 GCGACTTTGA ACGGCAAAGT TTTAGACTTG GCAATCCGCG AAGCCGTCAA 

351 CAGCCTGGTT CAGGCTGTTG ACAACGGCGC ATGGCAACCC AACCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 868; ORF 234.a>: 

a234.pep (partial) 

1 NRTYLNALKQ ESGISGKAHN LKGANYVXTG DVTEFGRXDV GDHQLFGILG 

51 RGKSQIAYAK VALNIVNVNT SEIVYSAQGA GEYALSNREI IGFGGTSGYD 

101 ATLNGKVLDL AIREAVNSLV QAVDNGAWQP NR* 



m234/a234 1 00.0% identity in 54 aa overlap 

10 

30 

m234 . pep 

GAGEYALSNRE 1 1 GFGGTSGYDATLNGKVL 



I I i I I I I M I ! II I I I I I I I I I I I I I i II 

a234 

LG RGKSQI AYAKVALNI VNVNTSEIVYSAQGAGEYALSNREI I GFGGTSG YDATLNGKVL 

50 60 70 80 90 100 



40 50 
m234 . pep DLAI REAVN S L VQ AV DNG AWQ PN RX 

I I I I I I I I I I I I I I I I I I I I I I I I 
a234 DLAIREAVNSLVQAVDNGAWQPNRX 
110 120 130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 869>: 

g235 . seq 



WO 99/57280 



PCT/US99/09346 



536 

1 atgaaacctt tgattttagg gcttgccgcc gtgttggctc tgtctgcctg 

51 ccaagttcga aaagctcccg acctcgacta cacgtcattc aaagaaagca 

101 aaccggcttc aattttggtg gttccgccgc tgaacgagtc gcctgatgtc 

151 aacggcactt gggggatgct ggcttcgacc gccgcgccga tttccgaagc 

201 cggctattac gtctttcccg ccgcagtcgt ggaggaaacc ttcaaagaaa 

251 acggcttgac caatgccgcc gatattcacg ccgtccggcc ggaaaaactg 

301 catcaaattt tcggcaatga tgcggttttg tacattacgg ttaccgaata 

351 cggcacttca tatcaaattt tagacagcgt gacgaccgua tccgccaaag 

401 cacggctggt cgatccccgc aacgggaaag agttgtgg-c gggcccggcc 

451 agcatccgcg aaggcagcaa caacagcaac agcggcctgt tgggggcttt 

501 ggtcggcgca gtggtcaatc agattgccaa cagcctgacc gaccgcggtt 

551 atcaggtttc caaaaccgcc gcatacaacc tactgtcgcc ctattcccgc 

601 aacggtatct tgaaaggtcc gagattcgtc gaagagcagc ccaaataa 

This corresponds to the amino acid sequence <SEQ ID 870; ORF 235.ng>: 

g235 .pep 

1 MKPLILGLAA VLALS ACQVR KAPDLDYTSF KESKPASILV VPPLNESPDV 

51 NGTWGMLAST AAPISEAGYY VFPAAWEET FKENGLTNAA DIHAVRPEKL 

101 HQIFGNDAVL YITVTEYGTS YQILDSVTTV SAKARLVDSR NGKELWSGSA 

151 SIREGSNNSN S G L LGALVG A WNQIANSLT DRGYQVSKTA AYNLLSPYSR 

201 NGILKGPRFV EEQPK* 

The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 871 >: 



m235 . eeq 




1 


ATGAAACCTT 


51 


CCAAGTTCAA 


101 


AACCGGCTTC 


151 


AACGGAACAT 


201 


CGGCTATTAC 


251 


ACGGCTTGAC 


301 


CATCAGATTT 


351 


CGGCACTTCA 


401 


CACGGCTGGT 


451 


AGCATCCGCG 


501 


GGTCAGCGCA 


551 


ATCAGGTTTC 


601 


AACGGCATCT 



This corresponds to the amino acid sequence <SEQ ID 872; ORF 235>: 

m235 .pep 

1 MKPLILGLAA VLALSA CQVQ KAPDFDYTSF KESKPASILV VPPLNESPDV 

51 NGTWGVLAST AAPLSEAGYY VFPAAWEET FKQNGLTNAA DIHAVRPEKL 

101 HQIFGNDAVL YITVTEYGTS YQILDSVTTV SAKARLVDSR NGKELWSGSA 

151 SIREGSNNSN SGLLGALVSA WNQIANSLT DRGYQVSKTA AYNLLSPYSH 

201 NGILKGPRFV EEQPK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 235 shows 96.7% identity over a 215 aa overlap with a predicted ORF (ORF 235.ng) 
from N. gonorrhoeae: 

m235/g235 

10 20 30 40 50 60 

m235 .pep MKPLIIX3LAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESFDVNGTWGVLAST 

Ml II IN M I I I i II llhlllhll IIIIIMII IIIIIMMillllll llhll II 
g235 MKPLI LGLAAVLALSACQVRKAPDLDYTS FKES KP AS I LWPPLNES FDVNGTWGMLAS T 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 3 5 . pep AAPLS EAG Y YVF PAAWEET F KQNGLTNAAD I HAVRPE KLHQ I FGNDAVLYITVTE YGTS 

MhMII III Mill MIIII-.IIIIIMIIIIII IIMIIIIIII Mill I II III II 

g23 5 AAP I SEAGYYVFPAAVVEETFKENGLTNAAD I HAVRPEKLHQI FGNDAVLYITVTE YGTS 

70 80 90 100 110 120 



130 140 150 160 170 180 
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YQI LDSVTTCSAKARLVDSRNGKELWSGSASIREGS 

IIIIMIIMIIIIIMIIMIIII III llllll lllllllllllllhlllllllllll 

YQI LDSVTTVSAKARLVDSRNGKELWSGSAS I REGSNNSNSGLLGALVGAWNQI ANS LT 
130 140 150 160 170 180 

190 200 210 

DRGYQVSKTAAYNLLSP YSHNGI LKGPRFVEEQPKX 

li Mill Mil Mil II IhllillllllllllMI 

DRG YQVS KTAAYNLLS P YS RNG I LKG PRFV EEQ P KX 
190 2O0 210 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 873>: 

a235.seq 

1 ATGAAACCTT TGATTTTAGG GCTTGCCGCC GTGTTGGCGC TGTCTGCCTG 

51 CCAAGTTCAA AAAGCGCCCG ATTTCGACTA CACGTCATTC AAGGAAAGCA 

101 AACCGGCTTC AATTTTGGTG GTTCCGCCGC TGAACGAATC GCCCGATGTC 

151 AACGGAACAT GGGGTGTACT GGCTTCGACC GCCGCGCCGC TTTCCGAAGC 

201 CGGCTATTAC GTCTTCCCCG CCGCAGTCGT GGAGGAAACC TTCAAACAAA 

251 ACGGCTTGAC CAATGCCGCC GAT ATT C AC G CCGTCCGGCC GGAAAAACTG 

301 CATCAGATTT TCGGCAATGA TGCGGTTTTG TACATTACGG TTACCGAATA 

351 CGGCACTTCA TATCAAATTT TAGACAGCGT GACGACCGTA TCCGCCAAAG 

401 CACGGCTGGT CGATTCCCGC AACGGAAAAG AGTTGTGGTC GGGTTCGGCC 

451 AGCAT CCGCG AAGGCAGCAA CAACAGCAAC AGCGGCCTGT TGGGGGCTTT 

501 GGTCAGCGCA GTGGTCAATC AGATTGCCAA CAGCCTGACC GACCGCGGTT 

551 ATCAGGTTTC TAAAACCGCC GCATACAACC TGCTGTCGCC CTATTCTCAC 

601 AACGGCATCT TGAAAGGTCC GAGATTCGTC GAAGAGCAGC CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 874; ORF 235.a>: 

a235.pep 

1 MKPLILGLAA VLALSA CQVQ KAPDFDYTSF KESKPASILV VPPLNESPDV 

51 NGTWGVLAST AAPLSEAGYY VFPAAWEET FKQNGLTNAA DIHAVRPEKL 

101 HQIFGNDAVL YITVTEYGTS YQILDSVTTV SAKARLVDSR NGKELWSGSA 

151 SIREGSNNSN SGLLGALVSA WNQIANSLT DRGYQVSKTA AYNLLSPYSH 

201 NGILKGPRFV EEQPK* 

m235/a235 100.0% identity in 215 aa overlap 

10 20 30 40 50 60 

MKPLI LGLAAVLALSACQVQKAPD FDYTS FKESKPAS I LWPPLNES PDVNGTWGVLAST 

I I I I I I I I I 1 1 1 1 I I I I 1 1 1 1 1 I I I I 1 1 1 1 1 1 II II 1 1 I 1 1 M I I I I I I I 1 1 I I I i 1 1 1 1 

MKPLI LGLAAVLALSACQVQKAPDFDYTSFKESKPAS I LVVPPLNES PDVNGTWGVLAST 
10 20 30 40 50 60 

70 80 90 100 110 120 

AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
I I I M I I I I I i I I I I I I I ! I I I | | 1 | I I | i I I I I | I I | I I I I I I I I | I I I I ! t I ! I I I I 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
70 80 90 100 110 120 

130 140 150 160 170 180 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
I 1 I I t I I I I I I I I I I I I I 1 I I I I I I I I I I II I II I I 1 I : I I I I I I I I i I I I I I I I I I I I I 
YQILDSVTWSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSA WNQIANSLT 
130 140 150 160 170 180 

190 200 210 

DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPKX 

II I 1 1 I 1 1 I I I I I I 1 1 1 1 f M I I 1 1 I I I I I 1 1 II I I 
DRGYQVSKTAAYNLLSP YSHNGI LKGPRFVEEQPKX 
190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 875>: 

g236 . seq 



m235.pep 
g235 

m235.pep 
9235 



m235.pep 
a235 

m235.pep 
a235 

m235.pep 
a235 

m235.pep 
a235 
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1 ATGGCGCGTT TCGCCTTCTC CGCCGACATT CTCCGCACAG CGTTTGCAGA 

51 CGGTTTCATA ACCTGCAACC GCGCCCACAT CGCGGGTGTA ATGCCAGCAG 

101 CGTTCGCATT TTTCGCCGTC GCTGGCTTTG GCGGCAACGG CAAGTTCATC 

151 ACCGACTTTC ACTTCTGCTT TAGACACCAG CAGGGCAAAG CGCAATTCTT 

201 CGCCCAAAGC ATTCAGATAG CCGGCCATTT CTTCCGGCGC GGTAATTTCG 

251 GCTTCCGCCT GCAAggacga accgacagTT TTGTcggcGC GCAAAGGCTC 

301 GAtagcggcg gTTACTGCTT CGCGCGCTTC GCGGATTGCC GTCCATTTTT 

351 TCACCAGTTC GGCTTCGGCT TTTTCGTTGA TGGCCGGGAA CTCGTGCCAA 

401 GTATGGAAGA GGACGCTGTC TTCTTCGCCG CCGCCGATGA TGTCCCACGC 

451 TTCTTCGCCG GTGAAGCACA AAATCGGTGC AATCAAGAGA ACCAGGCTGC 

501 GCGTGATGTG GTACAGGGCG GTTTGCGCGC TGCGGCGGGC GCGGCTGTCG 

551 GCTTTGGTGG TGTAGAGGCG GTCTTTCAGG ATGTCGAGGT AGAACGCGCC 

601 CAAGTCTTCC GAGCAGAAAG AAACAATGTC TTTCACGGCG AAGTGGAAGG 

651 CATAGCGCGG ATAGTAACCG CCTGCCAAAC GCTCTTGCAG CCGCCGCGCC 

701 AATACCAAGG CGTAGCGGTC GATTTCCACC ATATCCGCCT GTTGCACGGC 

751 ATCTTCAATC GGATTAAAGT CGCTCAAATT GGCAAAcagG AAGCTCAAGG 

801 TATTGCGGAT GCGGCGGTAG CTTTCGGTAA CGCGTTTGAG GATTTCTTTG 

851 GAAatcgCCA ATtcgccgct gTAATCGGTG GATGCCGCCC ACAGGCGCAG 

901 GATGTCCGCG CCGAATTCGT TATAGACTTC CTGCGGCGCG ACGACGTTGC 

951 CGATGGATTT CGACATTTTG CGGCCGTTTT GGTCAACCAC GAAACCGTGG 

1001 GTCAGCAGCT GTTTATACGG TGCGCGTCCC ATGGATGA 

This corresponds to the amino acid sequence <SEQ ID 876; ORF 236.ng>: 

g236 .pep 

1 MARFAFSADI LRTAFADGFI TCNRAHIAGV MPAAFAFFAV AGF GGNGKFI 

51 TDFHFCFRHQ QGKAQFFAQS IQIAGHFFRR GNFGFRLQGR TDSFVGAQRL 

101 DSGGYCFARF ADCRPFFHQF GFGFFVDGRE LVPSMEEDAV FFAAADDVPR 

151 FFAGEAQNRC NQENQAARJDV VQGGLRAAAG AAVGFGGVEA VFQDVEVERA 

201 QVFRAERNNV FHGEVEGIAR IVTACQTLLQ PPRQYQGVAV DFHHIRLLHG 

251 IFNRIKVAQI GKQRAQGIAD AAVAFGNAFE DFFGNRQFAA VIGGCRPQAQ 

301 DVRAEFVIDF LRRDDVADGF RHFAAVLVNH ETVGQQLFIR CASHG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 877>: 

m236.seq (partial) 

1 . . TTGCACGGAC GAACCGACGG TTTTGTCGGC 

51 CGGTTACCGC TTCGCGGGCT TCGCGGATTG 

101 TCGGCTTCGG TTTTTTCGTT GATGGTCGGG 

151 GAGGACGCTG TCkTCTTCGC CGCCGCCGwT 

201 CGGTGAAGCA CAAAATCGGT GCAATCAAGA 

251 TGATACAGGG CAGTTTGTGC GCTGCGGCGT 

301 GGTGTAGAGG CGGTCTTTCA GGATGTCGAG 

351 CCGAGCAGAA AGAAACArTG TCTTTTACGG 

401 GGATAGTAAT CGCCTGCCAG ACACTCTTGC 

451 GGCGTAGCGG TCGATTTCCA CCATATCCGC 

501 TCGGATTAAA GTCGCTCAAG TTGGCAAACA 

551 ATACGGCGGT AgCTTTCGGT TACGCGTTTG 

601 CAATTCGCCG CTGTAATCGG TAGATGCCGC 

651 CGCCGAATTC GTTATAAACC TCTTGCGGTG 

701 TTCGACATTT TTTTGCCTTC GCCGTCGACA 

751 CTGTTTATAC GGCGCGCGAC CCATTGA 

This corresponds to the amino acid sequence <SEQ ID 878; ORF 236>: 

m2 3 6 . pep ( partial ) 

1 . . LHGRTDGFVG AQRLDGGGYR FAGFADCRPF FHQFGFGFFV DGRELVPSME 

51 EDAVXFAAAX DVPRFFAGEA QNRCNQENQT ACDVIQGSLC AAACMAVCFG 

101 GVEAVFODVE VERTQVFRAE RNXVFYGKVE XITRIVIACQ TLLQLTCQYH 

151 GVAVDFHHIR LLHGIFNRIK VAQVGKQKAQ GIADTAVAFG YAFEDFFGNR 

201 QFAAVIGRCR PQAQDVCAEF VINLLRCNDV ADGFRHFFAF AVDNETMGQQ 

251 LFIRRATH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 236 shows 82.9% identity over a 258 aa overlap with a predicted ORF (ORF 236.ng) 
from N. gonorrhoeae: 
m236/g236 



GCGCAAAGGC 
CCGTCCATTT 
AACTCGTGCC 
GAyGTCCCAC 
GAACCAAACT 
GCATGGCTGT 
GTAGAACGCA 
CAAAGTGGaA 
AG CTGACGTG 
CTGTTGCACG 
AAAAGCTCAA 
AGGATTTCTT 
CCACAGGCGC 
CAACGACGTT 
ACGAAACCAT 



TCGATGGCGG 
TTTCACCAGT 
AAGTATGGAA 
GCTTCTTCGC 
GCGTGTGATG 
CTGCTTTGGT 
CCCAAGTCTT 
kGCATAACGC 
CCAATACCAC 
GCATCTTCAA 
GGTATTGCGG 
TGGAAATCGC 
AGGATGTCTG 
GCCGATGGAT 
GGGTCAGCAG 
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10 20 30 

LHG RTDGFVGAQRLDGGGYR FAG FAD CRP F 

H II HIM ill hi II M MIIHI 
FRHQQGKAQFFAQS IQI AGHFFRRGNFGFRLQGRTDS FVGAQRLDSGGYCFARFADCRP F 
60 70 80 90 100 110 

40 50 60 70 80 90 

FHQ FGFGFFVDGRELVPSMEEDAVXFAAAXDVPRFFAGEAQNRCNQENQTACDVI QGS LC 

tlllllllllllMltimilll llll I INI II Mil III III Ihl Ihlhl 

FHQFGFGFFVDGRELVPSMEEDAVFFAAADDVPRFFAGEAQNRCNQENQAARDWQGGLR 
120 130 140 150 160 170 

100 110 120 130 140 150 

AAACMAVCFGGVEAVFQDVEVERTQVFRAERNXVFYGKVEXITRIVIACQTLLQLTCQYH 
HI II IIMIIIIIIIIIIhllllllll ||:|:|| |:||| IIIIIM |h 
AAAGAAVG FGGVE AVFQDVEVERAQVFRAERNNVFHGEVEG I ARI VTACQTLLQP PRQ YQ 
180 190 200 210 220 230 

160 170 180 190 200 210 

GVAVDFHHIRLLHGI FNRI KVAQVGKQKAQGI ADTAVAFGYAFEDFFGNRQFAAVIGRCR 

imiiimmmiimihiihiiiiihiim linn Minimi n 

GVAVDFHHIRLLHGI FNRI KVAQIGKQEAQGI ADAAVAFGNAFEDFFGNRQFAAVIGGCR 
240 250 260 270 280 290 

220 230 240 250 259 

PQAQDVCAE FV I NLLRCKFD VADG FRH FFAFAVDNETMGQQL F I RRATHX 

mill m mm mm mi i mi hi mm hi 

PQAQDVRAE FVI DFLRRDDVADGFRH FAAVLVNHE TVGQQL FI RCAS HG 
300 310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 879>: 

a236 . seq 

1 ATGGCGCGTT TCGCCTTCTC CGCCGACATT CTCTGCACAG CGTTTGCAGA 

SI CGGTTTCATG GCCTGCAACC GCGCCCACAT CGCG3GTGTA GTGCCAGCAG 

101 CGTTCGCATT TTTCACCATC ACTGGCTTTA GCGGCAACGG CAAGTTCGCT 

151 GCCTACTTTC ACTTCTGCTT TAGACACCAG CAAA3CAAAG CGCAATTCTT 

201 CGCCCAAAGC ATTCAGATAG CCGGCCATTT CTTCCGGCGC GGTAATTTCG 

251 GCTTCGGCTT GCAAGGACGA ACCGACGGTT TTGTCGGCGC GCAAAGGCTC 

301 GATGGCGGCG GTTACCGCTT CGCGGGCTTC GCGGATTGCC GTCCATTTTT 

351 TCACCAGTTC GGCTTCGGCT TTTTCGTTGA TGGTCGGGAA CTCGTGCCAA 

401 GTATGGAAAA GCACGCTGTC TTCTGCGCCG CCGCCGATGA TGTCCCACGC 

451 TTCTTCGCCG GTGAAGCACA AAATCGGTGC AATCAAGAGA ACCAGGCTGC 

501 GCGTGATGTG GTACAGGGCG GTTTGCGCGC TGCGGCGGGC GCGGCTGTCG 

551 GCTTTGGTGG TATAGAGGCG GTCTTTCAGG ATATCGAGGT AGAACGCGCC 

601 CAAGTCTTCC GAGCAGAAAG AAACCATTTC TTTCACGGCA AAGTGGAAGG 

651 CATAACGCGG ATAAAAATCA CCGGCAACGC GTTCTTGCAG CCGCCTTGCC 

701 AACACCAAGG CATAGCGGTC GATTTCCACC ATATCCGCCT GTTGCACGGC 

751 ATCTTCAATA GGATTGAAGT CGCTCAAGTT GGCAAACAAA AAGCTCAAGG 

801 TATTGCGGAT ACGGCGGTAG CTTTCGGTTA CGCGCTTGAG GATTTCTTTG 

851 GAAATCGCCA ATTCGCCGCT GTAATCGGTG GATGCCGCCC ACAGGCGCAG 

901 GATGTCCGCG CCGAACTCGT TATACACTTC TTGCGGCGCG ACGACGTTGC 

951 CGATGGATTT CGACATTTTG CGCCCGTTTT GATCCACCAC GAAACCATGG 

1001 GTCAGCAGCT GTTTGTACGG CGCGCGACCC ATTGA 

This corresponds to the amino acid sequence <SEQ ID 880; ORF 236.a>: 

a236.pep 

1 MARFAFSADI LCTAFADGFM ACNRAH IAGV VPAAFAFFTI TGF SGNGKFA 

51 AYFHFCFRHQ QSKAQFFAQS IQIAGHFFRR GNFGFGLQGR TDGFVGAQRL 

101 DGGGYRFAGF ADCRPFFHQF GFGFFVDGRE LVPSMEKHAV FCAAADDVPR 

151 FFAGEAQNRC NQENQAARDV VQGGLRAAAG T^AVGFGGIEA VFQDIEVERA 

201 QVFRAERNHF FHGfC/EGITR IKITGNAFLQ PPCQHQGIAV DFHHIRLLHG 

251 IFNRIEVAQV GKQKAQGIAD TAVAFGYALE DFFGNRQFAA VIGGCUPQAQ 

301 DVRAELVIHF LRRDDVADGF RHFAPVLIHH ETMGQQLFVR RATH* 



m23 6 -pep 
g236 

m236 -pep 
g236 

m23 6 .pep 
g236 

m2 3 6 . pep 
g236 

m23 6 .pep 
g236 
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m236/a236 81.0% identity in 258 aa overlap 

10 20 30 

m2 36 . pep LHGRTDGFVGAQRLDGGGYRFAGFADCRPF 

I : I I I I I I I I I I I I I II I I I I 1 I I I I I I I I 
a236 FRHQQSKAQFFAQSIQIAGHFFRRGNFGFGLQGRTDGFVGAQRLDGGGYRFAGFADCRPF 
60 70 80 90 100 110 

40 50 60 70 80 90 

m236 . pep FHQFGFGFFVDGRELVPSMEEDAVXFAAAXDVPRFFAGEAQNRCNQENQTACDVIQGSLC 

I I I I I I I I I I I I I I I I I I II : II III I I I I I I I I I I I I I I I I I I I : I 11:11:1 
a236 FHQFGFGFFVnGRELVPSMEKHAVFCAAADDVPRFFAGEAQNRCNQENQAARDWQGGLR 
120 130 140 150 160 170 

100 110 120 130 140 150 

m236 . pep AAACMAVC FGG VE AV FQDVEVE RTQ V FRAERNXVFYGKVEX I TR I VI ACQT LLQLTCQ YH 

III I I 111:111111:1111:11111111 I : I I I I I I I I I : : : : | | I I : : 
a236 AAAGAAVGFGGIEAVFQDIEVERAQVFRAERNHFFHGEC/EGITRIKITGNAFLQPPCQHQ 
180 190 200 210 220 230 

160 170 180 190 200 210 

m236 . pep GVAVDFHH I RLLHGI FNRI KVAQVGKQKAQG I ADTAVAFGYAFE D FFGNRQFAAVIGRCR 

!: I II I I I I I i I I I I I I I I : I M I I I I I I I I I I I I I I I I I I I : I I 1 I I I I I I M I I I II 
a236 GIAVDFHH I RLLHGI FNRIEVAQVGKQKAQG I ADTAVAFGYALE D FFGNRQFAAV I GGCR 

240 250 260 270 280 290 

220 230 240 250 259 

m236 . pep PQAQDVCAEFVINLLRCNDVADGFRHFFAFAVDNETMGQQLFIRRATHX 
I I I I I I I I : I I:: I I M I I II I I I I : : I I I I I I I I : I I I I II 
a236 PQAQDVRAELV I H FLRRDD VADG FRHFAPVL IHHETMGQQL FVRRATHX 

300 310 320 330 340 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 88 1>: 

g237 . seq 

1 atgcgggaca aggttggcgg taatatcgca ctccccgccc cacgaatatt 

51 cgattctaac atcggcaagc tgcggaaaaa ctttaagcat atcttggcgg 

101 acaagctcgg tcatacgcgc aggattgtcg ataaattcgt tatccttacc 

151 gccgaaaagc agcctgccgt ccgcgctgag gcggtaataa tccaaaatat 

201 ggcggttgtc gcatactgcc atattgttgc ggataagccc ttttgtgcgc 

251 gcgcccaagg gttcggtggc aataataaag gtgctgacgg caatcgcctt 

301 gcgttccaaa ggccggaata tcgggttcaa accgacataa gtattgacgg 

351 catagaccac atttttacac tcgacgctgc cttcgggcgt gtaaaccagc 

401 caaccgtttt gatacggttc gatgcgcgtc atcggggatt gctcgaaaat 

451 ctgcgcgccg gcttcggcag cggcgctggc aacacccaac gtgtaattga 

501 gcggatgaag atgcccggac aagggatcga actgtgcgcc ttggtacata 

551 tcgctgtcaa gctgctgttt caactcggct ttatcccaaa gttgataatg 

601 actcgcaccg taatgccgtt gggcgtgttc atgccactgc tgcaactctt 

651 cccaatgctg cggacggacg gcaaccgtgg cataaccgcg ctgccaatcg 

701 caatcgatgg catgtttgcg gacgcgttcg tccaccagtt cgaccgcctg 

751 caaagactgt tgccaaaacc attgcgcctg ctccaagccg acctgttttt 

801 caatttcccc cataccgcag gcgtagtcgc tgataacctg cccgccactc 

851 ctgccggacg cgccgaagcc gatacgtgcg gcttccaaaa cgacggcttc 

901 atgtccgtgt tccgccagcg gcaatgcggt acacaaaccg ctcaaaccgc 

951 cgccgataat gcaggtttcg gctttcagac ggcattggag tttcggataa 

1001 acagtatgcg gattaaccga actaaaataa taagaaggca gatattcttg 

1051 aaaatcaggg cgaatcattg tgtttgcttt atcgggtata ttttcggacg 

1101 gaatgataca gactgtcggg ccatatcgtc caaacagaaa atcggttga 
This corresponds to the amino acid sequence <SEQ ID 882; ORF 237.ng>: 

g237.pep 

1 MRDKVGGNIA LPAPRI FDSN IGKLRKNFKH ILADKLGHTR RIVDKFVILT 

51 AEKQPAVRAE AVI IQNMAW AYCHIVA DKP FCARAQGFGG NNKGADGNRL 
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101 AFQRPEYRVQ TDISIDGIDH IFTLDAAFGR VNQPTVLIRF DARHRGLLEN 

151 LRAGFGSGAG NTQRVIERMK MPGQGIE LCA LVHIAVKLLF QLGFI PKLIM 

201 TR TVMPLGVF MPLLQLFPML RTDGNRGITA LPIAIDGMFA DAFVHQFDRL 

251 QRLLPKPLRL LQADLFFNFP HTAGWADNL PATPAGRAEA DTCGFQNDGF 

301 MSVFRQRQCG TQTAQTAADN AGFGFQTALE FRINSMRINR TKIIRRQIFL 

351 KIRANHCVCF IGYIFGRNDT DCRAISSKQK IG* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 883>: 



m237 . seq 








1 


ATGCGGGACA 


AGGTTGGCGG 


TAATGTCGCA 


51 


CGATTTTGAC 


ATCGGCAAGC 


TGCGGAAAAA 


101 


ACAAGCTCGG 


TCATaCGCTC 


AGGATTGTCG 


151 


GCCGAAAAGC 


AGTCTGCCGT 


CCGCGCTGAG 


201 


GGCGGTTGTC 


GCATACTGCC 


ATATTGTTAC 


251 


GCCCCCAAGG 


GTTCGGTCGC 


AATAATAAAG 


301 


GCGTTCCAAA 


GGCCGGAATA 


TCGGGTTCAA 


351 


CATAGACCAC 


ATTTTTGCAC 


TCGACGCTGC 


401 


CAACCGTTTT 


GATGCGGTTC 


GATGCACGTC 


451 


CTGCGCACCG 


GCTTCGGCAG 


CGGCACGAGC 


501 


GCGGATGCAG 


GTGTCCGGAT 


AAGGGGTCGA 


551 


TCGCTGTCAA 


GCTGCTGTTT 


CAACTCGGCT 


601 


ACTCGCACCG 


TAATGCCGTT 


GGGCGTGTTC 


651 


CCCAATGCTG 


CGGACGGACG 


GCAACCGTGG 


701 


CAATCGACGG 


CATGTTTGCG 


GACGCGTTCG 


751 


CAAAGACTGT 


TGCCAAAACC 


ATTGCGCCTG 


801 


CAATTTCCCC 


CATACCGCAG 


nCGTAATCGC 


851 


CGTCCCGACG 


CGCCGAAACC 


GATACGCGCG 


901 


ATGTCCCTGC 


TCCGCCAAGG 


GCAATGCAGT 


951 


CGCCGATGAT 


ACAGGTATCG 


GTTTTCAGAC 


1001 


ACAGTATGAG 


GATTAACCGA 


ACTGAAATAA 


1051 


AAAATCAGGG 


CGAATCATTG 


TGTTTGCTTT 


1101 


GAATGATACA 


GGCTGTCGGG 


CCATATCGTC 



This corresponds to the amino acid sequence <SEQ ID 884; ORF 237>: 

m237 .pep 

1 MRDKVGGNVA LP APRI FDFD IGKLRKNFKH ILADKLGHTL RIVDKLVILT 

51 AEKQSAVRAE AVI IQNMAW AYCHIVTD KP FCARPQGFGR NNKGADSNRL 

101 AFQRPEYRVQ TCISIDSIDH I FALDAAFGR VNQPTVLMRF DARHRGLLEN 

151 LRTGFGSGTS DAQSVSERMQ VSGXGVEL CP LVHIAVKLLF QLGFI PKLIM 

201 TR TVMPLGVF MPLLQLFPML RTDGNRGITA LPITIDGMFA DAFVHQFDRL 

251 QRLLPKPLRL LQADLFFNFP HTAXVIADNL PATPSRRAET DTRGFQHNRF 

301 MSLLRQGQCS AQTTQSAADD TGIGFQTALK FRINSMRINR TEIIRRQIFL 

351 KIRANHCVCF IRCIFGRNDT GCRA1SSXQK IG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 237 shows 86.1 % identity over a 382 aa overlap with a predicted ORF (ORF 237.ng) 
from N. gonorrhoeae: 

m237/g237 

10 20 30 40 50 60 

m2 3 7 . pep MRDKVGGNVALPAPRI FDFD I GKLRKNFKH I LADKLGHTLRI VDKLVI LTAEKQS AVRAE 

lllllllhlllllllll :||!IIIIMIIIIIIIIII MIIUIIIIMI Mill 

g237 MRDKVGGNIALPAPRI FDSNIGKLRKNFKHILADKLGHTRRIVDKFVILTAEKQPAVRAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 3 7 . pep AVI I QNMAWAYCH I VTDKP FCARPQG FGRNNKGADSNRLAFQRPE YRVQTCI SI DS I DH 

lllllllllllllllhlllllll llll I II I IN III III I III III II MM 

g2 3 7 AVI I QNMAWAYCH I VADKP FCARAQG FGGNNKGADGNRLAFQR PE YRVQTDI S I DG I DH 

70 80 90 100 110 120 



m237 .pep 



130 140 150 160 170 180 

I FALDAAFGRVNQ PTVLMRFDARHRGLLENLRTG FGSGTSDAQS VS ERMQVSGXGVELCP 
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Ihlll III III lit Ihlllllll III II I hi | I I I-- | | |||:: | |:||| 
g237 I FTLDAAFGRVNOPTVL I RFDARHRGLLENLRAGFGSGAGNTORVI ERMKMPGQGI ELCA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m23 7 . pep LVH I AVKLLFQIX3 F I P KL I MTRTVMP LGV FMPLLQLF PMLRTDGNRG I TALP I T I D GMFA 

II Mil II I III IIMII IIIIM I illlll III II II MMIM III II llhl II I II 

g237 LVHIAVKLLFQLGFIPKLIMTRTVMPLGVFMPbLQLFPMLRTDGNRGITALPIAIDGMFA 
190 200 210 220 230 240 

250 260 270 280 290 300 

m2 3 7 . pep DAFVHQ FDRLQRLL PKP LRL LQADLF FNFPHTAXV I ADNLPATPS RRAETDTRGFQIINRF 

I I I I II I I I I I I I I I I I I I I I I 1 I 1 I I I I I I I I hi Mil |||: |||: | | |||:: | 
g2 3 7 DAFVHQFDRLQRLLPKPLRLLQADLFFNFPHTAGWADNLPATPAGRAEADTCGFQNDGF 

250 260 270 2B0 290 300 

310 320 330 340 350 360 

m2 3 7 . pep MSLLRQGQCSAQTTOSAADDTGIGFQTAIiKFRINSMRINRTEIIRRQIFLKIRANHCVCF 

M::|| l|:'lh|:|||::|:||||lhllll|||||||:||||||||||| IIIIHI 
g2 3 7 MSVFRQRQCGTQTAQTAADNAGFGFQTALE FR INSMRI MRTKI IRRQI FLKIRANHCVCF 

310 320 330 340 350 360 

370 380 
m237 . pep I RC I FGRNDTGCRAI S SXQKI GX 

I lllllll Illlll Mill 
g237 IGYIFGRNDTDCRAISSKQKIGX 
370 380 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 885>: 
a237.seq 

1 ATGCGGGACA AGGTTGGCGG TAATGTCGCA CTCCCCGCCC CACGAATATT 

51 CGATTTTGAC ATCGGCAAGC TGCGGAAAAA CTTTAAGCAT ATCTTGGCGG 

101 ACAAGCTCGG TCATACGCGC GGGATTGTCG ATAAACTCGT TATCCTTACC 

151 GCCGAAAAGC AGTCTGCCGT CCGCGCTGAG GCGGTAATAA TCCAAAATAT 

201 GACGGTTGTC GCATACTGCC ATATTGTTGC GGATAAGCCC TTTTGCACGC 

251 GCGCCCAAGG GTTCTGTGGC AATAATAAAG GTGCTGACAG CAATCGCCTT 

301 GCGCTCCAAA GGCTTGAATA TCGGATTCAA ACCGGCATAA GTATTGACGG 

351 CGTACACCAG ATTTTTGCAT TCGACGCTGC CTTCGGGGGT GTAAACCAGC 

401 CAACCGTTTT GATAAGGTTC AATGCGTATC ATGGGAGAAT GCTCAAAAAT 

451 CTTCGTACCA GCTTCGGCAG CGGCGCGGGC GATGCCCAAC GTGTAATTGA 

501 GCGGATGGAG ATGCCCGGAC AAGGGATCGA ACTGTGCGCC TTGGTACATA 

551 TCGCTGTCAA GCTGCTGCTT CAGTTCAGTG TTATCCCAGA GTTGATAATG 

601 AGTTGCACCG TAATATTTTT GGGCGTGCTC ATGCCATTGT TGCAATTCTT 

651 CCCAATGCTG CGAACGGATG GCAACCGTGG CATAACCGCG CTGCCAATCG 

701 CAATCAATGG CATCTTTGCG GACGCGTTCG TCCACCAGTT CGACCGCCTG 

751 CAAAGACTGT TGCCAAAACC ATTGCGCTTG CTCCAAACCG ACCTGTTTTT 

801 CAATTTCCTC CATACCGCAG GCGTAATCGC TGATAACCTG CCCGCCACTC 

851 CGTCCCGACG CGCCGAAACC GATACGCGCG GCTTCCAACA CAACCGTTTC 

901 ATGTCCCTGC TCCGCCAAGG GCAATGCAGT GCACAAACCA CTCAATCCGC 

951 CGCCGATGAT ACAGGTATCG GTTTTCAGAC GGCATTGAAG TTTCGGATAA 

1001 ACAGTATGAG GATTAACCGA ACTGAAATAA TAAGAAGGCA GATATTCTTG 

1051 AAAATCAGGG CGAATCATTG TGTTTGCTTT ATCGGGTATA TTTTCGGACG 

1101 GAAT GAT AC A GGCTGTCGAG CCATATCGTC CAAACAGAAA ATCGGTTGA 

This corresponds to the amino acid sequence <SEQ ID 886; ORF 237.a>: 

a237 .pep 

1 MRDKVGGNVA LPAPRIFDFD IGKLRKNFKH ILADKLGHTR GIVDKLVILT 

51 AEKQSAVRAE A VIIQNMTW AYCHIVA DKP FCTRAQGFCG NNKGADSNRL 

101 ALQRLEYRIQ TGISIDGVHQ I FAFDAAFGG VNQPTVLIRF NAYHGRMLKN 

151 LRTSFGSGAG DAQRVIERME MPGQGIEL CA LVHIAVKLLL QFSVI PELIM 

201 SCTVIFLGVL MPLL QFFPML RTDGNRGITA LPIAINGMFA DAFVHQ FDRL 

251 QRLLPKPLRL LQTDLFFNFL HTAGVIADNL PATPSRRAET DTRGFQHNRF 

301 MSLLRQGQCS AQTTQSAADD TGIGFQTALK FRINSMRINR TEIIRRQIFL 

351 KIRANHCVCF IGYIFGRNDT GCRAISSKQK IG* 
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m237/a237 85.6% identity in 382 aa overlap 

10 20 30 40 50 60 

m237 . pep MRDKVGGNVALPAPRIFDFDIGKLRKNFKHILADKLGHTLRIVDKLVILTAEKQSAVRAE 
I I f I I I I I I I I I f I I I I I I i I I I I I [ M I M I I I I I I I I I I I I II I I I I I I I I I || | I 
a 2 3 7 MRDKVGGN VALPAPR I FDFDI GKLRKN FKH I LADKLGHTRG I VDKLV I LT AEKQS AVRAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m237 . pep AVIIQNMAWAYCHI^/TDKPFCARPQGFGRNNKGADSNRLAFQRPEYRVQTCISIDSIDH 
I I I I I I I : I I I I ! I I I : I I I I I : I III 11111111111:11 I I I : I I llilr: : 
a237 AVIIQNMTWAYCHIVADKPFCTRAQGFCGNNKGADSKRLALQRLEYRIQTGISIDGVHQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

m237 .pep IFALDAAFGRWQPTVLMRFDARHRGLLENLRTGFGSGTSDAQSVSERMQVSGXGVELCP 

111:11111 1111111:11:1 I : I : I I I I : I I I I : : I I I I III:: I I : I I I 
a237 IFAFDAAFGGVNQPTVLIRFNAYHGRMLKNLRTSFGSGAGDAQRVIERMEMPGQGIELCA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m237 . pep LVHIAVKLLFQLGFIPKLIMTRTVMPLGVFMPLLQLFPMLRTDGNRGITALPITIDGMFA 
I I I I I I I I I : I : : I I : I I I : I I : I I I : I I I I I : I I I I I I I II I I I I I I I I : I : I I I I 
a237 LVHIAVKLLLQFSVIPELIMSCTVIFLGVLMPLLQFFPMLRTDGNRGITALPIAINGMFA 

190 200 210 220 230 240 

250 260 270 280 290 300 

m237 . pep DAFVHQFDRLQRLLPKPLRLLQADLFFNFPHTAXVIADNLPATPSRRAETDTRGFQHNRF 

I I I I I i I I I I I i I I I 11 t I J I = I I 1 1 1 I III I I M I ! I 1 M I I I I I II M I I I I I I I 
a237 DAFVHQFDRLQRLLPKPLRLLQTDLFFNFLHTAGVIADNLPATPSRRAETDTRGFQHNRF 

250 260 270 280 290 300 

310 320 330 340 350 360 

10237 . pep MSLLRQGQCSAQTTQSAADDTGIGFQTALKFRINSMRINRTEIIRRQIFLKIRANHCVCF 

I I r 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 

a237 MSLLRQG0CSAOTTQSAADDTGIGFQTALKFRINSMRINRTEIIRRQIFLKIRANHCVCF 
310 320 330 340 350 360 

370 380 
m237 .pep IRCIFGRNDTC-CRAISSXQK1GX 
I I I I I I I I I I I I I I I I I I I I 
a2 3 7 IGYI FGRNDTGCRAI S SKQKIGX 

370 380 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 887>: 

g238.seq 

1 atgaatttgc ctattcaaaa attcatgatg ctgttggcag cggcaatatc 

51 gatgctgcat atccccatta gtcatgcgaa cgctttggat gcccgtttgc 

101 gcgatgatat gcaggcaaaa cactacgaac cgcgtggcaa ataccatctg 

151 tttggtaatg ctcgcggcag tgttaaaaat cgcgtttgcg ccgtccaaac 

2 01 atttgatgca actgcggtcg gccccatact gcctattaca cacgaacgga 

251 caggatctga aggtgttatc ggctatgaaa cccatttttc aggacacgga 

301 cacgaagtac acagtccgtt cgataatcat gattcaaaaa gcacttctga 

351 tttcagcggc ggcgtagacg gcggttttac cgtttaccaa cttcatcgga 

4 01 cagggtcgga aatacatccc gcagacggat atgacgggcc tcaaggcggc 

4 51 ggttatccgg aaccacaagg ggcaagggat atatacagct accatatcaa 

501 aggaacttca accaaaacaa agataaacac tgttccgcaa gccccttttt 

551 cagaccgctg gctaaaagaa aatgccggtg ccgcttccgg ttttctcagc 

6 01 cgtgcggatg aagcaggaaa actgatatgg gaaaacgacc ccgataaaaa 

651 ttggcgggct aaccgtatgg atgatattcg cgccatcgtc caaggtgcgg 

701 ttaatccttt tttaacgggt tttcaagggg tagggattgg ggcaattaca 

751 gacagtgcgg taagcccggt cacagataca gccgctcagc agactctaca 

801 aggtattaat gatttaggaa atttaagtcc ggaagcacaa cttgccgccg 
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851 cgagcctatc acaggacagt gcctttgcgg taaaagacgg catcaattcc 

901 gccagacaat gggctgatgc ccatccgaat ataacagcaa cagcccaaac 

951 tgcccttgcc gtagcagagg ccgcaggtac ggtttggcgc ggtaaaaaag 

1001 tagaacttaa cccgaccaaa tgggattggg ttaaaaatac cggctataaa 

1051 aaacctgctg cccgccatat gcagactgta gatggggaga tggcaggggg 

1101 gaatagaccg cctaaatcta taacgtcgga aggaaaagct aatgctgcaa 

1151 cctatcctaa gttggttaat cagctaaatg agcaaaactt aaataacatt 

1201 gcggctcaag atccaagatt gagtctagct attcatgagg gtaaaaaaaa 

1251 ttctccaata ggaactgcaa cttatgaaga ggcagataga ctaggtaaaa 

1301 tttgggttgg tgagggtgca agacaaacta gtggaggcgg atggttaagt 

13 51 agagatggca ctcgacaata tcggccacca acagaaaaaa aatcacaatt 

14 01 tgcaactaca ggtattcaag caaattttga aacttatact attgattcaa 
1451 atgaaaaaag aaataaaatt aaaaatggac atttaaatat taggtaa 

This corresponds to the amino acid sequence <SEQ ID 888; ORF 238.ng>: 

g238 .pep 

1 MNLPIQKFMM LLAAAISMLH IPISHAN GLD ARLRDDMQAK HYEPGGKYHL 

51 FGNARGSVKN RVCAVQTFDA TAVGP1LPIT HERTGFEGVI GYETHFSGHG 

101 HEVHSPFDNH DSKSTSDPSG GVDGGFTVYQ LHRTGSEIKP ADGYDGPQGG 

151 GYPEPQGARD IYSYHIKGTS TKTKINTVPQ APFSDRWLKE NAGAASGFLS 

201 RADEAGKLIW ENDPDKNWRA NRMDDIRGIV QGAVNPFLTG FQGVGIGAIT 

251 DSAVSPVTDT AAQQTLQGIN DLGNLSPEAQ LAAASLLQDS AFAVKDGINS 

301 AROWADAHPN ITATAOTALA VAEAAGTVWR GKKVELNPTK WDWVKNTGYK 

351 KPAARHMQTV DGEMAGGNRP PKSITSEGKA NAATYPKLVN QLNEQNLNNI 

401 AAQDPRIiSLA IHEGKKNFPI GTATYEEADR LGKIWVGEGA RQTSGGGWLS 

451 RDGTRQYRPP TEKKSQFATT GIQANFETYT IDSNEKRNKI KNGHLNIR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 889>: 

ra23 8.seq 

1 ATGAATTTGC CTATTCAAAA ATTCATGATG CTGTTTGCAG CAGCAATATC 

51 GTTGCTGCAA ATCCCCATTA GTCATGCGAA CGGTTTGGAT GCCCGTTTGC 

101 GCGATGATAT GCAGGCAAAA CAcTACGAAC CGGG TGGTAA ATACCATCTG 

151 TTTGGTAATG CTCGCGGCAG TGTTAAAAAG CGGGTTTACG CCGTCCAGAC 

201 ATTTGATGCA ACTGCGGTCA GTCCTGTACT GCCTATTACA CACGAACGGA 

251 CAGGGTTTGA AGGTGTTATC GGTTATGAAA CCCATTTTTC AGGGCACGGA 

301 CATGAAGTAC ACAGTCCGTT CGATCATCAT GATTCAAAAA GCACTTCTGA 

351 TTTCAGCGGC GGTGTAGACG GCGGTTTTAC TGTTTACCAA CTTCATCGAA 

401 CAGGGTCGGA AATCCATCCG GAGGATGGAT ATGACGGGCC GCAAGGCAGC 

4 51 GATTATCCGC CCCCCGGAGG AGCAAGGGAT ATATACAGCT ATTATGTCAA 

501 AGGAACTTCA ACAAAAACAA AGACTAATAT TGTCCCTCAA GCCCCATTTT 

551 CAGACCGTTG GCTAAAAGAA AATG CCGGTG CCGCCTCTGG TTTTTTCAGC 

601 CGTGCGGATG AAGCAGGAAA ACTGATATGG GAAAGCGACC CCAATAAAAA 

651 TTGGTGGGCT AACCGTATGG ATGATGTTCG CGGCATCGTC CAAGGTGCGG 

701 TTAATCCTTT TTTAATGGGT TTTCAAGGAG TAGGGATTGG GGCAATTACA 

751 GACAG TGCAG TAAGCCCGGT CACAGATACA GCCGCGCAGC AGACTCTACA 

801 AGGTATTAAT GATTTAGGAA AATTAAGTCC GGAAGCACAA CTTGCTGCCG 

851 CGAGCCTATT ACAGGACAGT GCTTTTGCGG TAAAAGACGG TATCAACTCT 

901 GCCAAACAAT GGGCTGATGC CCATCCAAAT ATAACAGCTA CTGCCCAAAC 

951 TGCCCTTTCC GCAGCAGAGG CCGCAGGTAC GGTTTGGAGA GGTAAAAAAG 

1001 TAGAACTTAA CCCGACTAAA TGGGATTGGG TTAAAAATAC CGGTTATAAA 

1051 AAACCTGCTG CCCGCCATAT GCAGACTTTA GATGGGGAGA TGGCAGGTGG 

1101 GAATAAACCT ATTAAATCTT TACCAAACAG TGCCGCTGAA AAAAGAAAAC 

1151 AAAATTTTGA GAAGTTTAAT AGTAACTGGA GTTCAGCAAG TTTTGATTCA 

1201 GTGCACAAAA CACTAACTCC CAATGCACCT GGTATTTTAA GTCCTGATAA 

1251 AGTTAAAACT CGATACACTA GTTTAGATGG AAAAATTACA ATTATAAAAG 

1301 ATAACGAAAA CAACTATTTT AGAATCCATG ATAATTCACG AAAACAGTAT 

1351 CTTGATTCAA ATGGTAATGC TGTGAAAACC GGTAATTTAC AAGGTAAGCA 

1401 AGCAAAAGAT TATTTACAAC AACAAACTCA TATCAGGAAC TTAGACAAAT 

1451 GA 

This corresponds to the amino acid sequence <SEQ ID 890; ORF 23 8>: 

ra238.pep 

1 MNLPIQKFMM LFAAAISLLQ IPISHAN GLD ARLRDDMQAK HYEPGGKYHL 

51 FGNARGSVKK RVYAVQTFDA TAVSPVLPIT HERTGFEGVI GYETHFSGHG 

101 HEVHSPFDHH DSKSTSDFSG GVDGGFTVYQ LHRTGSEIHP EDGYDGPQGS 
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151 DYPPPGGARD IYSYYVKGTS TKTKTNIVPQ APFSDRWLKE NAGAASGFFS 

201 RADEAGKLIW ESDPNKNWWA NRMDDVRGIV QGAVNPFLMG FQGVGIGAIT 

251 DSAVSPVTDT AAQQTLQGIN DLGKLSPEAQ LAAASLLQDS AFAVKDGINS 

301 AKQWADAHPN ITATAQTALS AAEAAGTVWR GKKVELNPTK WDWVKNTGYK 

351 KPAARHMQTL DGEMAGGNKP IKSLPNSAAE KRKQNFEKFN SNWSSASFDS 

401 VHKTLTPNAP GILSPDKVXT RYTSLDGKIT IIKDNENNYF RIHDNSRKQY 

4 51 LDSNGNAVKT GNLQGKQAXD YLQOOTHIRN LDK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 238 shows 86.0% identity over a 401 aa overlap with a predicted ORF (ORF 238.ng) 
from N. gonorrhoeae: 
m238/g238 

10 20 30 40 50 60 

m238 . pep MNLPIQKFMMLFAAAI SLLQI PI SHANGLDARLRDDMQAKHYEPGGKYHLFGNARGSVKK 

MM II M II MM illMMMI MIMIMi MHM Mill I II II MM Mi I I M 
g 2 3 8 MNLP I QKFMMLiLAAAI SMLH I P I SHANGLDARLRDDMQAKH YEPGGKY HLFGNARGS VKN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 3 8 . pep RVYAVQTFDATAVSPVLP I THERTGFEGV IGYETHFSGHGHEVHS PFDHHDS KSTSDFSG 

II MMMMIIMMIMMIMMMMMIMIMMMIIMM I MMM 

g2 3 8 RVCAVQTFDATAVG P I LP I THERTG F EGV I G YETH FS GHGHEVHS P FDNHD S KS TSDF SG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m238 .pep GVDGGFTVYQLHRTGSEIHPEDGYDGPQGSDYPPPGGARDIYSYYVKGTSTKTKTNIVPQ 

IIIIIIIIIMIIIIIIIIi IIMIMM II I MIMMhMIIIMM I III 
g2 3 8 GVDGGFTVYQLHRTGSE IHPADGYDGPQGGGYPEPQGARDI YSYH I KGTSTKTKINTVPQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 3 8 . pep APFSDRWLKENAGAASGFFSRADEAGKL IWESDPNK1WWANRMDDVRGIVQGAVNPFLMG 

MMMMMMMIMIMIIIMMMMM Mill r 1 1 1 1 1 = I i 1 1 E 1 1 1 1 1 1 i I 

g238 APFSDRWLKENAGAASGFLSRADEAGKLIWENDPDKlWRANRMDDIRGrVQGAVNPFLTG 
190 200 210 220 230 240 

250 260 270 280 290 300 

ra238 . pep FQGVGIGAITDSAVSPVTDTAAQQTLQGINDLGKLSPEA 

III lllll II I llllll Mill 1 1 1 MM! Ilhll III III 1 1 1 lllllll II IMI 1 1 ' 

g238 FQGVG IGAI TDSAVSPVTDTAAQQTLQG INDLGNLSPEAOLAAASLLQDSAFAVKDGI NS 

250 260 270 280 290 300 

310 320 330 340 350 360 

m2 3 8 . pep AKQWADAH PN I TATAQTALS AAEAAGTWJRG KKVE LNP TKWDWVKNTG YKKPAARHMQTL 

Ml 1 1 1 Mill II Ml MMMM I MMMM III MMM 1 1 1 M M MM IM II M 

g 2 3 8 ARQWADAHPN I TATAQTALAVAEAAGTVWRGKKV^ELNPTKWD WVKNTGYKKPAARHMQTV 

310 320 330 340 350 360 

370 380 390 400 410 420 

m238 . pep DGEMAGGNKPIKSLPNSAAEKRKQNFEKFNSNWSSAS FDSVHKTLTPNAPGIliSPDKVKT 

Mil Ml Ml ||: :| := :: |: :: : ::::: 

g238 DGEMAGGNRPPKSI -TSEGKANAATYPKLVNQLNEQNLNNIAAQDPRLSLAIHEGKI^FP 

370 380 390 400 410 



430 440 450 460 470 480 

m2 3 8 . pep RYTSLDGKITI I KDNENNYFRI HDNS R KQYLDS NGNAVKTGNIOGKQ AKD YLQQQTH I RN 

g23 8 I GTATYE E ADRLGKI WVGEGARQTS GGGWLS RDGTRQ YRP PTE KKS QFATTG I QANFE TY 

420 430 440 450 460 470 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 891 >: 

a238.seq (partial) 

1 ATGAATTTGC CTATTCAAAA ATTCATGATG CTGTTTGCAG CAGCAATATC 

51 GTTGCTGCAA ATCCCCATTA GTCATGCGAA CGGTTTGGAT GCCCGTTTGC 

101 GCGATGATAT GCAGGCAAAA CACTACGAAC CGGGTGGTAA ATACCATCTG 

151 TTTGGTAATG CTCGCGGCAG TGTTAAAAAT CGGGTTTACG CCGTCCAAAC 

201 ATTTGATGCA ACTGCGGTCG GCCCCATACT GCCTATTACA CACGAACGGA 

251 CAGGATTTGA AGGCATTATC GGTTATGAAA CCCATTTTTC AGGACATGGA 

301 CATGAAGTAC ACAGTCCGTT CGATAATCAT GATTCAAAAA GCACTTCTGA 

351 TTTCAGCGGC GGCGTAGACG GTGGTTTTAC CGTTTACCAA CTTCATCGGA 

401 CAGGGTCGGA AATCCATCCG GAGGATGGAT ATGACGGGCC GCAAGGCAGC 

451 GATTATCCGC CCCCCGGAGG AGCAAGGGAT ATATACAGCT ACTATGTCAA 

501 AGGAACTTCA ACAAAAACAA AGAGTAATAT TGTTCCCCGA GCCCCATTTT 

551 CAGACCGCTG GCTAAAAGAA AATGCCGGTG CCGCCTCTGG TTTTTTCAGC 

601 CGTGCTGATG AAGCAGGAAA ACTGATATGG GAAAGCGACC CCAATAAAAA 

651 TTGGTGGGCT AACCGTATGG ATGATATTCG CGGCATCGTC CAAGGTGCGG 

701 TTAATCCTTT TTTAATGGGT TTTCAAGGAG TAGGGATTGG GGCAATTACA 

751 GACAGTGCAG TAAGCCCGGT CACAGATACA GCCGCGCAGC AGACTCTACA 

801 AGGTATTAAT CATTTAGGAA ATTTAAGTCC CG AAG CACAA CTTGCGGCTG 

851 CAACCGCATT ACAAGACAGT GCTTTTGCGG TAAAAGACGG TATCAATTCC 

901 GCCAGACAAT GGGCTGATGC CCATCCGAAT ATAACTGCAA CAGCCCAAAC 

951 TGCCCTTGCC GTAGCAGAGG CCGCAACTAC GGTTTGGGGC GGTAAAAAAG 

1001 TAGAACTTAA CCCGACCAAA TGGGATTGGG TTAAAAATAC CGGCTATAAA 

1051 ACACCTGCTG TTCGCACCAT GCATACTTTG GATGGGGAAA TGGCCGGTGG 

1101 GAATAGACCG CCTAAATCTA TAACGTCCAA CAGCAAAGCA GATGCTTCCA 

1151 CACAA 

This corresponds to the amino acid sequence <SEQ ID 892; ORF 238.a>: 

a238.pep (partial) 

1 MNLPIQKFMM LFAAAISLLQ IPISHA KGLD ARLRDDMQAK HYEPGGKYHL 

51 FGNARGSVKN RVYAVQTFDA TAVGPILPIT HERTGFEGII GYETHFSGHG 

101 HEVHSPFDNH DSKSTSDFSG GVDGGFTVYQ LHRTGSEIHP EDGYDGPQGS 

151 DYPPPGGARD IYSYYVKGTS TKTKSNIVPR APFSDRWLKE NAGAASGFFS 

201 RADEAGKLIW ESDPNKNWWA NRMDDIRGIV QGAVNPFLMG FQGVGIGAIT 

251 DSAVSPVTDT AAQQTLQGIN HLGNLSPEAQ LAAATALQDS AFAVKDGINS 

301 ARQWADAHPN ITATAQTALA VAEAATTVWG GKKVELNPTK WDWVKNTGYK 

351 TPAVRTMHTL DGEMAGGNRP PKSITSNSKA DASTQ 



m238/a238 9 1 .9% identity in 385 aa overlap 

10 20 30 40 50 60 

m238.pep MNLPIQKFMMLFAAAISLLQIPISHANGLDARLRDDMQAKHYEPGGKYHLFGNARGSVKK 
I 1 1 I I I II 1 1 I I I II I I I I I I I I I I 1 1 I I I I 1 1 1 1 1 I 1 1 I I II I I I i I 1 1 II I 1 1 I I I I: 
a 2 3 8 MNLPI QKFMM LFAAAISLLQ I PIS HANG LDARLRDDMQAKH YEPGGKYHLFGNARGS VKN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m238 . pep RVYAVQTFDATAVSPVLPITHERTGFEGVIGYETHFSGHGHEVHSPFDHHDSKSTSDFSG 
I II I I! I MM II: 1:1 Ml II III III I II I II Nil 11:1 Ml II I I III 

a238 RVYAVQTFDATAVGPILPITHERTGFEGIIGYETHFSGHGHEVHSPFDNHDSKSTSDFSG 

70 80 90 100 110 120 



130 140 150 160 170 180 

m238.pep GVDGGFTVYQLHRTGSEIHPEDGYDGPQGSDYPPPGGARDIYSYYVKGTSTKTKTNIVPQ 
I I I I I I I M I I I I I I II II I I I I I I I I I M I 1 1 I I M II I I I I I I I I I I I II I I: I I I I: 
a238 GVDGGFTVYQLHRTGSEIHPEDGYDGPQGSDYPPPGGARDIYSYYVKGTSTKTKSNIVPR 
130 140 150 160 170 180 



190 200 210 220 230 240 

m238 . pep APFSDRWLKENAGAASGFFSRADEAGKLIWESDPNKNWWANRMDDVRGIVQGAVNPFLMG 
Ml IMI MM III III Mill Mill III MM II MM M Ml MM I Ml MMIM 
a238 APFSDRWLKENAGAASGFFSRADEAGKLIWESDPNKNWWANRMDDIRGIVQGAVNPFLMG 

190 200 210 220 230 240 



m238 .pep 



250 260 270 280 290 300 

FQGVG I GA I T DS A VS P VTDT AAQQT LQG IN DLGK L S PEAQLAAAS LLQ DS A FA V K DG I N S 
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I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I : I ( I I 1 I II I I : I II I I I I I l-l I f I I 
a233 FQGVGIGAITDSAVSPVTDTAAQQTLQGINHLGNLSPEAQL7VAATALQDSAFAVKDGINS 
250 260 270 280 290 300 

310 320 330 340 350 360 

m233 . pep AKQW ADAH PN I TAT AQT ALS AAEAAG T VWRGKKVE LN PTKWDW VKN TGYKK PAARHMQT L 

I : I I I I I I I I I I I I I I I I I :: I I I I Mi I I I I I I II I I I I I I II I I I I I I : I I : I I 
a233 ARQWADAHPNITATAQTALAVAEAATTVWGGKKVELNPTKWDWVKNTGYECTPAVRTMHTL 

310 320 330 340 350 360 

370 380 390 400 410 419 

m233 . pep DGEMAG GNKP I KS LP- N S AAEKRKQN FEK FN S N WS S AS FD S VHKT LT PN A PG I LS P DKVK 

11111111:111: III: I 
a2 3 8 DGEMAGGNRP PKS ITSNSKADASTQ 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 893>: 
g23 9.seq 

1 atgctccacc ataaaggtat tgcccgaaac cggcggatgg aggttttgtt 

51 tttctgccgc cgccctgatc gcttcgtgat tcgccaaacg cgcctgttgc 

101 agcctcattt: gcgcataatc ctgctccaag gcgatttcct gttttttcgc 

151 cttgtccaaa gctgtgaagt tgagcctgta ctggttttgc tgcatcacaa 

201 cggaaaaagc ggaaacgcac accgcaagca gcagaaagaa attcgatttg 

251 ttcattgccg ttcagacgtt tttctctgtt attattccgg tatcggaccg 

301 gcagtccgct ccgccacacg caaaactgcg ctcctcgccc tcgggttggc 

351 ggcaatttcc gcttcacccg gctttaatgc cctgcccacg atttbcaggg 

4 01 gcggatcggg caaatccgct tctctgaccg ccgcccagct cggcaggggc 

451 tcgtgttgcg aatatttttt gacaaactgc ttcacaatgc ggtcttccaa 

501 cgaatggaaa gcaatgaccg ccaaacgccc gccctctttc agacggcaca 

551 tgacctgcgg caataccgcc cctacttctt caagctcgcg gfctaataaag 

601 atgcggattg cctggaaggt gcgcgtcgca ggatcctgcc cccgctcgcg 

651 agtacggacg ttttgtgcca cgatctgcgc cagcttgcgg gttgtatcga 

701 ttggactttc cgcccgttgc gcgacaatgg cgcgcacaat ctggcggcta 

751 aaccgctctt caccataa 

This corresponds to the amino acid sequence <SEQ ID 894; ORF 239.ng>: 

g23 9.pep 

1 MFHHKGIARN RRMEVLFFCR RPDRFVI RQT RLLQPHLRII LLQGDFLFFR 

51 LVQSCEVEPV LVLLHHNGKS GNAHRKQQKE IRFVHCRSDV FLCYYSGIGP 

101 AVRSATRKTA LLALGLAAIS ASPGFNALPT IFRGGSGKSA SLTAAQLGRG 

151 SCCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 

201 MRIAWKVRVA G5CPRSRVR T FCATICASLR WSIGL SARC ATMARTIWRL 

251 NRSSP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 895>: 

m239.seq 

1 ATGCTCCACC ATAAAGGTmy kGCCCGAAAC CGGCkGATGG AGGTTTTGTT 

51 TTTCTGCCGC CGCCCTGATC GCTTCGTGGT TCGCCAAACG CGCCTGTTGC 

101 AGCCTCATTT GCGCATAATC CTGCTCCAAG GCGATTTCCT GTTTTTTCGC 

151 CTTATCCAAA GCTGTGAAAT TGAGCCTGTA CTGGTTTTGC TGCATCACAA 

201 CGGAAAAAGC GGAAACGCAC ACCGCAAGCA GCAGAAGGAA ATTCAATTTG 

251 TTCATTGCCA TTCAGACGTT TTTCTCTGTG ATTGTTCCGG TATCGGACCG 

301 GCAGTCCGCT CCGCCACACG CAAAACCGCA CTTCTCGCCC TCGGATTGGC 

351 GGCAATTTCC GCCTCACCCG GCTTTAATGC CCTGCCCACG ATTTTCAGGG 

4 01 GCAGCTCGGG CAAATCCGCT TCCCTGaCCG CCGCCCAGCG CGGCAGGGGC 
451 GCGTGTTGCG AATATTTTTT GACAAACTGC TTCACAATGC GATCTTCCAA 
501 CGAATGGAAA GCAATGACCG CCAAACGTCC GCCCTCTTTC AGACGACACA 

5 51 TGACCTGCGG CAATACTGCC CCTACTTCTT CAAGCTCGCG GTTAATAAAG 
601 ATGCGGACCG CCTGGAAGGT GCGCGTCGCA GGATCCTGCC CCCGCTCGCG 
651 AGTACGGACG TTTTGTGCCA CGATCTGCGC CAGCTTGCGG GTTGTATCGA 
701 TTGGACTTTC CGCCCGTTGC GCAACAATGG CGCGCGCAAT cCGGCGGCTa 
751 AACCGCTCTT CACCATAA 

This corresponds to the amino acid sequence <SEQ ID 896; ORF 239>: 
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m2 39 .pep 

I MLHHKGXARN RXMEVLFFCR RPDRFWRQT RLLOPHLRI I LLQGDFLFFR 
51 LIQSCEIEPV LVLLHHNGKS GNAHRKQQKE IQFVHCHSDV FLCDCSGIGP 
101 AVRSATRKTA LLALGLAAIS ASPGFNALPT IFRGSSGK3A SLTAAQRGRG 
151 ACCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 
201 MRTAWKVRVA GSCPRSRV RT FCATICASLR WSIGL SARC ATMARAJRRL 
251 NRSSP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 239 shows 93.7% identity over a 255 aa overlap with a predicted ORF (ORF 239.ng) 
from N. gonorrhoeae: 

m239/g239 

10 20 30 40 50 60 

MLHHKGXARNRXME VL FFCRRPDRFWRQTRLLQ PHLR 1 1 LLQGDFL FFRL I QS CE I E PV 

i = iiii mi 1 1 1 1 } 1 1 1 1 1 1 1 1 1 - 1 r 1 1 i 1 1 i 1 1 1 1 1 1 i 1 1 f e 1 1 1 r i = 1 1 1 1 - 1 1 1 

MFHHKG I ARNRRMEVLFFCRRPDRFVI RQTRLLQPHLRI ILLQGDFliFFRLVQS CEVEPV 
10 20 30 40 50 60 

70 80 90 100 110 120 

LVLIJtfHNGKSGNAIIRKQQKEIQFVHOT 

MIIIIIIIIMIIIM lllhlllhlll III lllllllllllllllllllllllll 

LVLLHHNGKSGNAHRKQQKEI RFVHCRSDVFLCYYSG I GPAVRSATRKTALLALGLAAI S 
70 80 90 100 110 120 

130 140 150 160 170 180 

ASPGFNALPTIFRGSSGKSASLTAAQRGRGACCEYFLTNCFTMRSSNEWKAMTAKRPPSF 

1 1 1 1 1 1 f 1 1 ! 1 1 1 i ^ I ! f I ! 1 1 1 1 ! I llhil III III IHIIMIINII Ml MM I 

ASPGFNALPTIFRGGSGKSASLTAAQLGRGSCCEYFLTNCFTMRSSNEWKAMTAKRPPSF 
130 140 150 160 170 180 

190 200 210 220 230 240 

RRHMTCGNTAPTSS SSRLI KMRTAWKVRVAGSCPRSRVRTFCATI CASLRWS IGLSARC 

Illlllllllllllllllllll lllllll Mi III III IMIIIM III II Mlillll 

RRHMTCGNTAPTSS SSRLI KMRIAWKVRVAGSCPRSRVRTFCATICASLRWS IGLSARC 
190 200 210 220 230 240 

250 

ATMARAIRRLNRSSPX 

Mllhl llllllll 

ATMARTIWRLNRSSPX 
250 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 897>: 
a239.seq 

1 ATGCTCCACC ATAAAGGTAT TGCCCGAAAC CGGCGGATGG AGGTTTTGTT 

51 TTTCTGCCGC CGCCCTGATC GCTTCGTGGT TCGCCAAACG CGCCTGTTGC 

101 AGCCTCATTT GCGCATAATC CTGCTCCAAG GCGATTTCCT GTTTTTTCGC 

151 CTTATCCAAA GCTGTGAAGT TGAGCCTGTA CTGGTTTTGC TGCATCACAA 

201 CGGAAAAAGC GGAAACGCAC ACCGCAAGCA GCAGAAGGAA ATTCAATTTG 

251 TTCATTGCCA TTCAGACGTT TTTCTCTGTG ATTGTTCCGG TATCGGACCG 

301 GCAGTCCGCT CCGCCACACG CAAAACCGCA CTTCTCGCCC TCGGATTGGC 

351 GGCAATTTCC GCCTCACCCG GCTTTAATGC CCTGCCCGCG ATTTTCAGGG 

401 GCGGCTCGGG CAAATCCGCT TCCCTGACCG CCGCCCAGCG CGGCAGGGGC 

4 51 GCGTGTTGCG AATATTTTTT GACAAACTGC TTCACAATGC GGTCTTCCAA 

501 CGAATGGAAA GCAATGACCG CCAAACGTCC GCCCTCTTTC AGACGACACA 

551 TGACCTGCGG CAATACTGCC CCTACTTCTT CAAGCTCGCG GTTAATAAAG 

601 ATGCGGATTG CCTGGAAGGT GCGCGTCGCA GGATCCTGCC CCCGCTCGCG 

651 AGTACGGACG TTTTGTGCCA CGATCTGCGC CAGCTTGCGG GTTGTATCGA 

701 TTGGACTTTC CGCCCGTTGC GCAACAATGG CGCGCGCAAT CTGGCGGCTA 

751 AACCGCTCTT CACCATAA 



m239 .pep 
g239 

m239 .pep 
g239 

m239 .pep 
g239 

m239 -pep 
g239 

m239 .pep 
9239 
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This corresponds to the amino acid sequence <SEQ ID 898; ORF 239.a>: 

a239.pep 

1 MLHHKGIARN RRMEVLFFCR RPDRFVVRQT RLLQPHLRII LLQGDFLFFR 

51 LIQSCEVEPV LVLLHHNGKS GNAHRKQQKE IQFVHCHSDV FLCDCSGIGP 

101 AVRSATRKTA LLALGLAAI S ASPGFNALPA IFRGGS GKSA SLTAAQRGRG 

151 ACCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 

201 MRIAWKVRVA GSCPRSRVRT FCATICASLR VVSIGLSARC ATMARAIWRL 

251 NRSSP* 

m239/a239 97.3% identity in 255 aa overlap 

10 20 30 40 50 60 

ra239.pep M LHHKGXARN RXME VL F FCRR P DR FWRQT RLLQ P HLR 1 1 L LQG DFLFFRLIQSCEIEPV 

I I I I I I MM MIIMIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIMIhlll 
a239 MLHHKGIARNRRMEVLFFCRRPDRFWRQTRLLQPHLRIILLQGDFLFFRLIQSCEVEPV 
10 20 30 40 50 60 

70 80 90 100 110 120 

m239.pep LVLLHHNGKSGNAHRKQQKEIQFVHCHSDVFLCDCSGIGPAVRSATRKTALLALGLAAIS 
M I M M M M M M M M M M M M M M M I M I M M M M M M M M M I M M 
a239 LVLLHHNGKSGNAHRKQQKEIQFVHCHSDVFLCDCSGIGPAVRSATRKTALLALGLAAIS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m239 . pep A S PG FN A L PT I FRG S S GKS AS LT AAQRGRG AC C E Y FLT N C FTMR S SN EWKAMT AKR P P S F 

M M M M I M M I M M M M M M M M M M M i M M M I M I M M M M M M I 
a239 A S PG FN A L P A I FRG GS GKS AS LT AAQRGRG ACCE Y FLT NC FTMR S SN EWKAMT AKR P P S F 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 3 9 . pep RRHMTCGNTAPTSSSSRLIKMRTAWKVRVAGSCPRSRVRTFCATICASLRWS IGLSARC 

M I M M M M M I M M M M M t I M M I M I M M M I M I M I M M I i M 1 M I 
a239 RRHMTCGNTAPTSSSSRLIKMRIAWKVRVAGSCPRSRVRTFCATICASLRWS IGLSARC 

190 200 210 220 230 240 

250 

m2 3 9 . pep ATMARAI RRLNRS S PX 

M M M I M I M M I 
a2 3 9 ATMARAI WRLNRS S PX 

250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 899>: 

g240.seq 

1 atgatagaag tcatacattt cttcggcgcc gaaacgcgca gacagtttgc 

51 tcgtgccgac gttggacgat ttctgcataa tgccgcgcac atccaaagag 

101 gggtaaacat gggtatcatc gcgcacggga gacggtccga ttttataagg 

151 ctgcgtattc agccgttcgt tcaaatcggt tttgcccgca tccaatgcct 

201 tcgcaatcac gaacggtttg attgccgaac caggttcgat catatcggtt 

251 acggcacggt tgcgccgctg ttcgctgtct gcccggccgg gtctgttggg 

301 atcgtaggcg ggcgtattgg ccaaggcgag gatttccccc gtgcgggcat 

351 ccaaaaccac caccgttccg gcttttgcct gatggtattc gaccgccttg 

401 ttcaactctt cataggccaa ggbctgaatc ctctgatcga gggaaaggat 

451 gatgtctttg ccgttttgcg gtgctttatt gcgcggggag tccaagctgt 

501 ccacaatatt gccctgccgg tcccgcaaaa caacttccgc gccgtcttcg 

551 ccatacaggc tgtcttcaag cgaaagttcc aaaccttcct gacctttgcc 

601 gtcaatatcg gtaaatccga tgacgtgtgc aaacaggttg cccatcgggt 

651 aatggcgttt taa 

This corresponds to the amino acid sequence <SEQ ID 900; ORF 240.ng>: 

g240.pep 

1 MIEVIHFFGA ETRRQFACAD VGRFLHNAAH IQRGVNMGII AH3RRSDFIR 

51 LRIQPFVQIG FARIQGLRKH ERFDCRTRFD HIGYG TVAPL FAVCPAGSVG 

101 IVGGRIGOGE DFPRAGIQNH HRSGFCLMVF DRLVQLFIGQ GLNPLIEGKD 

151 DVFAVLRCFI ARGVQAVHNI ALPVPQNNFR AVFAIQAVFK RKFQTFLTFA 
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201 VNIGKSDDVC KQVAHRVMAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 901 >: 
ra240 . seq 

1 ATGATAGAAG 

51 TTGTGCCGAC 

101 GGGTAAACAT 

151 CGTATTCAGC 

201 CAATCACAAA 

251 GCACGGTTGC 

301 GTAGGCGGGC 

351 ArACCACCAC 

401 AACTCTTCAT 

451 GTCTTTGCCG 

501 CAATATTGCC 

551 TGCAAGCTGT 

601 AATATCGGTA 

651 GGCGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 902; ORF 240>: 
m240.pep 

1 MIEVIHFFGT ETRRQFACAD VGRFLHDAAH IQRGVNMGIA HGRRSDFIRL 

51 RIQPFVQIGF ARIQCLRNHK RFDCRTGFDH IGYG TVAPLF AVCPAGPVGI 

101 VGGRIGQGED FPRAGIOXHH RSGFCLMVFD RLVQLFIGQG LNPLIEGKDD 

151 VFAVFRGFXA RGVQAVHNIA LPVPQNDFRA VFAMQAVFKR KFQTFLTFAV 

201 NIGKSDDVCK QVAHRVMAF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 240 shows 94.5% identity over a 220 aa overlap with a predicted ORF (ORF 240.ng) 
from N. gonorrhoeae: 

m240/g240 

10 20 30 40 50 59 

MI EVI HF FGTE TRRQFACADVGR FLHD AAH I QRGVNMG I - AHGRRS D FI RLR I QP FVQ I G 

M MIMI Mill MMIIMMMMIIMIMM III I MM II MM 1 1 1 1 III II 

MIEVIHFFGAETRRQFACADVGRFLHNAAHIQRGVNMGI IAHGRRSDFIRLRI QPFVQIG 
10 20 30 40 50 60 

60 70 80 90 100 110 119 

FARIQCIiRNHKRFDCRTGFDHIGYGTVAPLFAVCPAGPVGIVGGRIGQGEDFPRAGIQXH 

1 1 1 1 1 1 ! I I h 1 1 1 1 1 I 1 1 1 1 M M I ! M ! 1 1 M I 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I 

FARIQCLRNHERFDCRTRFDHIGYGTVAPLFAVCPAGSVGIVGGRIGQGEDFPRAGIQNH 
70 80 90 100 110 120 

120 130 140 150 160 170 179 

HRSGFCLMVFDRLVQLF IGQGLNPLI EGKDDVFAVFRGFXARGVQAVHNI AL P VPQNDFR 

I M 1 1 M I 1 1 M 1 1 1 1 1 M 1 1 M 1 1 II M 1 1 1 1 1 M I I llllllllllllllllhll 

tlRSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVLRCFIARGVQAVHNIALPVPONNFR 
130 140 150 160 170 180 

180 190 200 210 220 

AVFAMQAV FKRKFQTFLT FAVN I GKS DDVCKQVAHRVMAFX 

II Ihlll MIIIIIIIIMIMIIIIMIIIIIIMIII 

AVFA I QAV FKR KFQTFLT FAVNI G KS DDVCKQ VAHRVMAF 
190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 903>: 

a240.seq 

1 ATGATAGAAG TCATACATTT CTTCGGCACC GAAACGCGCA GACAGTTTGC 

51 TTGTGCCGAC GTTGGACGAT TTCTGCATGA TGCCGCGCAC ATCCAAAGAG 

101 GGGTAAACAT GGGTATCGCG CACGGGAGAC GGTCCGATTT TATAAGGCTG 

151 CGTATTCAGC CGTTCGTTCA AATCGGTTTT GCCCGCATCC AATGCCTTCG 

201 CAATCACAAA CGGTTTGATT GCCGAACCGG GTTCGATCAT ATCGGTTACG 



TCATACATTT CTTCGGCACC 
GTTGGACGAT TTCTGCATGA 
GGGTATCGCG CACGGGAGAC 
CGTTCGTTCA AATCGGTTTT 
CGGTTTGATT GCCGAACCGG 
GCCGCTGTTC GCTGTCTGCC 
GTATTGGCCA AGGCGAGGAT 
CGTTCCGGCT TTTGCCTGAT 
AGGCCAAGGT CTGAATCCTC 
TTTTTCGGGG CTTTAkTGCG 
CTGCCGGTCC CGCAAAACGA 
CTTCAAGCGA AAGTTCCAAA 
AATCCGATGA CGTGTGCAAA 



GAAACGCGCA GACAGTTTGC 
TGCCGCGCAC ATCCAAAGAG 
GGTCCGATTT TATAAGGCTG 
GCCCGCATCC AATGCCTTCG 
GTTCGATCAT ATCGGTTACG 
CGGCCGGGCC TGTTGGGATC 
TTCCCCCGTG CGGGCATCCA 
GGTATTCGAC CGCCTTGTTC 
TGATCGAGGG AAAGGATGAT 
CGGGGAGTCC AAGCTGTCCA 
CTTCCGCGCC GTCTTCGCCA 
CCTTCCTGAC CTTTGCCGTC 
CAGGTTGCCC ATCGGGTAAT 



m240 .pep 
g240 

m240 .pep 
9240 

m240 .pep 
9240 

m24 0 .pep 
g240 
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251 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 
301 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 
351 AAACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 
401 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 
451 GTCTTTGCCG TTTTTCGGGG CTTTATTGCG CGGGGAGTCC AAGCTGTCCA 
501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 
551 TGCAGGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 
601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 
651 GGCGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 904; ORP 240.a>: 

a240.pep 

1 MIEVIHFFGT ETRRQFACAD VGRFLHDAAH IQRGVNMGIA HGRRSDFIRL 

51 RIQPFVQIGF ARIQCLRNHK RFDCRTGFDH IGYGT VAPLF AVCPAGPVGI 

101 VGGRIGQGED FPRAGIQNHH RSGFCLMVFD RLVQLFIGQG LNPLIEGKDD 

151 VFAVFRGFIA RGVQAVHNIA LPVPQNDFRA VFAMQAVFKR KFQTFLTFAV 

201 NIGKSDDVCK QVAKRVMAF* 

m240/a240 99. 1 % identity in 2 1 9 aa overlap 

10 20 30 40 50 60 

MIEVIHFFGTETRRQFACADVGRFLHDAAHIQRGVNMGIAHGRRSDFIRLRIQPFVQIGF 

iiiiiiiiiiiii in. it iiiiiiMiiiiii nun iiiiiiiniii mi inn 

MIEVIHFFGTETRRQFACADVGRFLHDAAHIQRGVNMGIAHGRRSDFIRLRIQPFVQIGF 
10 20 30 40 50 60 

70 80 90 100 110 120 

ARIQCLRNHKRFDCRTGFDHIGYG T VA PLFAV C PAG P VG I VGGR I G QGE D FPRAG I QXH H 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I | M I I || 

AR I QCLRNHKRFDCRTGFDHIGYGTVAPLFAVCPAGPVG I VGGRIGQGED FPRAGIQNHH 
70 80 90 100 110 120 

130 140 150 160 170 180 

RSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVFRGFXARGVQAVHNIALPVPQNDFRA 
I M I I I I I I I I I I I I I I I I I I I I II I I I I M | I I I I | I || | | | | | | | | I | 1 | | | | | } | | 
RSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVFRGFIARGVQAVHNIALPVPQNDFRA 
130 140 150 1G0 170 180 

190 200 210 220 

VFAMQAVFKRKFQTFLTFAVNIGKSDDVCKQVAHRVMAFX 
I I I I I I I M I I I I I I I I I I I I I I | I I I M | | | | | I | | | | | 
VFAMQAV FKRK FQT FLT FAVN IGKS D D VCKQVAH RVMA FX 
190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 905>: 

g241 . seq 

1 ATGATAGAAG TCATACATTT CTTCGGCACC GAAACGCGCA GACAGTTTGC 

51 TTGTGCCGAC GTTGGACGAT TTCTGCATGA TGCCGCGCAC ATCCAAAGAG 

101 GGGTAAACAT GGGTATCGCG CACGGGAGAC GGTCCGATTT TATAAGGCTG 

151 CGTATTCAGC CGTTCGTTCA AATCGGTTTT GCCCGCATCC AATGCCTTCG 

201 CAATCACAAA CGGTTTGATT GCCGAACCGG GTTCGATCAT ATCGGTTACG 

251 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 

301 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 

351 ArACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 

4 01 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 

451 GTCTTTGCCG TTTTTCGGGG CTTTAkTGCG CGGGGAGTCC AAGCTGTCCA 

501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 

551 TGCAAGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 

601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 

651 GGCGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 906; ORF 241 .ng>: 

g241.pep 

1 MPTRPTRAAN PPTPTTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 



ra240.pep 
a240 

m240 -pep 
a240 

m240 .pep 
a240 

m24 0 .pep 
a24 0 
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51 ANRRENSHNA QPTYLLHPSN KMPSETEQTL. FRRHQIPPSC RQSVWMTVR 

101 TVDMTVCDFL IGCIAHAFNR SFKADFHACO RMVAVHHRLA VGNIGYTIDD 

151 NIAGFRIVRF KHHTDLDFNR ERARIFNTDQ LRIMLTERIV GRKRHFDRIA 

201 GILTVQRLFH QRENAWTAV QIRN RFFGFI QKLIVGIIHL IM QRNHGIFC 

251 NSHICPFRNS RLITGAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 907>: 

m241.seq (partial) 

1 . .CGGCAATCAG TGGTGGTGAT GACCGTGCGG GCCGTGGACA TGACCGTGTG 

51 CGATTTCCTC ATCGGATGCA TCGCGCACGC TTTCAACTGT AGCCTTAAAG 

101 CGGATTTTCA TGCCTGCCAA AGGATGGTTG CCGTCCACCA CCGCCTTGCC 

151 GTCGGCAACA TCGGTTACAC GATAGACGAC AACATCGCCG GTTTCAGGAT 

201 CGTCGGCTTC AAACATCATG CCGACTTCGA CTTCAACAGG GAACACGCCC 

251 GCATCTTCGA TACGGACCAA CTCCGGATCC TGCTCGCCGA ACGCATCGTC 

301 GGGCGACAGC GCCACATCGA CCGTATCGCC GGCATCCTTA CCGTGCAACG 

351 CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT AACCGCCGTG CAGATACGCA 

401 ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA TTGTTGGCAT CATACATCTC 

451 ATAATGCAGC GAAACCACGG AATTTTTCAC GATAGCCATA TTTGTCCTTT 

501 CAGGAACAGC AGATTAATTA CAGGCGCATT CTAA 

This corresponds to the amino acid sequence <SEQ ID 908; ORF 24t>: 

m241.pep (partial) 

1 . .RQSVWMTVR AVDMTVCDFL IGCIAHAFNC SLKADFHACQ RMVAVHHRLA 

51 VGNIGYTIDD NIAGFRIVGF KHHADFDFNR EHARIFDTDQ LRILLAERIV 

101 GRQRHIDRIA GILTVQRLFH QRENAWTAV QIRN RFFGFV QKLIVGIIHL 

151 IMQRNHGIFH DSHICPFRNS RLITGAF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 241 shows 91.5% identity over a 177 aa overlap with a predicted ORF (ORF 241.ng) 
from N. gonorrhoeae: 

m24l/g241 



10 20 30 

m241.pep RQSVWMTVRAVDMTVCDFLIGCIAHAFNC 

mill iii hi iii mi iimii ill i 

g24 1 QP TYLLH PSNKM P S ETEQTLFRRHQ I P P SCRQS VWMTVRTVDMTVCDFL I GC I AHA FNR 

70 80 90 100 110 120 



40 50 60 70 80 90 

m241 - pep SLKADFHACQR^AVHHRLAVGNIGYTIDDNIAGFRIVGFIOmADFDFNREHARIFDTDQ 
IMIIIIIMIIIIIMM III III II III II MUM I I I I = I = I I I I i = I I I I r | | | 
g241 S FKAD FHACQRMVAVHHRLAVGN I GYT I DDN I AG FRI VRFKHHTDLD FNRERAR I FNTDQ 

130 140 150 160 170 180 



100 110 120 130 140 150 

m24 1 . pep LR I LLAERIVGRQRHI DRI AG ILTVQRLFHQRENAWTAVQI RNRFFGFVQKL I VG I IHL 

IMMMIMMMIMM III Ml II III II MUM IMMIIilMMMM Mill 
g241 LR I MLT ER I VGRKRHFDR I AG I LTVQRL FHQRENAWT AVQ I RNRFFGF I QKL I VG 1 1 HL 

190 200 210 220 230 240 



160 170 
m241 .pep IMQRNHGIFHDSHICPFRNSRLITGAFX 

IIIIIMII MINIM I [ I I I I I ! I 
g241 IMQRNHGIFCNSHICPFRNSRLITGAFX 
250 260 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 909>: 

a241 .seq 

1 ATGCCAACAC GTCCAACTCG CGCCGCAAAG CATCCAACCC CGCCAACCTG 

51 GCTTCAGACG GCATACTGCC CTCGTCCGCC ATATCGTCCG CCGTCCGTGC 

101 AAACGCATAC ACCGCATGAA CCGGCTTCCT CAACCTGCGC GGCAAAATCA 

151 GCGAACCGAC GGGAAAATTT TCATAATGCC CAACCGACAT ACCTTCTCCA 

201 TCCATCAAAC AAAATGCCGT CTGAAATGGA ACAAACCCTT TTCAGACGGC 
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251 ATCAGATACC TCCAAGCTGC CGGCAATCAG TGGTGGTGAT GACCGTGCGG 

301 ACCGTGGACA TGACCGTGTG CGATTTCCTC ATCGGATGCA TCGCGCACAC 

351 TTTCAACCGT AGCCTTAAAG CGGATTTTCA TGCCTGCCAA AGGATGGTTG 

401 CCGTCCACCA CCGCCTTACC GTCGGCAACA TCGGTTACAC GATAGACGAC 

451 AACATCGCCG GTT7CAGGAT CGTCGGCTTC AAACATCATG CCGACTTCGA 

501 CTTCAACAGG GAACACGCCC GCATCTTCAA TACGGACCAA CTCCGGATCC 

551 TGCTCGCCGA ACGCATCGTC GGGCGAAAGC GCCACATCGA CCGTATCGCC 

601 GGCATCCTTA CCGTGCAACG CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT 

651 AACCGCCGTG CAGATACGCA ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA 

701 TTGTTGGCAT CATACATCTC ATAATGCAGC GAAACCACGG AATTCTTCAC 

751 GAT AG CC AT A TTTGTCCTTT CAGGAACAGC AGATTAATTA CAGGCGCATT 

801 CTAA 



This corresponds to the amino acid sequence <SEQ ID 910; ORF 241.a>: 
a241.pep 

1 MPTRPTRAAK HPTPPTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 

51 ANRRENFHNA QPTYLLHPSN KMPSEMEQTL FRRHQIPPSC RQSVWMTVR 

101 TVDMTVCDFL IGCIAHTFNR SLKADFHACQ RMVAVHHRLT VGNIGYTIDD 

151 NIAGFRIVGF KHHADFDFNR EHARIFNTDQ LRILLAERIV GRKRHIDRIA 

201 GILTVQRLFH QRENAWTAV QIRN RFFGFV QKLIVGIIHL IM QRNHGILH 

251 DSHICPFRNS RLITGAF* 



m24 1 /a24 1 96.0% identity in 1 77 aa overlap 

10 20 30 

m241.pep RQS VWMT VRAVDMT VC D FL I GC I AHAFNC 

I I I I I I I 1 I I : I I I I I I I I I I I I I I 1 : I I 
a241 QPTYLLHPSNKMPSEMEQTLFRRHQIPPSCRQS\ r WMT^/RTVDMTVCDFLIGCIAHTFNR 
70 80 90 100 110 120 

40 50 60 70 80 90 

m241 . pep SLKADFIiACQRMVAVHHRLAVGNIGYTIDDNIAGFRIVGFKHHADFDFNREHARIFDTDQ 
I I I I I I I M I I I I I I I ! I I : I I II I 1 I I I If I I I I I I I I I I I I I I II I I I I M I I I : I I I 
a241 SLKADFHACQRMVAVHHRLTVGNIGYTIDDNIAGFRIVGFKHHADFDFNREHARIFNTDQ 
130 140 150 160 170 180 

100 110 120 130 140 150 

m241.pep LRILLAERIVGRQRHIDRIAGILTVQRLFHQRENAWTAVQIRNRFFGFVQKLIVGIIHL 
I 1 1 I I I I I I I I I: I II I 1 1 I I I I I I I I I I I I I I I I I I I I 1 1 I 1 1 M I I I I I I I I I ! I I II 
a241 LR I LLAE R I VG RKR H I DR I AG I LTV QRL FHQRE N AWT AVQ I RN RFFG FVQKL I VG 1 1 HL 

190 200 210 220 230 240 



160 170 
m241 .pep IMQRNHGIFHDSHICPFRNSRLITGAFX 
I I I II I I I : I I I I I I II 1 1 I I I M I I 1 1 
a241 IMQRNHGILHDSHICPFRNSRLITGAFX 

250 260 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 91 1>: 

g241-l. seq 

1 ATGATAGAAG TCATACATTT CTTCGGCACC GAAACGCGCA GACAGTTTGC 

51 TTGTGCCGAC GTTGGACCAT TTCTCCATGA TGCCGCGCAC ATCCAAAGAG 

101 GGGTAAACAT GGGTATCGCG CACGGGAGAC GGTCCGATTT TATAAGGCTG 

151 CGTATTCAGC CGTTCGTTCA AATCGGTTTT GCCCGCATCC AATGCCTTCG 

201 CAATCACAAA CGGTTTGATT GCCGAACCGG GTTCGATCAT ATCGGTTACG 

251 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 

301 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 

351 ArACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 

401 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 

4 51 GTCTTTGCCG TTTTTCGGGG CTTTAJcTGCG CGGGGAGTCC AAGCTGTCCA 

501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 

551 TGCAAGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 

601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 

651 GGCGTTTTAA 



This corresponds to the amino acid sequence <SEQ ID 912; ORF 241-1 .ng>: 
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g241-l.pep 

1 MPTRPTRAAN PPTPTTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 

51 ANRRENSHNA QPTYLLHPSN KMPSETEQTL FRRHQIPPSC RQSWVMTVR 

101 TVDMTVCDFL I GC I AHA FUR SFKADFHACQ RMVAVHHRLA VGNISYTIDD 

151 NIAGFRIVRF KHHTDLDFNR ERARIFKTDQ LRIMLTERIV GRKRHFDRIA 

201 GILTVQRLFH QRENAWTAV QIRN RFFGFI QKLIVGIIHL IM QRNHGIFC 

251 NSHICPFRNS RLITGAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91 3>: 

m241-l.seq 

1 ATGCCAACAC GTCCAACTCG CGCTGCAAAC CCTCCAACCC CGCCAACCTG 

51 GCTTCAGACG GCATACTGCC CTCGTCCGCC ATATCGTCCG CCGTCCGTGC 

101 AAACGCGTAC ACCGCGTGAA CCGGCTTCCT CAACCTGCGC GGCAAAATCA 

151 GCGAACCGAC GGGAAAATTC TCATAATGCC CAACCGACAT ACCTTCTCCA 

201 TCCATCAAAC AAAATGCCGT CTGAAACGGA ACAAACCCTT TTCAGACGGC 

251 ATCAGATACC TCCAAGCTGC CGGCAATCAG TGGTGGTGAT GACCGTGCGG 

301 GCCGTGGACA TGACCGTGTG CGATTTCCTC ATCGGATGCA TCGCGCACGC 

351 TTTCAACTGT AGCCTTAAAG CGGATTTTCA TGCCTGCCAA AGGATGGTTG 

401 CCGTCCACCA CCGCCTTGCC GTCGGCAACA TCGGTTACAC GATAGACGAC 

451 AACATCGCCG GTTTCAGGAT CGTCGGCTTC AAACATCATG CCGACTTCGA 

501 CTTCAACAGG GAACACGCCC GCATCTTCGA TACGGACCAA CTCCGGATCC 

551 TGCTCGCCGA ACGCATCGTC GGGCGACAGC GCCACATCGA CCGTATCGCC 

601 GGCATCCTTA CCGTGCAACG CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT 

651 AACCGCCGTG CAGATACGCA ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA 

701 TTGTTGGCAT CATACATCTC ATAATGCAGC GAAACCACGG AATTTTTCAC 

751 GATAGCCATA TTTGTCCTTT CAGGAACAGC AGATTAATTA CAGGCGCATT 

801 CTAA 

This corresponds to the amino acid sequence <SEQ ID 914; ORF 241-1>: 

m241-l .pep 

1 MPTRPTRAAN PPTPPTWLQT AYCPRPPYRP PSVQTRTPRE PASSTCAAKS 

51 ANRRENSHNA QPTYLLHPSN KMPSETEQTL FRRHOIPPSC RQSVWMTVR 

101 AVDMTVCDFL IGCIAHAFNC SLKADFHACQ RMVAVHHRLA VGNIGYTIDD 

151 NIAGFRIVGF KHHADFDFNR EKARIFDTDQ LRILLAERIV GRQRHIDRIA 

201 GILTVQRLFH QRENAWTAV QIRN RFFGFV QKLIVGIIHL IM QRNHGIFH 

251 DSHICPFRNS RLITGAF* 

m241-l/g241-l 93.3% identity in 2 67 aa overlap 

10 20 30 00 50 60 

m241-l.pep MPTRPTRAANPPTPPTWLQTAYCPRPPYRPPSVQTRTPREPASSTCAAKSANRRENSHNA 
I I I I 1 M I I I I I I I I I I I I I I I! I I I II I I I I I I : I I : I 1 I I I I I I I I I I I I I I I I I I I 
g24 1 MPTRPTRAANPPTPTTWLQTAYCPRPPYRPPSVQTHTPHEPASSTCAAKSANRRENSHNA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m241-l.pep QPTYLLHPSNKMPSETEQTLFRRHQIPPSCRQSVWMTVRAVDMTVCDFLIGCIAHAFNC 

I I II I I 1 1 I M 1 1 1 1 1 1 I J I I I I I I 1 1 I 1 1 I I I I I I I I 1 1 : 1 I 1 1 ! 1 1 I I I ! I 1 1 I I I I 
g24 1 QPTYLLHPSNKMPSETEQTLFRRHQIPPSCRQSVWMTVRTVDMTVCDFLIGCIAHAFNR 

70 80 90 100 110 120 

130 140 150 160 170 180 

m241-l .pep SLKADFHACQRMVAVHHRLAVGNIGYTIDDNIAGFRIVGFKHHADFDFNREHARI FDTDQ 

IHilliillll II I II II M I I I III I III I! Il|]:|:|!ll|:||||:|t I 

g2 4 1 SFKADFHACQRMVAVHHRLAVGN I GYT I DON I AGFRI VRFKHHTDLD FNRERARI FNTDQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

10241-1 .pep LRILLAERIVGRQRHIDRIAGILTVQRLFHQRENAWTAVQIRNRFFGFVQKLIVGIIHL 
t I ! M : I 1 1 1 1 I : I I : H I I I i I I M 1 I ! I t i I t 1 1 I I I M M i I I I I : I I 1 I I I 1 I I I 
g241 LRIMLTERIVGRKRHFDRIAGILTVQRLFHQRENAWTAVQIRNRFFGFI QKLIVGIIHL 

190 200 210 220 230 240 

250 260 
102 41-1. pep IMQRNHGIFHDSHICPFRNSRLITGAFX 
! I I I I I I I I : I I I I I I I I I I I I I I I I ! 
g24 1 IMQRNHGI FCNSHICPFRNSRT.TTGAFX 

250 260 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ED 91 5>: 

a241-l.seq 

1 ATGCCAACAC GTCCAACTCG CGCCGCAAAG CATCCAACCC CGCCAACCTG 
51 GCTTCAGACG GCATACTGCC CTCGTCCGCC ATATCGTCCG CCGTCCGTGC 
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101 AAACGCATAC ACCGCATGAA CCGGCTTCCT CAACCTGCGC GGCAAAATCA 

151 GCGAACCGAC GGGAAAATTT TCATAATGCC CAACCGACAT ACCTTCTCCA 

201 TCCATCAAAC AAAATGCCGT CTGAAATGGA ACAAACCCTT TTCAGACGGC 

251 ATCAGATACC TCCAAGCTGC CGGCAATCAG TGGTGGTGAT GACCGTGCGG 

301 ACCGTGGACA TGACCGTGTG CGATTTCCTC ATCGGATGCA TCGCGCACAC 

351 TTTCAACCGT AGCCTTAAAG CGGATTTTCA TGCCTGCCAA AGGATGGTTG 

401 CCGTCCACCA CCGCCTTACC GTCGGCAACA TCGGTTACAC GATAGACGAC 

451 AACATCGCCG GTTTCAGGAT CGTCGGCTTC AAACATCATG CCGACTTCGA 

501 CTTCAACAGG GAACACGCCC GCATCTTCAA TACGGACCAA CTCCGGATCC 

551 TGCTCGCCGA ACGCATCGTC GGGCGAAAGC GCCACATCGA CCGTATCGCC 

601 GGCATCCTTA CCGTGCAACG CCTCTTCCAC CAAAGGGAAA ATGCCGTCGT 

651 AACCGCCGTG CAGATACGCA ATCGGTTCTT CGGTTTTGTC CAAAAGCTGA 

701 TTGTTGGCAT CATACATCTC ATAATGCAGC GAAACCACGG AATTCTTCAC 

751 GATAGCCATA TTTGTCCTTT CAGGAACAGC AGATTAATTA CAGGCGCATT 

801 CTAA 

This corresponds to the amino acid sequence <SEQ ID 916; ORF 241-l.a>: 

a2 4 1-1. pep 

1 MPTRPTRAAK HPTPPTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 

51 ANRRENFHNA QPTYLLHPSN KNPSEMEQTL FRRHQIPPSC RQSWVMTVR 

101 TVDMTVCDFL IGCIAHTFNR SLKADFHACQ RMVAVHHRLT VGNIGYTIDD 

151 NIAGFRIVGF KHHADFDFNR EHARIFNTDQ LRILLAERIV GRKRHIDRIA 

201 GILTVQRLFH QRENAWTAV QIRNRFFGFV QKLIVGIIHL IM QRNHGILH 

251 DSHICPFRNS RLITGAF* 

m241-l/a241-l 95.1% identity in 267 aa overlap 

10 20 30 40 50 60 

MPTRPTRAANPPTPPTWLQTAYCPRPPYRPPSVQTRTPREPASSTCAAKSANRRENSHNA 
I I I I I I I I ! : I I i I I II I I I I I I I I I I I I IN I I : II : I I 1 I I I I I I I I I I I ! I I III 
MPTRPTRAAKH PTPPTWLQTAYCPRPPYRPPSVQTHTPHEP AS STCAAKS ANRRENFHNA 
10 20 30 40 50 60 

70 80 90 100 110 120 

Q PT YLLH P SN KMP S ET EQTL FRRHQI P PS C RQS WVMT V RAVDMTV C D FL I GC I AHAFN C 
I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I [ I I I I I I I I I I : II 
QP T YLLH P SN EUI1P SEMEQTL FRRHQI P PS C RQS VVVMTVRTV DMT VCDFL1GC I AHTFT1R 
70 80 90 100 110 120 

130 140 150 160 170 180 

SLKADFHACQRMVAVHHRLAVGN I G YT I DDN I AGFRI VG FKHHADFDFNREHARI FDTDQ 
I III I III I II II II! II CM I 1111111111111 III IIIIIIH Ml It Hit:l II 
SLKADFHAC QRMVAVHHRLTVGNI GYT I DDN I AGFRI VG FKHHADFDFNREHARI FNTDQ 
130 140 150 160 170 180 

190 200 210 220 230 240 

LRILLAERIVGRQRHIDRIAGILTVQRLFHQRENAWTAVQIRNRFFGFVQKLIVGIIHL 
I I I I I I I I I I I f r I I i I M ! I M ! I I I I I I I I I I I I I I 1 I I I I I [ I I I I I I I I I I I I I I I 
LRILLAERI VGRKRHI DRI AGI LTVQRLFHQRENAVVTAVQIRNRFFGFVQKLIVGI1HL 
190 200 210 220 230 240 

250 260 
IMQRNHGIFHDSHICPFRNSRLITGAFX 
I III t 111:11 II lllllll lllllill 
IMQRNHGI LH DSH I C P FRNS RL I TG AFX 
250 260 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 91 7>: 
g242 . seq 

1 atgatcggcg aacttgttgt tttgttcgtg atcgagcact tcaagcaacg 

51 cgctggcggg atcgccccga aagtcgctgc ccaatttgtc gatttcgtcg 

101 agcaggaaca acgggtttct tacgcctgct tttgccatat tctgcaaaat 

151 cttgccgggc atagagccga tataggtacg gcggtgcccg cggatttcgc 

201 tttcgtcgcg cacgccgccc aaggccatac ggacatattt ccgccccgtt 

251 gctttggcga tggattcgcc caaagaggtt ttgcccacgc ccggagggcc 

301 gaccaaacac agaatcggac ctttgagctt gtccacacgt ttttggacgg 

351 cgaggtattc caaaatccgt tctttgactt tttccaggcc gtagtggtcg 

401 gcatccagca ccagtccggc tttggcgatg tctttgctga cgcgggattt 

451 tttcttccac ggcagtccga gcagggtgtc gatgtagttg cgtacgacgg 



ra24 1-1 .pep 
a241 

m241-l .pep 
a241 

m241-l .pep 
a241 

m2 4 1-1. pep 
a241 

ra241-l.pep 
a241 
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501 tggattcggc agacatcggc ggcatcattt tgagtttttt cagttcggac 

551 aggcattttt cttccgcttc tttggtcata cccgcctttt tgatgcctgc 

601 ctccaaggca tccagttcgc cgttttcgtc ttcttcgccc aattctttgt 

651 gtatcgcttt aatctgttcg ttcagataat attcgcgttg ggatttttcc 

701 atttggcgtt tgacgcgtcc gcgtatgcgt ttttcggcct gcataatgtc 

751 gagttcggat tccagctttg ccagcaggaa ttccatccgt ttgccgattt 

801 cgggaatctc caaaatctgt tggcgttgcg ccagtttcaa ctgcaaatgc 

851 gctgcgaccg tatcggttag 

This corresponds to the amino acid sequence <S£Q ID 918; ORF 242. ng>: 

g242 .pep 

1 MIGELWLFV IEHFKQRAGG IAPKVAAQFV DFVEQEQRVS YACFCHILQN 

51 LAGHRADIGT AVPADFAFVA HAAQGHTDIF PPRCFGDGFA QRGFAHARRA 

101 DQTQNRTFEL VHTFLDGEVF QNPFFDFFQA VWGIQHQSG FGDVFADAGF 

151 FLPRQSEQGV DWAYDGGFG RHRRHHFEFF QFGQAFFFRF FGHTRLFDAC 

201 LQGIQFAVFV FFAQFFV YR F NLFVQIIFAL GFFHLA FDA5 AYAFFGLHNV 

251 EFGFQLCQQE FHP7ADFGNL QNLLALRQFQ LQMRCDRIG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 919>: 

m242 . seq 

1 ATGATCGGCA AACTTGTTGT TTTGTTCGGG ATCGAGCACT TCGAGCAACG 

51 CGCTGGCGGG ATCGCCTCGG AAGTCGTTAC CCAATTTGTC GATTTCGTCG 

101 AGCAGGAACA AGGGGTTTTT CACGCCGGCT TTTGCCATAT TCTGCAAAAT 

151 CTTACCGGGC ATAGAGCCGA TATAGGTGCG GCGGTGTCCC CTGATTTCGC 

201 TTTCGTCGCG CACGCCGCCC AAAGCCATGC GGACATATTT CCGCCCCGTT 

251 GCTTTGGCGA TGGATTCGCC CAAAGAGGTT TTGCCCACGC CCGGAGGGCC 

301 GACCAGGCAC AGAATCGG3C CTTTGAGTTT GTCCATACGT TTTTGGACGG 

351 CGAGGTATTC CAAAATCCGT TCTTTGACTT TTTCCAGGCC GTAGTGGTCG 

401 GCATCCAGCA CCAGTCCGGC TTTGGCGATG TCTTTGCTGA CGCGGGATTT 

451 TTTCTTCCAC GGCAGCTCGA GCAAAGTGTC GATGTAGTTG CGTACGACGG 

501 TGGATTCCGC AGACATCGGT GGCATCATTT TGAGCTTTTT CAGTTCGGAC 

551 AGGCATTTTT CTTCCGCTTC TTTGGTCATA CCCGCCTTTT TGATATCTGC 

601 TTCCAAGGCA TCCAGTTCGC CGTTTTCGTC TTCTTCGCCC AGTTCTTTGT 

651 GTATCGCTTT AATCTGTTCG TTCAGATAAT ATTCGCGCTG GGATTTTTCC 

701 ATTTGGCGTT TGACGCGTCC GCGTATGCGT TTTTCGGCCT GCATAATGTC 

751 GAGTTCGGAT TCCAGCTGTG CCAGCAGGAA TTCCATCCGT TTGCCGATTT 

801 CGGGAATTTC CAAAATCTGT TGGCGTTGCG CCAGTTTCAA CTGCAAATGC 

851 GCTGCGACCG TATCGGTTAG 

This corresponds to the amino acid sequence <SEQ ID 920; ORF 242>: 

m242 . pep 

1 MIGKLWLFG IEHFEQRAGG IASEWTQFV DFVEQEQGVF HAGFCHILQN 

51 LTGHRADIGA AVSPDFAFVA HAAQSHADIF PPRCFGDGFA QRGFAHARRA 

101 DQAQNRAFEF VHTFLDGEVF QNPFFDFFQA VWGIQHQSG FGDVFADAGF 

151 FLPRQLEQSV DWAYDGGFR RHRWHHFELF QFGQAFFFRF FGHTRLFDIC 

201 FQGIQFAVFV FFAQFFV YR F NLFVQIIFAL GFFHLA FDAS AYAFFGLHNV 

251 EFGFQLCQQE FHPFADFGNF QNLLALRQFQ LQMRCDRIG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 242 shows 90.3% identity over a 289 aa overlap with a predicted ORF (ORF 242.ng) 
from N. gonorrhoeae: 

m242/g24290 . 3% identity in 289 aa overlap 

10 20 30 40 50 60 

m242 . pep MI GKLWLFG I EHFEQRAGG IAS EWTQFVDFVEQEQGV FHAG FCH I LQNLTGHRADI GA 

IIMIIII IIIMIIIIIl I II I II I I I I I I I I I I h I I I I I I h 

g 2 4 2 MIGELWLFV I EHFKQRAGGI APKVAAQFVDFVEQEQRVSYACFCHILQNLAGHRADIGT 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 42 . pep AVS PDFAFVAHAAQSHADI F PPR CFGDG F AQRG F AHARRADQ AQNRAFE FVHTF LDGEV F 

II IIMIIim:|:|IMIIIIIIIil!|l!||||||||:il|:||:|MIIIIH| 
g 2 4 2 AVPADFAFVAHAAQGHTDI FPPRCB'GDG FAQRGFAHARRADQTQNRTFELVHTFLDGEVF 

70 80 90 100 110 120 
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130 140 150 160 170 180 

QN PF FDF FQAVWG I QHQ SGFGDV FADAGFFL P RQLEQS VDVVA YDGG FRRHRWHH FEL F 

III MM IMIMII III II INI II Mill Nil 1 1 •■ 1 1 1 i 1 1 1 1 1 III HIM 

QN PF FDF FQAVWG I QHQSGFGDV FADAG FFL P RQS EQGVDWA YDGG FG RHRRHH F EF P 
130 140 150 160 170 180 

190 200 210 220 230 24G 

QFGQAFFFRFFGHTRLFDICFQGIQFAVFVFFAQFFVYRFNLFVQI I FALGFFHLAFDAS 

i mi 1 1 ii ii ii ii i ii hi i ii i iii mini mi ill minium ii ii ii 

QFGQAFFFRFFGHTRLFDACLQGIQFAVFVFFAQFFVYRFNLFVQI I FALGFFHLAFDAS 
190 200 210 220 230 24C 

250 260 270 280 290 

AYAFFGLHNVEFGFQLCQQEFHPFADFGNFQNLLALRQFQLQMRCDRIGX 

IIIIIIIIMIMII IMIIIMIIMIhlllllllll lllllllll II 

AYAF FGLHNVE FG FQLCQQE FH P FADFGNLQNLLALRQFQLQMRCDR I GX 
250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 92 1>: 

a242.seq 

1 ATGATCGGCG 

51 CGCTGGCGGG 

101 AGCAGGAACA 

151 CTT ACCGGGC 

201 TTTCGTCGCG 

251 GCTTTGGCGA 

301 GACCAGGCAC 

351 CGAGGTATTC 

401 GTATCCAGCA 

4 51 TTTCTTCCAC 

501 TGGATTCGGC 

551 AGGCATTTTT 

601 TTCCAAGGCA 

651 GTATCGCTTT 

701 ATTTGGCGTT 

751 GAGTTCGGAT 

801 CGGGAATTTC 

851 GCTGCGACCG 

This corresponds to the amino acid sequence <SEQ ID 922; ORF 242.a>: 

a242.pep 

1 MIGELWLLG IKHFEQRAGG IAPEVAXQFV DFVEQEQWVF YAGFCHILQN 

51 LTGHGADIGA AVSPDFAFVA HAAQSHADIF PPRCFGDGFA QRGFAHAWRA 

101 DQAQNRAFEF VHTFLDGEVF QNPFFDFFQA VVVGIQHQSG FGDV FADAG F 

151 FLPRQFEQGV DWAYDGGFG RHRRHHFELF QFGQAFFFRF FGHT RLFDIC 

201 FQGIQFAVFV F FAQFFVY RF NLFVQIIFAL GFFHL AFDAS AYAFFGLHNV 

251 EFG FQLCQQE FHPFADFGNF QNLLALRQFQ LQMRCDRIG* 

m242/a242 95.2% identity in 289 aa overlap 

10 20 30 40 50 60 

m242 . pep MIGKLWLFGIEHFEQRAGGIASEWTQFVDFVEQEQGVFHAGFCHILQNLTGHRADIGA 
I I I : I I t I : I I : I I t I I I I I I I I I I I I I I I I I I I ! i : I I I I [J ! I I I I I I Mill 
a242 MIGELWLLGIKHFEQRAGGIAPEVAXQFVDFVEQEQWVFYAGFCHILQNLTGHGADIGA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m242 . pep AVS PDFAFVAHAAQSHADI FPPRC FGDGFAQRGFAHARRADQAQNRAFEFVHT FLDGEVF 

M I I I I II I M i I I I II I I I I I I I M I I I I I I I I I I i I I I I I I I I I i I I I I I I I I I | | I 
a242 AVS PDFAFVAHAAQSHADIFPPRCFGDGFAQRGJAHAWRADQAQNRAFEFVHT FLDGEVF 

70 80 90 100 110 120 

130 140 150 160 1.70 180 

m242 . pep QN P F FD F FQAVWG I QHQS G FG D V FA DAG FFLPRQLEQSV D WA Y DGG FRR HRWHH FE L F 

I I N I I I II I I I I I I I I I I I I I I I | | | | | | | | | | | j : | | : | | | | | | | M | ||t | | | | | | 



11)24 2. pep 
g242 

m24 2.pep 
g242 

m24 2.pep 
g242 



AACTTGTTGT 
ATCGCCCCGG 
ATGGGTTTTT 
ATGGAGCCGA 
CACGCCGCCC 
TGGATTCGCC 
AGAATCGG3C 
CAAAATCCST 
CCAATCCGGC 
GGCAGTTCGA 
AGACATCGGC 
CTTCCGCTTC 
TCCAGTTCGC 
AATCTGTTOG 
TGACGCGTCC 
TCCAGCTGTG 
CAAAATCTGT 
TATCGGTTAG 



TTTGCTCGGG 
AAGTCGCTAN 
TACGCCGGCT 
TATAGGTGCG 
AAAGCCATGC 
CAAAGAGGTT 
CTTTGAGTTT 
TCTTTGACTT 
TTTGGCGATG 
GCAGGGTGTC 
GGCATCATTT 
TTTGGTCATA 
CGTTTTCGTC 
TTCAGATAAT 
GCGTATGCGT 
CCAGCAGGAA 
TGGCGTTGCG 



ATCAAGCACT 
CCAATTTGTC 
TTTGCCATAT 
GCGGTGTCCC 
GGACATATTT 
TTGCCCACGC 
GTCCATACGT 
TTTCCAGGCC 
TCTTTGCTGA 
GATGTAGTTG 
TGAGCTTTTT 
CCCGCCTTTT 
TTCTTCGCCC 
ATTCGCGCTG 
TTTTCGGCCT 
TTCCATCCGT 
CCAGTTTCAA 



TCGAGCAACG 
GATTTCGTCG 
TCTGCAAAAT 
CGGATTTCGC 
CCGCCCCGTT 
CTGGAGGGCC 
TTTTGGACGG 
GTAGTGGTCG 
CGCGGGATTT 
CGTACGACGG 
CAGTTCGGAC 
TGATATCTGC 
AGTTCTTTGT 
GGATTTTTCC 
GCATAATGTC 
TTGCCGATTT 
CTGCAAATGC 
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a242 QNPFFDFFQAVWGIQHQSGFGDVFADAGFFLPRQFEQGVDWAYDGGFGRHRRHHFELF 
130 140 150 160 170 180 

190 200 210 220 230 240 

. ra242 .pep QFGQAFFFRFFGHTRLFDICFQGIQFAVFVFFAQFFVYRFNLFVQIIFALGFFHLAFDAS 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a242 QFGQAFFFRFFGHTRLFDICFQGIQFAVFVFFAQFFVYRFNLFVQIIFALGFFHLAFDAS 
190 200 210 220 230 240 

250 260 270 280 290 

m242 . pep AYAFFGLHNVEFGFQLCQQEFHPFADFGNFQNLLALRQFQLQNRCDRIGX 
I I I I I I 1 1 1 1 1 1 ! 1 1 I I i I I I I M I i f I f 1 1 M I I I I II I 1 1 I I I I I I I I 
a242 AYAFFGLHNVE FGFQLCQQE FH P FADFGNFQNLLALRQFQLQMRC DR I GX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 923>: 

g24 3 -seq 

1 ATGGTaatcg tctGGTTGCc cgAGTTaccg CCGATGCCGG CGACGATGGG 

51 CATCAGCGCG GCGAGTGCGA CGATTTTTTC gatactgcCT TCAAACGCGC 

101 CGATGACGCG GCTGGCGAGG AAGGCGGTGC AGAGGTTGAC GGCGAGCCAC 

151 ATCCAGCGGT TTTTGACGGA ATCCAAGACG GGGGCGAACA GGTCTTCCTC 

201 TTCCTGCAAA CCTGCCATGT TCAACATATC CGCTTCGGAT TCTTCGCGGA 

251 TCACGTCCAC CATCTCGTCG ATGGTAATCc tgCCGATGAG CTTTTTGTTT 

301 TCATCAACGA CGGGCGCGGT AACCAAGTCG TAG 

This corresponds to the amino acid sequence <SEQ ED 924; ORF 243 .ng>: 

g243.pep 

1 MVIVWLPELP PMPATMGISA ASA TIFSILP SNAPMTRLAR KAVQRLTASH 
51 IQRFLTESKT GANRSSSSCK PAMFNISASD SS RITSTISS MVILPMSFLF 
101 SSTTGAVTKS * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 925>: 

m24 3 . seq 

1 ATGGTAATCG TCTGGTTGCC CGAGTTACCG CCTATGCCGG CGACGATGGG 

51 CATCAGCGCG GyGAGTGCGA CGATTTTTTC GATGCTGCCT TCAAACGCGC 

101 CGATAACACG GyTGGCGAGG AAGGCGGTGC AGAGGTTGAC GGCGAGCCAC 

151 ATCCAGyGGT TTTTCACCGA ATCCCACACG GGGGCGAAyA GGTCTTCCTC 

201 TTCCTGCAAA CCCGCCATAT TCAG CATATC CGCTTCCGAT TCTTCGCGGA 

251 TCACGTCCAC CATCTCGTCG ATGGTAATCC TGCCGATGAG CTTTTTGTTT 

301 TCATCGACGA CGGGCGCGGT AACCAAGTCG TAG 

This corresponds to the amino acid sequence <SEQ ID 926; ORF 243>: 

m243 .pep 

1 MVIVWLPELP PMPATMGISA XSAT IFSMLP SNAPITRLAR KAVQRLTASH 

51 IOXFFTBSHT GANRSSSSCK PAIFSISASD SSR ITSTISS MVILPMSFLF 

101 SSTTGAVTKS * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 243 shows 92.7% identity over a 1 10 aa overlap with a predicted ORF (ORF 243. ng) 
from N. gonorrhoeae: 

m243/g243 

10 20 30 40 50 60 

m243 . pep MVIVWLPELP PMPATMGISAXS ATI FSMLPSNAPITRLARKAVQRLTASHIQXFFTESHT 

llllllllMMIIIIilll III I lh lllllhl I II I II IMIM IM I hllhl 
g243 MVI VWLPELPPMPATMG I S AAS ATI FS I LPSNAPMTRLAR KAVQRLTASH I QRFLTESKT 

10 20 30 40 50 60 

70 80 90 100 110 

m243 . pep GANRSSSSCKPAIFSISASDSSRITSTISSMVILPMSFLFSSTTGAVTKSX 

lllllllllllhhllllllllllllllllllllllllllllllllllll 

g243 GANRSSSSCKPAMFNISASDSSRITSTISSMVILPMSFLFSSTTGAVTKSX 
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70 80 90 100 110 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 927>: 

a243.seq 

1 ATGGTAATCG TCTGGTTGCC CGAGTTACCG CCTATGCCGG CGACGATGGG 

51 CATCAGCGCG GCGAGTGCGA CGATTTTTTC GATGCTGCCT TCAAACGCGC 

101 CGATAACACG GCTGGCGAGG AAGGCGGTGC AGAGGTTGAC GGCGAGCCAC 

151 ATCCAGCGGT TTTTGACGGA ATCCAAGACG GGGGCGAATA AGTCTTCCTC 

201 TTCTTGCAAA CCCGCCATAT TCAACATATC CGCTTCGGAT TCTTCGCGGA 

251 TCACGTCCAC CATTTCGTCA ACGGTCACCC TGCCGATGAG CTTTTTGTTT 

301 TCATCGACGA CGGGCGCGGT AACCAAGTCA TAG 

This corresponds to the amino acid sequence <SEQ ID 928; ORF 243. a>: 

a243 .pep 

1 MVIVWLPELP PMPATMGISA ASA TIFSMLP SNAPITRI.AR KAVQRLTASH 

51 IQRFLTESKT GANKSSSSCK PAIFNISASD SSRITSTISS TVTLPMSFLF 

101 SSTTGAVTKS * 

m243/a243 92.7% identity in 1 1 0 aa overlap 

10 20 30 40 50 60 

ru243 . pep MVIVWLPELPPMPATMGISAXSATIFSMLPSNAPITRLARECAVQRLTASHIQXFFTESHT 
! I I I I I M I I I I I I I I I I I I I I I I I I I I I I 1 I I I | | | I I | I | | | | | | | | | | I : I | I : i 
a243 MVIVWLPELPPMPATMGISAASATIFSMLPSNAPITRLARKAVQRLTASHIQRFLTESKT 
10 20 30 40 50 60 

70 80 90 100 110 

GANRS S SSCKPAI FS I SAS DSSRIT ST I SSMVI L PMS FLFS STTGAVTKSX 
I I I : I I ( I I I I I I I : I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I 
GANKS S SS CKPAI FN I SAS DS SR IT ST IS ST VTL PMS FLFS STTGAVTKSX 
"70 80 90 100 110 



m243.pep 
a243 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 929>: 

g244 . seq 

1 atgccgcctg aagcccggcc ggcgggttca gacggcattg ccgctttact 

51 tcgatcggtt tatacgcaaa acgcgcttca ggaaataaat cagattattc 

101 cccagacgcc ttcaggcttc cttccgtgcc accgtaacca tagccgggcg 

151 caacacacgg tcggacaggg tataaccctt cttcatcaca ccaaccacgg 

201 tattgggttc ctgctcactg gccaccgcct gcatcgcctg atggatattc 

251 ggatcgagct tatcgcccgc tttaggattg atttccttga tttgcgtggc 

301 atcaaacgcc ttctgcaact cattcaaagt catctgcaca cccattttca 

351 gcgcatcgaa attaccgctc tgatccaaaa gcgccatttc cagataatcc 

401 ttgaccggca acatttccac ggcaaacttc tgtccggcga acttgtgcgt 

451 atcggcaatt tcctgctggt ggcggcggcg caggttttgc tcgtttgcca 

501 aagcgcgcag ttgttcgtct ttcaactgcg cttccagctc ggcaatccgc 

551 gcctgcaaat cctcataagc cggctcggcg gcagcctgtt cctgtacacc 

601 gtccgcattt cctactgtct cgacggtttc caccgcctcc acattttcaa 

651 ccgcttcttc actgttttgc tgctgtgtct gttcgctcat atcgtatccc 

701 tcaaaacaaa ttggaaatca aaatccggtt attacccgag caagataagg 

751 acattttcaa gaaacttcaa gcaaaggcag gaaatttcac atccgccgcc 

801 gaatacccta ccgcaaaaac catataaacg gtaa 
This corresponds to the amino acid sequence <SEQ ID 930; ORF 244.ng>: 

g244 .pep 

1 MPPEARPAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LPCHRNHSRA 

51 QHTVGQGITL LHHTNHGIGF LLTGHRLHRL MDIRIELIAR FRIDFLDLRG 

101 IKRL»LQIiIQS HLHTHFQRIE ITALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IGNFLLVAAA QVLLV CQSAQ LFVFQLRFQL GNPRLQILIS RLGGSLFLYT 

201 VRISYCLDGF HRLHIFN RFF TVLLLCLFAH IVSLK TNWKS KSGYYPSKIR 

251 TFSRNFKQRQ EISHPPPNTL PQKPYKR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 93 1>: 

m244.seq 
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1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGCGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACA CCCACCACGG 

201 TATTCGGCTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGGTTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

401 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

451 ATCCGCAATT TyCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCTG CTCGTCTTTC AACTG CGTTT CCAGCTCGGC AATCCGCGCC 

551 TGCAAATCCT CATAAGCCGG CTCTGCGGCA GCCTGTTCCT GCACACCGTC 

601 CGCATTTCCT ACTGTTTCGA CGGTTTCCAC CGCCTCCACA TTTTCAACCG 

651 CTTCTTCACT GTTTTGCTGC TGTGTCTGTT CGCTCATATC GTATCCCTTA 

701 AAACAAATTG GAAATCAAAA TCCAGTTATT ACCCGCGCAA GATAAGGACA 

751 TTTTCAAGAA ACTTCAAkGA AAAkCAGAGA ATTTCAAATT CATTTTCAAA 

801 TCCCCTACCG AAAAAATAAT ATAGACGGTA A 

This corresponds to the amino acid sequence <SEQ ID 932; ORF 244>: 

m24 4 .pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLRHRNHSRA 

51 QHAVGQRITL LHHTHHGIRL LFACHRLHRL MDIRIELIAR FRVDFLDLRS 

101 IKCFLQLVQS HLHAHFQRI E IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IRNFLLVAAA QVLLV CQSAL LVFQLRFQLG NPRLQILISR LCGSLFLHTV 

201 RISYCFDGFH RLHIFN RFFT VLLLCLFAHI VSLK TNWKSK SSYYPRKIRT 

2 51 FSRNFXQXQR ISNSFSNPLP KKXYRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 244 shows 86.3% identity over a 277 aa overlap with a predicted ORF (ORF 244.ng) 
from N. gonorrhoeae: 

M244/G244 



10 20 30 40 50 60 

m2 4 4 . pep MPSEARQAGSDGIAALLRSVYTQNALQEINQI I PQT PSGFLLRHRNHSRAQHAVGQR I TL 

II III MIIIIIIIMIIIIIIIIIMIIIIIIIIIIII llllllllhlll 111 
g2 4 4 MPPEARP AGSDGI AALLRS VYTQNALQEINQI I PQTPSGFLPCHRNHSRAQHTVGQG I TL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m244 . pep LHHTHHG I RLLFACHRLHRLMDI RI ELI AR FRVDFLDLRS I KCFLQLVQSHLHAHFQRI E 

lllhlll «h« MIIIIIMMIIIIIIhMlllhll MlhllMhllllll 

g24 4 LHHTNHGIGFLLTGHRLHRLMDIRIELIARFRIDFLDLRGIKRLLQLIQSHLHTHFQRIE 
70 80 90 100 110 120 



130 140 150 160 170 180 

m244 . pep I AALIQKRHFQI I LDRQHFHGKLLSGELVR IRNFIjLVAAAQVLLVCQSAALLVFQLRFQL 

|:|IIIIIIIIIIIIIIIIIIIIIMIIIII lllllllllllllllll hllllllll 
g 2 4 4 I TALIQKRHFQI I LDRQHFHGKLLS GE LVR IGNFLLVAAAQVLLVCQS AQLFVFQ LRFQL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m24 4 .pep GNP RLQ I LI S RLCGS LFLHTVR I S YC FDG FHRLH I FNRFFTVLLLCLFAH I VSLKTNW KS 

IIIMIIIMM llllhll III M-llll II II III IIMII III lllllllilli M 

g244 GNPRLQlLISRLGGSLFLYTVRISYCLDGFHRLHIFNRFFTVLLLCLFAHIVSLKm^KS 
190 200 210 220 230 240 



250 260 270 

m244 .pep KSSYYPRKIRTFSRNFXQXQRISNSFSNPLPKKXYRRX 
Ihlll lllllllll I |:||: I t I : I hll 
g244 KSGYYPSKIRTFSRNFKOROEISHPPPNTLPQKPYKRX 

250 260 270 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 933>: 

a24 4 . seq 

1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGTGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACG CCCACCACGG 

201 TATTGGGTTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGATTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

401 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

4 51 ATCCGCAATT TCCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCAG CTGCTCGTCT TTCAACTGCG CTTCCAGCTC GGCAATCCGC 

551 GCCTGCAAAT CCTCATAAGC CGGCTCTGCG GCAGCCTGTT CCTGCACACC 

601 GTCCGCATTT CCTACTGTCT CGACGGTTTC CACCGCCTCC ACATTTTCAA 

651 CCGCTTCTTC ACTGTTTTGC TGCTGTGTCT GTTCGCTCAT ATCGTATCCC 

701 TTAAAACAAA TTGGAAATCA AAATCCAGTT ATTACCCGCG CAAGATAAGG 

751 ACATTTTCAA GAAACTTCAA GCAAAGGCAG AGAATTTCAA ATTCATTTTC 

801 AAATCCCCTA CCGAAAAAAT AATATAGACG GTAA 

This corresponds to the amino acid sequence <SEQ ID 934; ORF 244.a>: 

a244 .pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLCHRNHSRA 

51 QHAVGQRITL LHHAHHGIGF LFACHRLHRL MDIRIELIAR FRIDFLDLRS 

101 IKCFLQLVQS HLHAHFQRIE IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IRNFLL VAAA QVLLVCQSAQ LLV FQLRFQL GNPRLQILIS RLCGSLFLHT 

201 VRISYCLDGF HRLHIFNR FF TVLLLCLFAH IVSLK TNWKS KSSYYPRKIR 

251 TFSRNFKQRQ RISNSFSNPL PKK*YRR* 

m244/a244 96.8% identity in 277 aa overlap 

10 20 30 40 50 60 

MPSEARQAGSDGIAALLRSVYTQNALQEINQIIPQTPSGFLLRHRNHSRAQHAVGQRITL 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I II I I I I II I I I 
MPSEARQAGSDGIAALLRSVYTQNALQEINQIIPQTPSGFLLCHRNHSRAQHAVGQRITL 
10 20 30 40 50 60 

70 80 90 ICO 110 120 

LHHTHHGIRLLFACHRLHRLMDIRIELIARFRVDFLDLRSIKCFLQLVQSHLHAHFQRIE 
111:1111 : I I I I I I I I I 1 I I I I I I I I I II I : I II I I I I 1 I I ! I I I I I I I II I II I I I I 
LHHAHHGIGFLFACHRLHRLMDIRIELIARFRIDFLDLRSIKCFLOLVQSHLHAHFQRIE 
70 80 90 ICO 110 120 

130 140 150 160 170 179 

IAALIQKRHFQIILDRQHFHGKLLSGELVRIRNFLLVAAAQVLLVCQSA-LLVFQLRFQL 
I I I I t I I I I I I I I I I II I I t I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I 
IAALIQKRIIFQIILDRQHFHGKLLSGELVRIRNFLLVAAAQVLLVCQSAQLLVFQLRFQL 
130 140 150 160 170 180 

180 190 200 210 220 230 239 

GNPRLQILISRLCGSLFLHTVRISYCFDGFHRLHIFNRFFTVLLLCLFAHIVSLKTNWKS 
I I I I I I I I II I I f I I I I II I I I I I II: I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I 
GNPRLQILISRL2GSLFLHTVRISYCLDGFHRLHIFNRFFTVLLLCLFAHIVSLKTNWKS 
190 200 210 220 230 240 

240 250 260 270 

KSSYYPRKIRTFSRNFXQXQRISNSFSNPLPKKXYRRX 
I I I I I I I I I I I I I I I I I I I I t II II I I I I I I I i I I I 
KSSYYPRKIRTFSRNFKQRQRISNSFSNPLPKKXYRRX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 935>: 

g244-l.eeq 

1 atgccgcctg aagcccggcc ggcgggttca gacggcattg ccgctttact 



m24 4 .pep 
a244 

m24 4 -pep 
a244 

m24 4 .pep 
a244 

m244 .pep 
a244 

m244 .pep 
a244 
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51 tcgatcggtt tatacgcaaa acgcgcttca ggaaataaat cagattattc 

101 cccagacgcc ttcaggcttc cttccgtgcc accgtaacca tagccgggcg 

151 caacacacgg tcggacaggg tataaccctt cttcatcaca ccaaccacgg 

201 tattgggttc ctgctcactg gccaccgcct gcatcgcctg atggatattc 

2 51 ggatcgagct tatcgcccgc tttaggattg atttccttga tttgcgtggc 

3 01 atcaaacgcc ttctgcaact cattcaaagt catctgcaca cccattttca 
351 gcgcatcgaa attaccgctc tgatccaaaa gcgccatttc cagataatcc 

4 01 ttgaccggca acatttccac ggcaaacttc tgtccggcga acttgtgcgt 
4 51 atcggcaatt tcctgctggt ggcggcggcg caggttttgc tcgtttgcca 
501 aagcgcgcag ttgttcgtct ttcaactgcg cttccagctc ggcaatccgc 
551 gcctgcaaat cctcataagc cggctcggcg gcagcctgtt cctgtacacc 
601 gtccgcattt cctactgtct cgacggtttc caccgcctcc acattttcaa 
651 ccgcttcttc actgttttgc tgctgtgtct gttcgctcat atcgtatccc 
701 tcaaaacaaa ttggaaatca aaatccggtt attacccgag caagataagg 
751 acattttcaa gaaacttcaa gcaaaggcag gaaatttcac atccgccgcc 
801 gaatacccta ccgcaaaaac catataaacg gtaa 

This corresponds to the amino acid sequence <SEQ ID 936; ORF 244-l.ng>: 

g244-l.pep 

1 MPPEARPAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LPCKRNHSRA 

51 QHTVGQGITL LHHTNHGIGF LLTGHRLHRL MDIRIELIAR FRIDFLDLRG 

101 IKRLLQLIQS HLHTHFQRIE ITALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IGNFLLVAAA QVLLV CQSAQ LFVFQLRFQL GNPRLQILIS RLGGSLFLYT 

201 VRISYCLDGF HRLHIFNR FF TVLLLCLFAH IVSLK TNWKS KSGYYPSKIR 

251 TFSRNFKQRQ EISHPPPNTL PQKPYKR* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 937>: 

m244-l.seq 

1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGCGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACA CCCACCACGG 

201 TATTCGGCTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGGTTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

401 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

451 ATCCGCAATT TyCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCTG CTCGTCTTTC AACTGCGTTT CCAGCTCGGC AATCCGCGCC 

551 TGCAAATCCT CATAAGCCGG CTCTGCGGCA GC CTGTTCCT GCACACCGTC 

601 CGCATTTCCT ACTGTTTCGA CGGTTTCCAC CGCCTCCACA TTTTCAACCG 

651 CTTCTTCACT GTTTTGCTGC TGTGTCTGTT CGCTCATATC GTATCCCTTA 

701 AAACAAATTG GAAATCAAAA TCCAGTTATT ACCCGCGCAA GATAAGGACA 

751 TTTTCAAGAA ACTTCAAkCA AAAkCAGAGA ATTTCAAATT CATTTTCAAA 

801 TCCCCTACCG AAAAAATAA 

This corresponds to the amino acid sequence <SEQ ID 938; ORF 244- 1>: 

m244-l.pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLRHRNHSRA 

51 QHAVGQRITL LHHTHHGIRL LFACHRLHRL MDIRIELIAR FRVDFLDLRS 

101 IKCFLQLVQS HLHAHFQRIE IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IRNFLLVAAA QVLLV CQSAL LVFQLRFQLG NPRLQILISR LCGSLFLHTV 

201 RISYCFDGFH RLHIFNR FFT VLLLCLFAHI VSLK TNWKSK SSYYPRKIRT 

251 FSRNFXQXQR ISNSFSNPLP KK* 

m244-l/G244-l 86.3% identity in 277 aa overlap 

10 20 30 40 50 60 

m244- 1 . pep MP S EARQAG SDG I AALLRS VYTQNALQE INQ 1 1 PQT P SG FLLRHRNHSRAQHAVGQR I TL 

II 111 llllllllllllilllllllllllilllllllll lillllllMII III 

g 24 4 - 1 MPPEARPAGSDGI AALLRSVYTQNALQEINQI IPQTPSGFLPCHRNHSRAQHTVGQGITL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ro24 4 - 1 . pep LHHTHHGIRIJiFACHRLHRI,MDIRIELIARFRVDFLDLRSIKCFLQLVQSHLHAHFQRIE 
1111 = 111 -h. IIM.IIIMIMIIilhilllihll :|fhll|||:|||||| 
g244- 1 IJfflTNHGIGFLLTGHRLHRLiMDIRIE LIAR FRIDFLDLRG I KRLLQL I QSHLHTHFQRIE 

70 80 90 100 110 120 



WO 99/57280 



PCT/US99/09346 



563 



130 140 150 160 170 180 

m244 - 1 . pep IAALIQKRHFQI ILDRQHFHGKLLSGELVRIRNFLLVAAAQVLLVCQSAALLVFQLRFQL 

h !M Mi! II ! II lllllll II Mi llili lllllllllllllllll hllllllll 

g244 - 1 ITALIQKRHFQI I LDRQHFHGKLLSG ELWIGNFLLVAAAQVLLVCQSAQLFVFQLRFQL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m244- 1 . pep GN PRLQI L I S RLCGS LFLHTVR I S YC FDGFHRLHI FNRFFTVLLLC LFAH I VS LKTNWKS 

11 lllillllii I III h I! Illlhllllli II III! lilllill II II IIMI Mil 

g244-l GN PRLQI LIS RLGG SLFLYTVRI S YC LDGFHRLHI FNR FFTVLliLCLFAH I VS LKTNWKS 

190 200 210 220 230 240 



250 260 270 

m24 4 - 1 . pep KSSYYPRKIRTFSRNFXQXQRISNSFSNPLPKKX 

I I : 1 1 I Ml' I | hlh I I I : i 
g24 4 - 1 KSGYYPSKIRTFSRNFKQRQEISHPPPNTLPQKPYKRX 

250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 939>: 

a244-l.seq 

1 ATGCCGTCTG AAGCCCGACA GGCGGGTTCA GACGGCATTG CCGCTTTACT 

51 TCGATCGGTT TATACGCAAA ACGCGCTTCA GGAAATAAAT CAGATTATTC 

101 CCCAGACGCC TTCAGGCTTC CTTCTGTGCC ACCGTAACCA TAGCCGGGCG 

151 CAACACGCGG TCGGACAGCG TATAACCCTT CTTCATCACG CCCACCACGG 

201 TATTGGGTTC CTGTTCGCTT GCCACCGCCT GCATCGCCTG ATGGATATTC 

251 GGATCGAGCT TATCGCCCGC TTTAGGATTG ATTTCCTTGA TTTGCGTAGC 

301 ATCAAATGCT TTCTGCAACT CGTTCAAAGT CATCTGCACG CCCATTTTCA 

351 GCGCATCGAA ATTGCCGCTC TGATCCAAAA GCGCCATTTC CAGATAATCC 

401 TTGACCGGCA GCATTTCCAC GGCAAACTTC TGTCCGGCGA ACTTGTGCGT 

451 ATCCGCAATT TCCTGCTGGT GGCGGCGGCG CAGGTTTTGC TCGTTTGCCA 

501 AAGCGCGCAG CTGCTCGTCT TTCAACTGCG CTTCCAGCTC GGCAATCCGC 

551 GCCTGCAAAT CCTCATAAGC CGGCTCTGCG GCAGCCTGTT CCTGCACACC 

601 GTCCGCATTT CCTACTGTCT CGACGGTTTC CACCGCCTCC ACATTTTCAA 

651 CCGCTTCTTC ACTGTTTTGC TGCTGTGTCT GTTCGCTCAT ATCGTATCCC 

701 TTAAAACAAA TTGGAAATCA AAATCCAGTT ATTACCCGCG CAAGATAAGG 

751 ACATTTTCAA GAAACTTCAA GCAAAGGCAG AGAATTTCAA ATTCATTTTC 

801 AAATCCCCTA CCGAAAAAAT AA 



This corresponds to the amino acid sequence <SEQ ID 940; ORF 244-1 .a>: 

a244-l .pep 

1 MPSEARQAGS DGIAALLRSV YTQNALQEIN QIIPQTPSGF LLCHRNHSRA 

51 QHAVGQRITL LHHAHHGIGF LFACHRLHRL MDIRIELIAR FRIDFLDLRS 

101 IKCFLOLVQS HLHAHFQRIE IAALIQKRHF QIILDRQHFH GKLLSGELVR 

151 IRNFLL VAAA QVLLVCQSAQ LLV FQLRFQL GNPRLQILIS RLCGSLFLHT 

201 VRISYCLDGF HRLHI FNR FF T\ f LLLCLFAH IVSLK TOWKS KSSYYPRKIR 

251 TFSRNFKQRQ RISNSFSNPL PFvK* 



m244-l/a244-l 96.8% identity in 274 aa overlap 



10 20 30 40 50 60 

m24 4-1 . pep MPSEARQAGSDGIAALLRSVYTQNALQEINQII PQTPSGFLLRHRNHSRAQHAVGQRITL 

ii i iimimn ii ii inn ii i mm ii i! ii i in 1 1 1 1 1 1 1 1 m i m 1 1 

a244-l MPSEAROAGSDGIAALLRSVYTQNALOEINQIIPQTPSGFLLCHRNHSRAQHAVGQRITL 
10 20 30 40 50 60 



70 80 90 100 110 120 

m24 4-l . pep LHHTHHGIRLLFACHRLHRLMDI RI ELIARFRVDFLDLRSI KC FLQLVQSHLHAHFQRIE 

111:1111 : I I III I llll I III I I III I I I : III I I I I III I III I I I I ill I I Hi I 
a24 4-l LHHAHHGIGFLFACHRLHRLMDI RI EL I ARFR1 DFLDLRSI KCFLQL VQSHLHAHFQRI E 

70 80 90 100 110 120 



130 140 150 160 170 179 

m2 4 4-1. pep IAALIQKRHFQI I LDRQHrHGKLLSGELVRIRNFLLVAAAQVLLVCQS A- LLVFQLRFQL 

mimnmmiminniHiiiiiiiHiimimim iiiimni 

a244-l I AALI QKRH FQI I LDRQH FHGKLLSGELVRIRM FLLV AAAQVL L VCQS AQLL V FQLRFQL 

130 140 150 160 170 180 



180 190 200 210 220 230 239 

m24 4-l.pep GNP RLQI LI SRLCGS LFLHTV RI SYC FDGFHRLHI FN RF FT VLLLC LFAH I VS LKTNWKS 

M 1 I I I I i I I M I M M f I I M 11 I I : M I 1 1 I I t I 1 I I I I I I I I I I I I I M I I I f I II I 
a 2 4 4 - 1 GNPRLQI LI SRLCGS LFLHTVRI S YCLDGFHRLHI FN RFFTVLLLCLFAH I VS LKTNWKS 

190 200 210 220 230 240 



WO 99/57280 



PCT/US99/09346 



564 



240 250 260 270 

m24 4-l.pep KSSYYPRKIRTFSRNFXQXQRISNSFSNPLPKKX 
I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I 
a24 4-l KSSYYPRKIRTFSRNFKORORISNSFSNPLPKKX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 941 >: 

g246.seq 

1 atgtacgggc ggaacggtag tactcaagcg gccgttgcct tcgttttcga 

51 ccagacacag cgtgcccgtt tcggcaacgg cgaagtttac gccgctcaag 

101 ccgacatcgg cagtgctgta aatatcgcgc agggctttgc gggcgaatcc 

151 ggtcagttgg tccacgtcgt ctgtaagcgg tgtgccgagg ttttggtgga 

201 acagttcgct gacctgttct ttggttttat ggattgcggg catcacgata 

251 tgggtcggtc tttcgcctgc catttggacg ataaactcgc ccaagtcgct 

301 ttccaccgcc ttaatgcctt ttgcttcaag ataatggttc agctcgattt 

351 cttcgctgac catggatttg cctttgacca tcagcttgcc gtttttggct 

401 gtgatgatgt cgtggataat ttggcaggct tcggcagggg tttccgccca 

4 51 gtgtactttc acgcccaact tagtcaggtt ttcttccaac tgctccagca 

501 gcgcgggtaa 

This corresponds to the amino acid sequence <SEQ ID 942; ORF 246.ng>: 

g246.pep 

1 MYGRNGSTQA AVAFVFDQTQ RARFGNGEVY AAQAD IGSAV NIAQGFAGES 

51 GQLVHWCKR CAEVLVEQFA DLFFGFMDCG HHDMGRFFAC HLDDKLAQVA 

101 FHRLNAFCFK IMVQLDFFAD HGFAFDHQLA VFGCDDWDN LAGFGRGFRP 

151 VYFHAQLSQV FFQLLQQRG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 943>: 

m246.seq (partial) 

1 ATGCACGGGC GGTACGGTGG TACTCAAGCG ACCGTTgCTT CGTTTTCCAC 

51 CAGACACAGC GTACCTGTTT CAGCAACGGC AAAGTTTACG CCACTCAAAC 

101 CGACATCGGC AGTGCTGTAA ATATCGCGCA GTGCTTTACG GGCGAAGCCG 

151 GTCAGTTGGT CTACATCGTC TGTCAGCGGC GTACCGAGGT TTTGGTGGAA 

201 CAGTTCGCTA ACCTGTTCTT TGGTTTTGTG GATAGCAGGC ATCACGATAT 

251 GGGTCGGTTT TTCGCCTGCC ATTTGGACGA TGAACTCGCC CAAGTCGCTT 

301 TCTACCGCTT TAATGCyTTT TGCTTCAAGA TAATGrTTCA GCTCGATTTC 

3 51 CTCGCTGACC ATCGATTTGC CTTTGACCAT CAGCTTGCCG TTTTTGGCTG 

401 TGATGATGTC GTGGATAATT TGGCAGGCTT CGGTCGGGGT TTCTGCCCG. . . 

This corresponds to the amino acid sequence <SEQ ID 944; ORF 246>: 

m246.pep (partial) 

1 MHGRYGGTQA TVAFVFHQTQ RTCFSNGKVY ATQT DIGSAV NIAQCFTGEA 

51 GQLVYIVCQR RTEVLVEQFA NLFFGFVDSR HHDMGRFFAC HLDDELAQVA 

101 FYRFNAFCFK IMXQLDFLAD HRFAFDHQLA VFGCDDWDN LAGFGRGFCP . . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 246 shows 80.0% identity over a 150 aa overlap with a predicted ORF (ORF 246.ng) 
fromJV. gonorrhoeae: 

m246/g246 

10 20 30 40 50 60 

m2 4 6 . pep MHGRYGGTQATVAFVFHQTQRTCFSNGKV YATQTDI GS AVN I AQCFTGEAGQLVY I VCQR 

hll hllhlllll lllh hlhllhhllllllllll hlhlllh:|h: 

g24 6 MYGRNGS TQAAVAFV FDQTQRAR FGNGEV YAAQAD I GS AVN I AQG FAGESGQL VHWCKR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 4 6 . pep RTEVLVEQFANLFTOFVDSRHHDMGRFFACHLDDEIiAOVAFYRFNAFCFKIMXOLDFIiAD 

Ollllllhllllhl IIIIIIIMI II I M II II hh II 1 1 II 1 1 Nihil 

g246 CAEVLVEQFADLF FGFMD CGHHDMGRF FACH LDDKLAQ VAFH RLNAFC FKI MVQLDF FAD 
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70 80 90 100 110 120 

130 140 150 

m2 4 6 . pep HR FAFDHQLAVFGCDD WDNLAG FGRG FC P 

I lllllllllllllillllllllllll I 
g246 HGFAFDHQLAVFGCDDVVDNltAGFGRGFRPVYFHAQLSOVFFQLLQQRGX 
130 140 150 160 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 945>: 

a246.seq (partial) 

1 . ATGCACGGGC GGAACGGTGG TACTCAAGCG ACCGTTGCCT TCGTTTTCCA 

51 CCAGACACAG CGTACCTGTT TCAGCAACGG CGAAGTTCAC GCCACTCAAA 

101 CCGACATCGG CAGTGCTGTA AATATCGCGC AGTGCTTTAC GGGCGAAGCC 

151 GGTCAGTTGG TCTACGTCGT CCGTTAACGG TGTGCCGAGG TTTTGGTGGA 

201 ACAGTTCGCT AACCTGTTCT TTGGTTTTAT GGATTGCGGG CATCACGATA 

251 TGGGTCGGTT TTTCACCTGC CATTTGGACG ATGAACTCGC CCAAGTCGCT 

301 TTCCACCGCT TTAATGCCTT TTGCTTCAAG ATAATGGTTC AGCTCGATTT 

351 CCTCGCTGAC CATCGATTTG CCTTTGACCA TCAGCTTGCC GTTTTTGGCT 

401 GTGATGATGT CGTGGATGAT TTCGCAGGCT TCGGCCGGTG TTTCCGCCCA 

451 GTGTACTTTT ACGCCCAACT TGGTCAGGTT TTCTTCCAGC TGCTCCAGCA 

501 G 

This corresponds to the amino acid sequence <SEQ ID 946; ORF 246.a>: 

a246.pep (partial) 

1 MHGRNGGTQA TVAFVFHQTQ RTCFSNGF.VH ATQTDIGSAV NIAQCFTGEA 

51 GQLVYWR*R CAEVLVEQFA NLFFGFMDCG HHDMGRFFTC HLDDELAQVA 

101 FHRFNAFCFK IMVQLDFLAD HRFAFDHQLA VFGCDDWDD FAGFGRCFRP 

151 VYFYAQLGQV FFQLLQQ 

m246/a246 88.0% identity in 1 50 aa overlap 

10 20 30 40 50 60 

m2 4 6 . pep MHGRYGGTQATVAFV FHQTQKT C FS N GKV Y ATQT DIG S AVN IAQCBTGKAGQLVYI VCQR 

I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I I 1 I I I I I I I I I I I I I I I I I : I I 
a246 MHGRNGGTQATVAFVFHQTQRTCFSNGEVHATQTDIGSAVNIAQCFTGEAGQLVYVVRXR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 4 6 . pep RTEVLVEQFANLFFGFVDSRHHDMGRFFACHLDDELAQVAFYRFNAFCFKIMXQLDFLAD 
: I I I I I I I I I I I I I I : I I I I II I I I : I I I i I I I I I I I I : I i I I I I I I I I I I 1 I I I I 
a24 6 CAEVLVEQFANLFFGFMDCGHHDMGRFFTCHLDDELAQVAFHRFNAFCFK1MVQLDFLAD 

70 80 90 100 110 120 

130 140 150 

m24 6 . pep HRFAFDHQLAVFGCDDWDNLAGFGRGFCP 
M I I I I I I I I I I I I I I I I ]:: I 1 I I i I I 
a24 6 HR FAFDHQLAVFGC D D WDD FAG FGRC FR PV YFYAQLGQVF FQLLQQ 

130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 947>: 

g247 . seq 

1 atgaaacgta aaatgctaaa cgtaccaaag ggcggttatg atggtatgaa 

51 gggttttacc attgttgaat ttctggttgc gggcctgctc agtataattg 

101 tcctgatagc ggtcgtatcg agttacttta catcccggaa attaaatgat 

151 gtggcaaacg agcgtcttgc cattcaacag gatttgcgga atgcggcaac 

201 attaattgtc cgcgatgcaa gaatggcggg gagcttcggt tgtttcaata 

251 tgtccgagca tactaaagac gatattgttg attcaagtaa tcaaactcaa 

301 tctaaccttg caaaacccgg tgccaaacaa gaaaatcccc ttttttcctt 

351 aaaaaggagc ggcatggata aacaactgat tcccgttgct gaatccatag 

401 atattaaata tccgggtttt atccagcgcc ttaacgcart ggttttccaa 

451 tacggtatcg atgatcttga tgcgagcgct gagactgtcg tagccagcag 

501 ctgttccaaa atagcaaaac cgggtaagaa aatatctacc ttgcaagaag 

551 caaagagtgc attacagatt actaatgatg ataaacaaaa tggaaatatc 
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601 acccgtcaga aacatgtggt caatgcctat gcggtcggca ggtttggcaa 

651 taatgaggaa agtttgttcc gcttccaatt ggatgataag ggcaagtggg 

701 gtaatcctca gttgctcgtg aaaaaggtta aacgtatgga tgtgcggtat 

751 atttatgttt ccggttgtcc tgaagatgaa gatgccggca aagaggaaaa 

801 attcagatat acgaataaat tcgacaaatc caaaaatgct gttacgcctg 

851 ccggggtgga ggttttattg gatagcggcc ttaatgccaa gattgccgct 

901 tcttcagaca atagtattta tgcttaccgt atcaatgcga caatacgcgg 

951 gggaaatgta tgcgcaaaca gaacactttg a 

This corresponds to the amino acid sequence <SEQ ID 948; ORF 247. ng>: 

g247 .pep 

i MKRKMLNVPK GGYDGMKGFT IVEFLVAGLL SIIVLIA WS SYFTSRKLND 

51 VANERLAIQQ DLRNAATLIV RDARMAGSFG CFNMSEHTKD DIVDSSNQTQ 

101 SNLAKPGAKQ ENPLFSLKRS GMDKQLIPVA ESIDIKYPGF IQRLNALVFQ 

151 YGIDDLDASA ETVWSSCSK IAKPGKKIST LQEAKSALQI TNDDKQNGNI 

201 TRQKHWNAY AVGRFGNNEE SLFRFQLDDK GKWGNPQLLV KKVKRMDVRY 

251 IYVSGCPEDE DAGKEEKFRY TNKFDKSKNA VTPAGVEVLL DSGLNAKIAA 

301 SSDNSIYAYR INATIRGGNV CANRTL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 949>: 



m2 47 . seq 


(partial) 






1 


ATsAGACGTA 


AAATGCTAAA 


CGTwsyArAA 


51 


AGGTTTTACC 


ATTATTGAAT 


TTTTGGTTGC 


101 


TCCTGATGGC 


GGTCGGATCG 


AGTTACTTCA 


151 


GCGGCAAACG 


AGCGTCTTGC 


CGCGCAACAG 


201 


ATTGATTGTC 


CGCGATGCGA 


GAATGGCAGG 


251 


TGTCCGAGCA 


TCCTGCAACT 


GATGTTATTC 


301 


TCTCCTTTTT 


CCTTAAAAAG 


GAACGGTATA 


351 


GGAATCTTCA 


AAT AT CAATT 


ATCAGAATTT 


401 


TGATTTTTCA 


ATACGGAATC 


GATGATGTTA 


451 


GTCGTCAGCA 


GCTGTGCCGC 


AATATCGAAA 


501 


TTTAGAAGAT 


GCAAAAAAAG 


AATTGAAGAT 


551 


AAAATGGCAA 


TATAGCG CGT 


CAAAGGCATG 


601 


GGCAGGATTG 


CCGATGAGGA 


AAGTTTGTTC 


651 


GGGCAAGTGG 


GGTAATCCTC 


AGTTGC . . . 



This corresponds to the amino acid sequence <SEQ ID 950; ORF 247>: 

m247.pep (partial) 

1 XRRKMLNVXX GSYDGMKGFT IIEFLVAGLL SMIVLMA VGS SYFTSRKLND 

51 AANERLAAQQ DLRNAATLIV RDARMAGGFG CFNMSEKPAT DVI PDTTQQN 

101 SPFSLKRNGI DKLIPIAESS NINYQNFFQV GSALIFQYGI DDVKASTATT 

151 WSSCAAISK PGKQIPTLED AKKELKIPDQ DKEQNGNIAR QRHWNAYAV 

201 GRIADEESLF RFQLDDKGKW GNPQL 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 247 shows 69.3% identity over a 238 aa overlap with a predicted ORF (ORF 247.ng) 
from N. gonorrhoeae: 

m247/g247 

10 20 30 .40 50 60 

m24 7 . pep XRRKMLNVXXGS YDGMKGFTI IEFLVAGLLSMIVLMAVGSSYFTSRKLNDAANERLAAQQ 

'Mini hiiiiiiiihiiuiiiihiihii iMiiimihiimi n 

g247 MKR KMLNV P KGG YDGM KGFT I VE FLVAGLLS 1 1 VL I AWS S YFTSRKLNDV AN ERLAI QQ 

10 20 30 40 50 60 

70 80 90 100 

m247 . pep DLRNAATLI VRDARMAGGFGCFNMSEHPATDVI PDTTQQNSPFSLKRN 

llllllllllllllllhlllllllll h: | : |:| Milh 

g24 7 DLRNAATLIVRDARMAGSFGCFNMSEHTKDDIVDSSNQTOSNLAKPGAKOENPLFSLKRS 

70 80 90 100 110 120 

110 120 130 140 150 160 

m247 .pep GIDK-LIPIAESSNINYQNFFQVGSALIFQYGIDDTOASTATTVVSSC7VAISKPGK0IPT 
I'll I I I = J I I :|'| :hl . I I . I I I I I I h « I I • hllllh hlllhl I 
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g247 GMDKQL I P VAE SIDIKYPGFI QRLNAL V FQYGI DDLD AS AETVWS S CS K I AKPGKK I S T 

130 140 150 160 170 180 

170 180 190 200 210 220 

m24 7 . pep LEDAKKBLKI PDQDKEQNGNIARQRHWNAYAVGRI AD- EES LFRFQLDDKGKWGN PQL 

h'lh hi -M I lllhlhl II ill 1 II h- t I I I I I I 1 I I E I I I 1 ( I I I I 
g24 7 LQE AKS ALQ I TNDD K - QNGN I TRQ KHWNA Y AVG RFGNNE ES L FR FQLDDKGKWGNPQL L 

190 200 210 220 230 

. g24 7 VKKVKRMDVRYIYVSGCPEDEDAGKEEKFRYTNKFDKSKNAVTPAGVEVLLDSGLNAKIA 
240 250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 951>: 

a247 . seq 

1 ATGAGACGTA AAATGCTAAA CGTACCAAAA GGCAATTATG ATGGTATGAA 

51 GGGTTTTACC ATTATTGAAT TTTTGGTTGC GGGCATGCTC AGTATGATTG 

101 TCCTGATGGC GGTCGGATCG AGTTACTTCA CATCCCGGAA ATTAAATGAT 

151 GCGGCAAACG AGCGTCTTTC CGCGCAACAG GATTTGCGGA ATGCGGCAAC 

201 ATTGATTGTC CGCGATGCAA GAATGGCAGG GGGCTTCGGT TGTTTCAATA 

251 TGTCCGAGCA TACTAAAAAT GATATTATTG TTGATCCAAG TAAGCAAACT 

301 CAACATGTCC CTGTAAAACC CGGTGCCAAA CAAGAAAATC CCCTTTTTTC 

351 TTTAGAGTGG GCTAATACTA ATAATACTAA TAATAATACA GCTAAATTGA 

401 TTCCTATTGC TGAATCCACA GATATTAAAT ATCCGGGTTT TGCCCAGGCT 

451 CGTCCGGCAT TGATTTTCCA ATACGGCATC GATGATCTTG ATGCGAGTGC 

501 TGAGACTGTT GTAGTCAGCA GCTGTTCCAA AATAGCAAAA CCGGGTAAGA 

551 AAATATCTAC CTTGCAAGAA GCAAAGAGTG CAT T AC AG AT TACTAATGAT 

601 GATAAACAAA ATGGAAATAT CACCCGTCAA AGGCATGTGG TCAATGCCTA 

651 TGCGGTCGGC AGGATTGCCG GTGAGGAAGG TTTGTTCCGC TTCCAATTGG 

701 ATGATAAGGG CAAGTGGGGT AATCCTCAGT TGCTCGTGAA AAAGATTAGA 

751 CATATGAAAG TGCGGTATAT CTATGTTTCC GACTGTCCTG AAGATGACGA 

801 TGCCGGCAAA GAGGAAAAAT TCAAATATAC GGGTACATTC GACAGCTCCA 

851 CAAATGCTGT TACGCCCGCC GGGGTGGAGG TTTTATTGAG TANCGGTACT 

901 GATACCAAGA TTGCCGCTTC TTCAGACAAT CATATTTATG CTTACCGTAT 

951 CGATGCGACA ATACGCGGGG GAAATGTATG CGCAAACAGA ACACTTTGA 

This corresponds to the amino acid sequence <SEQ ID 952; ORF 247.a>: 

a247 .pep 

1 MRRKMLNVPK GNYDGMKGFT IIEFLVAGML SMIVLMA VGS SYFTSRKLND 

51 AANERLSAQQ DLRNAATLIV RDARMAGGFG CFNMSEHTKN DIIVDPSKQT 

101 QHVPVKPGAK QENPLFSLEW ANTNNTNNNT AKLIPIAEST DIKYPGFAQA 

151 R PALI FQYGI DDLDASAETV WSSCSKIAK PGKKISTLQE AKSALQITND 

201 DKQNGNITRO RHWNAYAVG RIAGEEGLFR FQLDDKGKWG NPQT.LVKKIR 

251 HMKVRYIYVS DCPEDDDAGK EEKFKYTGTF DSS7NAVTPA GVEVLLSXGT 

301 DTKIAASSDN HIYAYRIDAT IRGGNVCANR TL* 

m247/a247 70.9% identity in 244 aa overlap 

10 20 30 40 50 60 

m247 .pep XRRKMLNVXXGSYIXSMKGFTIIEFLVAGLLSMIVLMAVGSSYHTSRKLNDAANERLAAQQ 
I I I I I I I I : I I I I I I I I I I I I I I I I : I i I I t I I t i I I I I I I I I I i I I I I I I I : I I I 
a247 MRRKMLNVPKGNYDGMKGFTIIEFLVAGMLSMIVLMAVGSSYFTSRKLNDAANERLSAQQ 

10 20 30 40 50 60 

70 80 90 100 

m247 . pep DLRNAATLIVRDARMAGGFGCFNMSEHPATDVI PDTTQQNS PFSLK- 

I I I I I I I I I 1 I I 1 II II I I I I I I I I I I : I : I 1:1:1 III: 

a247 DLRNAATLIVRDARMAGGFGCFNMSEHTKNDIIVDPSKQTQHVPVKPGAKQENPLFSLEW 

70 80 90 100 110 120 

110 120 130 140 150 160 

m247.pep RNG IDKLI PI AES SNI N YQN FFQVGSALI FQYG I DDVNASTATT WS SCAAI SK 

I : I I I I I I I I :: I : I : I I : I I I I I I I I I I : : ! I : I : I I I I I : I : I 
a247 ANTNNTNNNTAKLIPIAESTDIKYPGFAQARPALIFQYGIDDLDASAETVWSSCSKIAK 

130 140 150 150 170 180 
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170 180 190 200 210 220 

m247 . pep PGKQIPTLEDAKKELKIPDQDKEQNGNIARQRH'/VNAYAVGRIADEF.ST.FRFQT.DOKGKW 
111:1 II:: I h hi ::ll I I I I I : I I I I II I I I I I I 1 I I I |: I II I I I I I I I I I 
a24 7 PGKKISTLQEAKSALQITNDDK-QNGNITRQRH^/VNAYAVGRIAGEEGLFRFQLDDKGKW 

190 200 210 220 230 



m247.pep GNPQL 

mil' 

a247 GNPQLLVKKIRHMKVRYIYVSDCPEDDDAGKEEKFKYTGTFDSSTNAVTPAGVEVLLSXG 
240 250 260 270 280 290 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 953>: 

g247-l.eeq (partial) 

1 CCCGGTGCCA AACAAGAAAA TCCCCTTTTT TCCTTAAAAA GGAGCGGCAT 

51 GGATAAACAA CTGATTCCCG TTGCTGAATC CATAGATATT AAATATCCGG 

101 GTTTTATCCA GCGCCTTAAC GCATTGGTTT TCCAATACGG TATCGATGAT 

151 CTTGATGCGA GTGCTGAGAC TGTTGTAGTC AGCAGCTGTT CCAAAATAGC 

201 AAAACCGGGT AAGAAAATAT CTACCTTGCA AGAAGCAAAG AGTGCATTAC 

251 AGATTACTAA TGATGATAAA CAAAATGGAA ATATCACCCG TCAGAAACAT 

301 GTGGTCAATG CCTATGCGGT CGGCAGGTTT GGCAATAATG AGGAAAGTTT 

351 GTTCCGCTTC CAATTGGATG ATAAGGGCAA GTGGGGTAAT CCTCAGTTGC 

401 TCGTGAAAAA GGTTAAACGT A7GGATGTGC GGTATATTTA TGTTTCCGGT 

451 TGTCCTGAAG ATGAAGATGC CGGCAAAGAG GAAAAATTCA GATATACGAA 

501 TAAATTCGAC AAATCCAAAA A?GCTGTTAC GCCTGCCGGG GTGGAGGTTT 

551 TATTGGATAG CGGCCTTAAT GCCAAGATTG CCGCTTCTTC AGACAATAGT 

601 ATTTATGCTT ACCGTATCAA TGCGACAATA CGCGGGGGAA ATGTATGCGC 

651 AAACAGAACA CTTTGA 

This corresponds to the amino acid sequence <SEQ ID 954; ORF 247-1 .ng>: 

g247-l.pep (partial) 

1 PGAKQENPLF SLKRSGMDKQ LIPVAESIDI KYPGFIQRLN ALVFQYGIDD 

51 LDASAETWV SSCSKIAKPG KKISTLQEAK SALQITNDDK QNGNITRQKH 

101 VVNAYAVGRF GNNEESLFRF QLDDKGKWGN PQLLVKKVKR MDVRYIYVSG 

151 CPEDEDAGKE EKFRYTNKFD KSKNAVTPAG VEVLLDSGLN AKIAASSDNS 

201 IYAYRINATI RGGNVCANRT L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 955>: 

m247-l. seq 

1 ATGAGACGTA AAATGCTAAA CGTACCAAAA GGCAGTTATG ATGGTATGAA 

51 AGGTTTTACC ATTATTGAAT TTTTGGTTGC GGGCCTGCTC AGTATGATTG 

101 TCCTGATGGC GGTCGGATCG AGTTACTTCA CATCCCGGAA ATTAAATGA? 

151 GCGGCAAACG AGCGTCTTGC CGCGCAACAG GATTTGCGGA ATGCGGCAAC 

201 ATTGATTGTC CGCGATGCGA GAATGGCAGG CGGCTTCGGT TGTTTCAATA 

251 TGTCCGAGCA TCCTGCAACT GATGTTATTC CCGATACGAC GCAACAAAA? 

301 TCTCCTTTTT CCTTAAAAAG GAACGGTATA GATAAACTTA TTCCCATAGC 

351 GGAATCTTCA AATATCAATT ATCAGAATTT TTTCCAGGTT GGTAGCGCA? 

401 TGATTTTTCA ATACGGAATC GATGATGTTA ATGCAAGCAC CGCGACTACC 

451 GTCGTCAGCA GCTGTGCCGC AATATCGAAA CCGGGCAAGC AAATCCCTAC 

501 TTTAGAAGAT GCAAAAAAAG AATTGAAGAT TCCGGATCAG GATAAGGAGC 

551 AAAATGGCAA TATAGCGCGT CAAAGGCATG TGGTCAATGC CTATGCGGTC 

601 GGCAGGATTG CCGATGAGGA AGGTTTGTTC CGCTTCCAAT TGGATGATAA 

651 GGGCAAGTGG GGTAATCCTC AGTTGCTCGT GAAAAAGGTT AGACATATGA 

701 AAGTGCGGTA TATCTATGTT TCCGGCTGTC CTGAAGATGA CGATGCCGGC 

"751 AAAGAGGAAA CATTCAAATA TACGGATAAA TTCGACAGCG CCCAAAATGC 

801 TGTTACGCCC GCCGGGGTGG AGGTTTTATT GAGTAGCGGT ACTGATACCA 

851 AGATTGCCGC TTCTTCAGAC AATCATATTT ATGCTTACCG TATCGATGCG 

901 ACAATACGCG GGGGAAATGT ATGCGCAAAC AGAACACTTT GA 

This corresponds to the amino acid sequence <SEQ ID 956; ORF 247- 1>: 

n247-l.pop 

1 MRRKMLNVPK GSYDGMKGFT IIEFLVAGLL SMIVLMAVGS SYFTSRKLKD 

51 AANERLAAQQ DLRNAATLIV RDARMAGGFG CFNMSEHPAT DVIPDTTQQIi 

101 SPFSLKRNGI DKLIPIAESS NINYQNFFQV GSALIFQYGI DDVNASTAT? 

151 WSSCAAISK PGKQI PTLED AEKELKIPDQ DKEQNGNIAR QRHWNAYAV 

201 GRIADEEGLF RFQLDDKGKW GNPOLLVKKV RHMKVRYIYV SGCPEDDDAG 

251 KEETFKYTDK FDSAQNAVTP AGVEVLLSSG TDTK1AASSD NHIYAYRIDA 

301 TIRGGNVCAN RTL* 
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m247-l / g247-l 72.1% identity in 222 aa overlap 

70 80 SO 100 110 120 

m247-l.pep NAATLIVRDARMAGGFGCFNMSEHPATDVI PDTTQQNSPFSLKRNGIDK-LIPIAESSNI 

I : hi lllll:|:|l 111 = 111 :| 
q 2 4 7 - 1 PGAKQEN PLFS LKRSGMDKQLI PVAE SIDI 

10 20 30 



130 140 150 160 170 180 

m24 7- 1 . pep NYQNFFQVGSALI FQYGI DDVNASTATTWSSCAAISKPGKOIPTLEDAKKELKI PDODK 
:l : I I : I 1 1 1 1 I I : : I 1 : 1:11111: l:||ll:l 1:1 ::ll 

g247-l KYPGFIQRLNALVFQYGIDDLDASAETWVSSCSKIAKPGKKISTLQEAKSALQITNDDK 
40 50 60 70 80 90 



190 200 210 220 230 240 

m24 7- 1 . pep EQNGNIARQRHWNAYAVGRIAD-EEGLFRFQLDDKGKWGNPQLLVKKVRHMKVRYI YVS 
I I I I I : I I : I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I I I I : : I I I 1 I I I I 
g247-l -QNGNITRQKHWNAYAVGRFGNNEESLFRFQLDDKGKWGNPQLLVKKVKRMDVRYIYVS 
100 110 120 130 140 



250 260 270 280 290 300 

m247-l. pep GCPEDDDAGKEETFKYTDKFDSAQNAVTPAGVEVLLSSGTDTKIAASSDNHIYAYRIDAT 
I I 1 I I : I 1 I I I I I : I I : I I : : : I II I I I I I I I I I : I I : : 1 1 I I I I 1 1 111111:11 
g247-l GCPEDEDAGKEEKFRYTNKFDKSKNAVTPAGVEVLLDSGLNAKIAASSDNSIYAYRINAT 
150 160 170 180 190 200 



310 

m247-l.pep I RGGNVCANRTLX 
II I! Illllllll 
g2 4 7- 1 I RGGNVCANRTLX 

210 220 



The following partial DNA sequence was identified in N. me?iingitidis <SEQ ID 957>: 

a247-l.aeq (partial) 

1 AATAATACAG CTAAATTGAT TCCTATTGCT GAATCCACAG ATATTAAATA 

51 TCCGGGTTTT GCCCAGGCTC GTCCGGCATT GATTTTCCAA TACGGCATCG 

101 ATGATCTTGA TGCGAGTGCT GAGACTGTTG TAGTCAGCAG CTGTTCCAAA 

151 ATAGCAAAAC CGGGTAAGAA AATATCTACC TTGCAAGAAG CAAA3AGTGC 

201 ATTACAGATT ACTAATGATG ATAAACAAAA TGGAAATATC ACCCGTCAAA 

251 GGCATGTGGT CAATGCCTAT GCGGTCGGCA GGATTGCCGG TGAGGAAGG? 

301 TTGTTCCGCT TCCAATTGGA TGATAAGGGC AAGTGGGGTA ATCCTCAGT? 

351 GCTCGTGAAA AAGATTAGAC ATATGAAAGT GCGGTATATC TATGTTTCCG 

401 ACTGTCCTGA AGATGACGAT GCCGGCAAAG AGGAAAAATT CAAATATACG 

451 GGTACATTCG ACAGCTCCAC AAATGCTGTT ACGCCCGCCG GGGT3GAGG7 

501 TTTATTGAGT AGCGGTACTG ATACCAAGAT TGCCGCTTCT TCAGACAATC 

551 ATATTTATGC TTACCGTATC GATGCGACAA TACGCGGGGG AAATGTATGC 

601 GCAAACAGAA CACTTTGA 

This corresponds to the amino acid sequence <SEQ ID 958; ORF 247-1 .a>: 

a247-l .pep (partial) . . 

1 NNTAKLIPIA ESTDIKYPGF AQARPALIFQ YGIDDLDASA ETVVVSSCSK 
51 IAKPGKKIST LQEAKSALQI TNDDKQNGNI TRQRHVVNAY AVGRIAGEEG 
101 LFRFQLDDKG KWGNPQLLVK KIRHMKVRYI YVSDCPEDDD AGKEEKFKYT 
151 GTFDSSTNAV TPAGVEVLLS SGTDTKIAAS SDNHIYAYRI DATIRGGNVC 
201 ANRTIi* 



m247-l / a247-l 80.6% identity in 206 aa overlap 



10 20 30 

a247-l .pep NNTAKLI PI AESTDI KYPGFAQARPALI FQ 

I: lllllll l::|:l :| |: INN 
m247-l GFGCFNMSEHPATDVIPDTTQQNSPFSLKRNGIDKLIPIAESSNINYQNFFQVGSALIFQ 
80 90 100 110 120 130 



40 50 60 70 80 89 

a247-l.pep YGIDDLDASAETVVVSSCSKIAKPGKKISTLQEAKSALQITNDDK-QNGNrTRQRKWNA 

lllll::ll: IMIill: ;:llll:l lt::ll: 1:1 ::|| 11111:11111111 
m247-l YGI DDVNASTATT VVSSC AAI SKPGKQI PTLEDAKKELKI PDQDKEQNGN I ARQRK WNA 

140 150 160 170 180 190 



a247-l .pep 



90 100 110 120 130 140 149 
YAVGRIAGEEGLFRFQLDDKGKWGNPQLLVKKIRHMKVRYIYVSDCPEDDDAGKEEKFKY 
lllllll Mil I 1 I II I M f M I : I I I I M II lllllll III 
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m247-l YAVGRIADEEGLFRFQLDDKGKWGNPQLLVKKVRHMKVRYIYVSGCPEDDDAGKEETFKY 
200 210 220 230 240 250 

150 160 170 180 190 200 

a247-l . pep TGT FDS S TN AVTP AGVEVLL S S GTDTK I AA S S DNH I Y AY R I DAT I RGGN VC AN RTLX 
I III: I I I I I I I I I I t I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I : II I I I I 
m247-l TDKFDSAQNAVTPAGVEVLLSSGTDTKIAASSDNHIYAYRIDATIRGGNVCANRTLX" 
260 270 280 290 300 310 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 959>: 

g248 . seq 

1 atgcgcaaac agaacacttt gacaggaatc ccgacttctg acggacagag 

51 ggggtccgca ctgtttatcg tgctgatggt gatgatagtc gtggcctttt 

101 tggttgtaac tgccgcccag tcctacaata ccgaacagag gatcagtgcc 

151 aacgaatcag acaggaaatt ggctttgtct ttagccgagg cggctttgcg 

201 ggagggcgaa tttcaggttt tggatttgga atatgctgcg gacagtaagg 

251 ttacgtttag cgaaaactgt gaaaaaggtc tgtgtaccgc agtgaatgtg 

301 cggacaaata ataatggtag tgaagaggct tttggcaata tcgtggtgca 

351 aggcaagccc gccgttgagg cggtgaaacg ttcttgccct gcaaagtctg 

401 gcaaaaattc taccgacctg tgcattgaca ataaagggat ggaatataat 

451 aaaggcgcgg caggcgtcag caaaatgccg cgctatatta tcgaatattt 

501 aggcgtgaag aacggacaaa atgtttatcg ggttactgcc aaggcttggg 

551 gtaagaatgc caataccgtg gtcgtccttc aatcttatgt aggcaataat 

601 gatgagcaat aa 

This corresponds to the amino acid sequence <SEQ ID 960; ORF 248.ng>: 
g248 .pep 

1 MRKQNTLTGI PTSDGQRGS A LFIVLMVMIV VAFLW TAAQ SYNTEQRISA 
51 NESDRKLALS LAEAALREGE FQVLDLEYAA DSKVTFSENC EKGLCTAVNV 
101 RTNNNGSEEA FGN1WQGKP AVEAVKRSCP AKSGKNSTDL C I DNKGMEYN 
151 KGAAGVS KM P RYIIEYLGVK NGQNVYRVTA KAWGKNANTV WLQSYVGNN 
201 DEQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 96 1>: 

m248.seq (partial) 

1 . .GGGTTTGCAC TGTTAATCGT GCTGATGGTG ATrATCGTCG TGGCT . TywT 

51 gGwTGTAACT GCCGCGCAGT CTTACAATAC cGAGCAGCGk ATCAGTkCCA 

101 ACGAATCAGA CAGGAAATTG GCTwTGTCTT TGGCCGAGkC GkdVTGCGG 

151 GAAGGCGAAC TTCAGGTTTT GGAT TTGGAA TATGATACGG ACAGTAAGGT 

201 TACATTTAGC GAAAACTGTG GAAAAGGTCT GTsTGCCGCA GTGAATGTGC 

251 GGACAAATAA TGATAATGAA GAGGCTTTTG ACAATATCGT GGTGCAAGGC 

301 AAGCCCACCG TTGAGGCGGT GAAGCGTTCT TGCCCTGCAA ATTCTAC CGA 

351 CCTGTGCATT GACAAGAAAG GGwTGGAATA TAAGAAAGGC ACGAGAAGCG 

401 TCAC . AAAAT GCCACGTTAT ATTATCGAAT ATTTGGGCGT GwAGAACGGA 

451 GAAAATGTTT ATCGGGTTAC TGCCAAGGCT TGGGGtAAGA ATGCCAATAC 

501 CGTGGTCGTC CTTCAATCTT ATGTAAGCAA TAATGATGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 962; ORF 248>: 

m248 .pep (partial) 

1 . . G FALLIVLMV XIWAFXXVT AAQSYNTEQR ISXNESDRKL AXSLAEXXXR 

51 EGELQVLDLE YDTDSKVTFS ENCGKGLXAA VNVRTNNDNE EAFDNIWQG 

101 KPTVEAVKRS CPANSTDLCI DKKGXEYKKG TRSVTKMPRY IIEYLGVXNG 

151 ENVYRVTAKA WGKNANTWV LQSYVSNNDE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 248 shows 81.1% identity over a 1 85 aa overlap with a predicted ORF (ORF 248.ng) 
from N. gonorrhoeae: 

m248/g248 

10 20 30 40 

ra248 .pep GFALL I VLMVX I WAFXXVTAAOS YNTEQR I SXNESDRKLAXS 

II hi II MM III MM III I II I III llllll II I 
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g248 



MRKQNTLTG I PTSDGQRGS ALFI VLMVM I VVAFLVVTAAQS YNTEQRI S ANESDRKLALS 
10 20 30 40 50 60 



50 60 70 80 90 100 

m248 . pep LAEXXXREGELQVLDLEYDTDSKVTFSENCGKGL^ 

III lllhlllllll :|IMIIIIII III :IHIIII|: =1111 IIIIIMI 
g248 IjAEAAIjREGEFQVLDLEYAADSKVTFSENCEKGLCTAVIT^ 

70 80 90 100 110 120 

110 120 130 140 150 

m248 .pep TVEAVKRSCPA NSTDLCI DKKGXEYKKGTRSVTKMPRYI I EYLGVXNGENVYRVTA 

*■ I I I I I I I ill I I I I I I I I • I I 1 I • I I • H • I II I I I I I II I I I I • I I I I I I I 

g248 AVEAVKRS CPAKSGKNSTDLCI DNKGME YNKGAAGVS KMPRY 1 1 E YLGVKNGQNVYRVTA 

130 140 150 160 170 180 

160 170 180 

m24 8 . pep KAWGKNANTVWLQS YVSNNDEX 

III llllllll II lllhl I II 

g248 KAWGKNANTVWLQSYVGNNDEQX 
190 200 



The following partial DNA sequence was 

a248 . seq 

1 ATGCGCAAAC AGAACACTTT GACGGGAATC 

51 GGGGTTTGCA CTGTTTATCG TGCTGATGGT 

101 TGGTTGTAAC TGCCGCGCAG TCTTACAATA 

151 AACGAATCAG ACAGGAAATT GGCTTTGTCT 

201 GGAAGGCGAA CTTCAGGTTT TGGATTTGGA 

251 TTACATTTAG CGAAAACTGT GGAAAAGGTC 

301 CGGACAAATA ATGATAATGA AGAGGCTTTT 

351 CAAGCCCACC GTTGAGGCGG TGAAGCGTTC 

401 GCCTGTGCAT TGACAATAAA GGGATGGAAT 

451 GTCAGCAAAA TGCCACGTTA TATTATCGAA 

501 AGAAAATGTT TATCGGGTTA CTGCCAAGGC 

551 CCGTGGTCGT CCTTCAATCT TAT GTAAGCA 



identified in N. meningitidis <SEQ ID 963>: 



CCGACTTCTG 
GATGATCGTC 
CCGAGCAGCG 
TTGGCCGAGG 
ATATGATACG 
TGTGTACCGC 
GACAATATCG 
TTGCACTGCA 
ATAAGAAAGG 
TATTTGGGCG 
TTGGGGTAAG 
ATAATGATGA 



ACGGACAGAG 
GTGGCTTTTT 
GATCAGTGCC 
CGGCTTTGCG 
GACAGTAAGG 
AGTGAATGTG 
TGGTGCAAGG 
AAATCTACAG 
CACGCAAAGC 
TGAAGAACGG 
AATGCCAATA 
GTAA 



This corresponds to the amino acid sequence <SEQ ID 964; ORF 248.a>: 

a248 .pep 

1 MRKQNTLTG I PTSDGQRGF A LFIVLMVMIV VAFLW TAAQ SYNTEQRISA 

51 NESDRKLALS LAEAALREGE LQVLDLEYDT DSKVTFSENC GKGLCTAVNV 

101 RTNNDNEEAF DNIVVQGKPT VEAVKRSCTA KSTGLCIDNK GMEYKKGTQS 

151 VSKMPRYIIE YLGVKNGENV YRVTAKAWGK NANTVWLQS YVSNNDE * 

m248/a248 89.4% identity in 1 80 aa overlap 

10 20 30 40 

m2 4 8 . pep GFALLIVLMVXIWAFXXVTAAQS YNTEQRI SXNE S DRKLAXS 

I I I I: I I i I I I I I I I I I I I I I I I I I I I I I I I M I ! I I I 
a248 MRKQNTLTGIPTS DGQRG FAL F I VLMVM I W AFLWT AAQ S YNT EQR I SANE S DRKLALS 

10 20 30 40 50 60 



m248 .pep 
a248 

m248 .pep 
a248 

m248 .pep 



50 60 70 80 90 100 

LAEXXXREGELQVLDLEYDTDSKVTFSENCGKGLXAAVNVRTNNDNEEAFDNIWQGKPT 
ill I I I I I I I I I I M I i I I I I I II I I I I I I I : I I I I I ! I I II I I I I ! I I I I I I I I I 
LAEAALREGELQVLDLEYDTDSKVTFSENCGKGLCTAVWVRTNNDNEEAFDNIVVQGKPT 
70 80 90 100 110 120 

110 120 130 140 150 160 

VEAVKRSC PANSTDLC I DKKGXE YKKGTRS VTKMPRY I IE YLGVXNGENVYRVTAKAWGK 
I I I I I I I I I : I I 1111:11 I I I I f I : I I : I I I I I III I I I I I I I I I I I I I I I I I II 
VEAVKRSCTAKSTGLCIDNKGMEYKKGTQSVSKNPRYIIEYLGVKNGENVYRVTAKAWGK 
130 140 150 160 170 180 

170 180 
NANTVWLQS YVSNNDEX 
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1 1 1 1 1 1 1 1 1 1 II 1 1 ! 1 1 1 

a24 8 NANT WVLQS YVSNNDEX 

190 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 965>: 



m24fi-l . seq 

1 ATGCGCAAAC 



51 
101 
151 
201 
251 
301 
351 
401 



GGGGTTTGCA 
TGGTTGTAAC 
AACGAATCAG 
GGAAGGCGAA 
TT AC ATT TAG 
CGGACAAATA 
CAAGCCCACC 
ACCTGTGCAT 
451 GTCAGCAAAA 
501 AGAAAATGTT 
551 CCGTGGTCGT 



AGAACACTTT 
CTGTTTATCG 
TGCCGCGCAG 
ACAGGAAATT 
CTTCAGGTTT 
CGAAAACTGT 
ATGATAATGA 
GTTGAGGCGG 
TGACAAGAAA 
TGCCACGTTA 
TATCGGGTTA 
CCTTCAATCT 



GACGGGAATC 
TGCTGATGGT 
TCTTACAATA 
GGCTTTGTCT 
TGGATTTGGA 
GGAAAAGGTC 
AGAGGCTTTT 
TGAAGCGTTC 
GGGATGGAAT 
TATTATCGAA 
CTGCCAAGGC 
TATGTAAGCA 



CCGACTTCTG 
GATGATCGTC 
CCGAGCAGCG 
TTGGCCGAGG 
ATATGATACG 
TGTGTGCCGC 
GACAATATCG 
TTGCCCTGCA 
ATAAGAAAGG 
TATTTGGGCG 
TTGGGGTAAG 
ATAATGATGA 



ACGGACAGAG 
GTGGCTTTTT 
GATCAGTGCC 
CGGCTTTGCG 
GACAGTAAGG 
AGTGAATGTG 
TGGTGCAAGG 
AATTCTACCG 
CACGAGAAGC 
TGAAGAACGG 
AATGCCAATA 
GTAA 



This corresponds to the amino acid sequence <SEQ ID 966; ORF 248- 1>: 

m248-l .pep 

1 MRKQNTLTGI PTSDGQRG FA LriVLMVMIV VAFLVV TAAQ SYNTEQRISA 
51 KESDRKLALS LAEAALREGE LQVLDLEYDT DSKVTFSENC GKGLCAAVNV 
101 RTNNDNEEAF DNIVVQGKPT VEAVKRSCPA NSTDLCIDKK GMEYKKGTRS 
151 VSKMPRYIIE YLGVKNGENV YRVTAKAWGK NANTVWLQS YVSNNDE* 

m248-l/g248 89.1% identity in 202 aa overlap 

10 20 30 40 50 60 

m248-l . pep MRKQNTLTGIPTSDGQRGFALFIVLMVMIWAFLWTAAQSYNTEQRISANESDRKLALS 

1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 mmMiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiii 

g248 MRKQNTLTG I PT S DGQRGS ALFI VLMVMI VVAFLWT AAQS YNTEQRI S ANES DRKLALS 

10 20 30 40 50 60 

70 80 90 100 110 119 

m24 8- 1 . pep LAEAALREGELOVLDLEYDTDSKVTFSENCGKGLCAAVNVRTNND-NEEAFDNI WQGKP 
I I I I I I I I I I : I I I I I I I tllllllllll I II 1:1 llll Ml: :tltl lllllill 
g248 LAEAALREGEFQVLDLEYAADSKVTFSENCEKGLCTAVNVRTNNNGSEEAFGNIWOGKP 

70 80 90 100 110 120 

120 130 140 150 160 170 

m24 8-1 . pep TVEAVKRSCPA NSTDLCI DKKGMEYKKGTRS VSKMPRYI I EYLGVKNGENVYRVTA 

: I I I I I I I I I I I I I I I ! I I : I I I I i : I I : : I I I I I I I I I I I I I I I I I : I I I I I I I 
g248 AVEAVKRSCPAKSGKNSTDLCI DNKGMEYNKGAAGVSKMPRYI I EYLGVKNGQNVYRVTA 

130 140 150 160 170 180 



180 190 
m24 8-1 . pep KAWGKNANT WVLQS YVSNN DEX 

I I I I I I I I I I I I I I I II: I t : I 
g24 8 KAWGKHANT WVLQS YVGNNDEQX 

190 200 

m248-l/a248 97.0% identity in 197 aa overlap 

10 20 30 40 50 60 

m2 4 8-1 .pep MRKQN TLTG I PT SDGQRGFALFIVLMVM I WAFLWTAAQSYNTE2R1S ANES DRKLALS 
I I I t I I I I I I I I I I I I I I I I I I 1 ! I ! I I I 1 I I I I I I I I I I I I I II I 1 I I I I I I I I I I I I I 
a248 MRKQNTLTGI PT SDGQRGFALFIVLMVM I VVAFLWTAAQS YNTEQRI SAKESDRKLALS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m24 8- 1 . pep LAEAALREGELQVL D LEY DT DS KVT FS ENC GKGLC AA VN VRTNNDN EEAFDN I WQGK PT 
I I I I I I I I I I I I I I I I I I I I I I ! I I It I I I I I I I !: 1 I 1 I I I I I I I I I I I I I I I I I I I ! I 
a24 8 LAEAALREGELQVLDLEYDTDSKVTFSENCGKGLCTAVNVRTNNDNEEAFDNI WQGKPT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m24 8- 1 . pep VEAVKRSC PAN STDLCI DKKGMEYKKGT RS VS KMP RY 1 1 E YLGVKNGEK VY RVTAK AWGK 
( i I I I t 1 1 I : I I I I M : I i I I 1 1 I II : I I I I I I I I I I i I I I 1 I I I I I I . I I I I I I I I I 
a248 VEAVKRSCTAKSTGLCIDNKGMEYKKGTQSVSKMPRYIIEYLGVKNGENVYRVTAKAWGK 

130 140 150 160 170 180 
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190 

m248-l.pep NRNTVWLQSYVSNNDEX 
I I I I I ! I I I I I I II I I I I 
a248 N ANT WVLQS YVS NNDEX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 967>: 

g249 .seq 

1 atgaagaata atgattgctt gcgcctgaaa aatccccagt ccggtatggc 

51 gttgatagaa gtcttggtcg ctatgctcgt tctgaccatc ggtattttgg 

101 cattgctgtc cgtacagttg cggacagtcg cttccgtcag ggaggcggaa 

151 acgcaaacca tcgtcagcca aatcacgcaa aacctgatgg aaggaatgtt 

201 gatgaatccg accattgatt tggacagcaa caagaaaaac tatagtcttt 

251 acatgggaaa acagacacta tcagctgtgg atggtgagtt tafcgcttgat 

3 01 gccgagaaaa gtaaggcgca gttggcagag gaacaattga agagatttag 

3 51 trcatgagctg aaaaatgcct tgccggatgc ggtagctatt cactacgccg 
401 tctgcaagga ttcgtcgggt gacgcgccga cattgtccga cagcggtgct 
451 ttttcttcaa attgcgacaa taaggcaaac ggggatactt tgattaaagt 
501 attgtgggta aatgattcgg caggggattc ggatatttcc cgtacgaatc 
551 ttgaagtgag cggcgacaat atcgtatata cctatcaggc aagggtcgga 
601 ggtcgtgaat ga 

This corresponds to the amino acid sequence <SEQ ID 968; ORF 249.ng>: 

g249.pep 

1 MKNNDCLRLK NPQSGMALIE VLVAMLVLTI GILALLSV QL RTVASVREAE 

51 TQTIVSOITQ NLMEGMLMNP TIDLDSNKKN YSLYMGKQTL SAVDGEFMLD 

101 AEKSKAQLAE EQLKRFSHBL KNALPDAVAI HYAVCKDSSG DAPTLSDSGA 

151 FSSNCDNKAN GDTLIKVLWV NDSAGDSDIS RTNLEVSGDN IVYTYQARVG 

201 GRE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 969>: 

m24 9 .seq 

1 ATGAAGAATA ATGATTGCTT CCGCCTGAAA GATTCCCAGT CCGGTATGGC 

51 GCTGATAGAA GTCTTGGTTG CTATGCTCGT TCTGACCATC GGTATTTTGG 

101 CACTATTGTC TGTACAGTTG CGGACAGTCN NNNNNNNNNN NNNNNNNNNN 

151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNTTGATGG AGGGAATGTT 

201 GATGAATCCG ACCATTGATT CGGACAGCAA CAAGAAAAAC TATAATCTTT 

251 ACATGGGAAA CCATACACTA TCAGCTGTGG ATGGCGATTT TGCGATTGAT 

301 GCCATGAAAA CTAAGGGGCA ATTGGCAGAG GCACAATTGA AGAGATTTAG 

351 TTATGAGCTG AAAAATGCCT TGCCGGATGC GGCAGCCA7C CATTACGCCG 

4 01 TCTGCAAGGA TTCGTCGGGT AACGCGCCGA CATTGTCCGG CAATGCTTTT 
451 TCTTCAAATT GCGACAATAA GGCAAACGGG GATACTTTAA TTAAAGTATT 
501 GTGGGTAAAT GATTCGGCAG GGGATTCGGA TATTTCCCGT ACGAATCTTG 
551 AGGTGAGCGG CGACAATATC GTATATACTT ATCAGGCAAG GGTCGGAGGT 
601 CGGGAATGA 

This corresponds to the amino acid sequence <SEQ ID 970; ORF 249>: 

m24 9 -pep 

1 MKNNDCFRLK DSQSGMALIE V LVAMLVLTI GILALLSV QL RTVXXXXXXX 
51 XXXXXXXXXX XLMEGMLMNP TIDSDSNKKN YNLYMGNHTL SAVDGDFAID 
101 AMKTKGQLAE AQLKRFSYEL KNALPDAAAI HYAVCKDSSG NAPTLSGNAF 
151 SSNCDNKANG DTLIKVLWVN DSAGDSDISR TNLEVSGDNI VYTYQARVGG 
201 RE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 249 shows 81.3% identity over a 203 aa overlap with a predicted ORF (ORF 249.ng) 
from N. gonorrhoeae: 

m24 9/g249 

10 20 30 40 50 60 

m24 9 . pep MKNNDCFRLKDSQSGMAL I EVLVAMLVLTI GI LALLSVQLRTVXXXXXXXXXXXXXXX3CX 

lllllhllh IIIIIIIIMIIIIMIflllllllMIIII : : 
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g249 MKNNDCLRLKNPQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETOTIVSQITQ 
10 20 30 40 50 60 



70 80 90 100 110 120 

m249.pep XLMEGMLMNPTIDSDSNKKNYNLY^ 

IMIIIMIIII llllllhllll-llllllhl :|| hhllll MINIMI 
g24 9 ^^J1EGMIJMNPTIDLDSNKKNYSLYMGKQTLSAVDGEFMLDAEKSKAQIAEEQLKRFSHEL 
70 80 90 100 110 120 



130 140 150 160 170 179 

m24 9 . pep KNALPDAAAI H YAVCKDS SGNAPTLSGN - AFSSNCDNKANGDTL I KVLWVNDS AGDSDI S 

MNMIMIMMIMIIIMMM : lllimilllllllllllllllllMIIII 

g24 9 KNALPDAVA I H YAVCKDS SGDAPTLSDSGAFSSNCDNKANGDTL I KVLWVNDS AGDSDI S 

130 140 150 160 170 180 



180 190 200 

m2 4 9 . pep RTNLEVSGDNI VYTYQARVGGREX 

illinium minium ii 

g24 9 RTNLEVSGDNIVYTYQARVGGREX 
190 200 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 97 1>: 

a249.seq 

1 ATGAAGAATA ATGATTGCTT CCGCCTGAAA AACCCCCAGT CCGGTATGGC 

51 GCTGATAGAA GTCTTGGTCG CTATGCTCGT TCTGACCATC GGTATTTTGG 

101 CACTATTGTC TGTTCAGTTG CGGACAGTCG CTTCCGTCAG GGAGGCAGAG 

151 ACGCAAACCA TCGTCAGTCA AATCACGCAA AACCTGATSG AAGGAATGTT 

201 GATGAATCCG ACCATTGATT CGGACAGCAA CAAGAAAAAC TATAATCTTT 

251 ACATGGGAAA CCATCATGCA CTATCAGTTG TGGATGGCGA TTTTCAGGTT 

301 GATGCCATAA AAACTAAGAC GCAGTTGGCA GAGGCACAAT TGAAGAGATT 

351 T AG T TAT GAG CTGAAAAATG CCTTGCCGGA TGCGGCAGCC ATCCATTACG 

401 CCGTCTGCAA GGATTCGTCG GGTGTTGCGC CGACATTGTC CGCCGGCAGT 

451 ACTTTTTCTT CAAATTGCGA TGGTAGTGCA AATGGGGATA CTTTGATTAA 

501 AGTATTGTGG GTAAATGATT CGGCAGGGGA TTCGGATATC GCCCGTACGA 

551 ATCTTGAGAC GAACGGCAAC AATATCGTAT ATACCTATCA GGCAAGGGTC 

601 GGAGGTCGGG AATGA 



This corresponds to the amino acid sequence <SEQ ID 972; ORF 249.a>: 

a24 9. pep 

1 MKNNDCFRLK NPQSGMALIE V LVAMLVLTI GILALLSV QL RTVASVREAE 

51 TQTIVSQITQ NLMEGMLMNP TIDSDSNKKN YNLYMGNHHA LSVVDGDFQV 

101 DAIKTKTQLA EAQLKRFSYE LKNALPDAAA IHYAVCKDSS GVAPTLSAGS 

151 TFSSNCDGSA NGDTLIKVLW VNDSAGDSDI ARTNLETNGN NIVYTYQARV 

201 GGRE* 

m249/a249 8 1 .9% identity in 204 aa overlap 

10 20 30 40 50 60 

m24 9 . pep MKNNDCFRLKDSQSGMALIEVLVA^VLTIGIIAIASVQLRTVXXXXXXXXXXXXXXXXX 

lllinilll: I t I I I t I I I I I I I I I I f I t 1 1 t I I I J I I I I I : : : 

a24 9 MKNNDCFRLKNPQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITC 

10 20 30 40 50 60 



70 80 90 100 110 119 

m249.pep XLMEGMLMNPTIDSDSNKKNYNLYMGNH-TLSAVDGDFAIDAMKTKGQLAEAQLKRFSYE 
I II I I I I I I I I I I I I I I I I ! I I I I I I : I I: I I I I I : I I : I t I I I I I I I I I I I I I I 
a24 9 NLMEGMLMNPTZDSDSNKKNYNLYMGNHHALSWDGDFQVDAIKTKTQLAEAQLKRFSYE 

70 80 90 100 110 120 



120 130 140 150 160 170 

m249.pep LKNALPDAAAIHYAVCKDSSGNAPTLS-GNAFSSNCDNKANGDTLIKVLWVNDSAGDSDI 
I I I I I I I I 1 I II I M I I I I I I Mill I:: II I I I (:: I I I I I I I I I I I I I I I II I I I I 
a24 9 LKNALPDAAAIHYAVCKDSSGVAPTLSAGSTFSSNCDGSANGDTLIKVLWVNDSAGDSDI 
130 140 150 160 170 180 
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180 190 200 

m2 4 9 . pep S RTN LEV S GDN I V YT YQARVGGRE X 

: I I I I I :: I : I I I I I I I I I I I I I I I 
a2 4 9 ARTNLETNGNN I VYT YQARVGGREX 

190 200 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 973>: 

m249-l.aeq 

1 ATGAAGAATA ATGATTGCTT CCGCCTGAAA GATTCCCAGT CCGGTATGGC 

51 GCTGATAGAA GTCTTGGTTG CTATGCTCGT TCTGACCATC GGTATTTTGG 

101 CACTATTGTC TGTACAGTTG CGGACAGTCG CTTCCGTCAG GGAGGCGGAG 

151 ACACAAACCA TCGTCAGCCA AATCACGCAA AACCTGATGG AGGGAATGTT 

201 GATGAATCCG ACCATTGATT CGGACAGCAA CAAGAAAAAC TATAATCTTT 

251 ACATGGGAAA CCATACACTA TCAGCTGTGG ATGGCGATTT TGCGATTGAT 

301 GCCATGAAAA CTAAGGGGCA ATTGGCAGAG GCACAATTGA AGAGATTTAG 

351 TTATGAGCTG AAAAATGCCT TGCCGGATGC GGCAGCCATC CATTACGCCG 

401 TCTGCAAGGA TTCGTCGGGT AACGCGCCGA CATTGTCCGG CAATGCTTTT 

4 51 TCTTCAAATT GCGACAATAA GGCAAACGGG GATACTTTAA TTAAACTATT 

501 GTGGGTAAAT GATTCGGCAG GGGATTCGGA TATTTCCCGT ACGAATCTTG 

551 AGGTGAGCGG CGACAATATC GTATATACTT ATCAGGCAAG GGTCGGAGGT 

601 CGGGAATGA 

This corresponds to the amino acid sequence <SEQ ID 974; ORF 249- 1>: 

m249-l .pep 

1 MKNNDCFRLK DSQSGMALIE VLVAMLVLTI GILALLSV QL RTVASVREAE 
51 TQTIVSQITQ NLMEGMLMNP TIDSDSNKKN YNLYMGNHTL SAVDGDFAID 
101 AMKTKGQLAE AQLKRFSYEL KNALPDAAAI HYAVCKDSSG NAPTLSGNAF 
151 SSNCDNKANG DTLIKVLWVN DSAGDSDISR TNLEVSGDN1 VYTYQARVGG 
201 RE* 

m249-l/g249 90.1% identity in 203 aa overlap 

10 20 30 40 50 60 

MKNNDCFRLKDSQSGMALI EVLVAMLVLTIGI LALLSVQLRTVASVREAETQTIVSQI TQ 
III I I I : I I l: ! I I I ! I I I I 1 I I I I II I I I I I I I I I I ! II I I I I I I I I I I I I I I I I I I I 
M KNN DCLRLKN PQS GMAL I EVL VAML VLT I G I LALL S VQLRT VA S VRE AE TQT I VS QI TQ 
10 20 30 40 50 60 

70 80 90 100 110 120 

NLMEGMLMN PT I DS D S NKKN YNLYMGNHTL S A VDGD FAI D AMKT KGQL AEAQLKRFSY EL 
limillilll! II I IHI:| II MM lllll! :| :|| |:|:|||| MINIMI 

NLMEGMLMN PT I DLDSNKKNYSLYMGKQTLSAVDGEFMLDAEKSKAQLAEEQLKRFSHEL 
70 80 90 100 110 120 

130 140 150 160 170 179 

KNALPDAAAIHYAVCKDSSGNAPTLSGN-AFSSNCDNKANGDTLIKVLWVHDSAGDSDIS 
lllllll:illllll!lil|:imi : I I I I I I 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I 
KN AL P DA VA I H Y AVC K DS SG DAPTLS DS GA FS SN C DNKANGDT L IK VLWVNDS AGD S D I S 
130 140 150 160 170 180 

180 190 200 

m2 4 9- 1 . pep RTNLEVSGDNI VYT YQARVGGREX 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 n 1 1 1 1 1 1 

g249 RTNLEVSGDNI VYT YQARVGGREX 

190 200 

a249/ L36117 

gi 1 643582 (L36117) prepilin leader sequence requires cleavage to be active |Pseudomona3 
aeruginosa] 

>gi 11161222 (L48934) involved in type 4 firnbrial biogenesis; contains pre-pilin like leader 
sequence ( Pseudomonas aeruginosa] 

>gi I 1246299 (L76605) reference L36117, L48934 [Pseudomonas aeruginosa] Length = 185 
Score - 50.4 bits (118), Expect - 9e-06 

Identities - 45/183 (24%), Positives - 84/183 (45%), Gaps - 26/183 (14%) 

Query: 13 QSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQNLMEGMLMNPTI 72 

QSG ++IEVLVA+L+++IG+L ++++Q +T+ ++ + + + NL+E M- +P 
Sbjct: 12 QSGFSMIEVLVALLLISIGVLGMIAMQGKTIQYTADSVERNKAAMLGSNLLESMRASPKA 71 

Query: 73 D S D SNKKNYNL YMGNH H ALS VVDGDFQVDA I KTKTQLAEA QLKRFS YEL KN ALPD AA 129 



m24S-l .pep 
g249 

ra24S-l.pep 
g249 

m249-l .pep 
g249 



WO 99/57280 



PCT/US99/09346 



576 



D + M G A + T L +A +L ++ ++KK LP A 

Sbjct: 72 LYDVKDQ MATQSDFFKAKGSAFPTAPSSCTPLPDAIKDRLGCWAEQVKKELPGAG 126 

Query: 130 AI HYAVCKDSSGVAPTLSAGSTFSSNCDGSANGDTL-IKVLWVNDSAGDSDI ARTNL 185 

+ Y +C+ S +CDG G L I++ W + A +-+ 

Sbjct: 127 DLLKSDYYICRSSK PGDCDG — RGSMLEI RLAWRGKQGACVNAADSSA 172 

Query: 186 ETN 188 
+T+ 

Sbjct: 173 DTS 175 

m249-l/a249 90.7% identity in 204 aa overlap 

10 20 30 40 50 60 

m2 4 9-1 .pep MKNNDCFRLKDSQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQ 

minim: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 

a249 MKNNDCFRLKNPQSGMALIEVLVAMLVLTIGILALLSVQLRTVASVREAETQTIVSQITQ 
10 20 30 40 50 60 

70 80 90 100 110 119 

m2 4 9- 1 . pep NLMEGMLMNPTIDSDSNKKN YNLYMGNH-TLSAVDGDFAI DAMKTKGQLAEAQLKRFS YE 

I I I I I 1 1 1 1 I I I 1 1 I I I I I 1 1 1 1 1 1 I I I : I 1 : 1 1 1 1 I : I I : I I I I I { I I 1 1 1 I I I I I 

a249 NLMEGMLMN PT I DS DS NK KN YNLYMGN H HALS W DGD FQV DA I KT KTQL AEAQL K RFS YE 

70 80 90 100 110 120 

120 130 140 150 160 170 

m249-l . pep LKNALPDAAAIHYAVCKDSSGNAPTLS-GNAFSSNCDNKANGDTLIKVLWVNDSAGDSDI 
I I I M I I I I I I I II I I I I I I ( I I II I I : : I I I I I I :: I t I I I I I I I I I I I I I I I I I I I 
a249 LKNALPDAAAIHYAVCKDSSGVAPTLSAGSTFSSNCDGSANGDTLIKVLWVNDSAGDSDI 
130 140 150 160 170 180 

180 190 200 

m249-l .pep SRTNLEVSGDNIVYTYQARVGGREX 
:| II I |::|:| HIM III II IIM 
a249 ARTNLETNGNNIVYTYQARVGGREX 
190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 975>: 
g250 . seq 

1 atgacgcaca cagcctctcc acgtgatgaa ttcatacgcg gcataaaaga 

51 aagttcgccc atgctgattg ggcttttgcc ttgggcattg atactcggta 

101 tgcagggcgg gcaaaaaggt atgggccggc tggaaatgct gctgatgacg 

151 gggatgaact ttgccggcgg ctccgaattt gccacggtca acctgtgggc 

201 ggaacctctg ccgatactgc ttatcgccac cataaccttt atgattaatt 

251 cgcggcatat cctgatgggg ggcggcgctt gccacgcaca tgaaagaaat 

301 accgctgaaa aaagccgcgc ccgcgctgtt ttttatgtgt ga 

This corresponds to the amino acid sequence <SEQ ID 976; ORF 250.ng>: 

g250 . pep 

1 MTHTASPRDE FIRGIKE SSP MLIGLLPWAL ILGM QGGQKG MGRLEMLLMT 

51 GMNFAGGSEF ATVNLWAEPL PILLI ATITF MINSRHILMG GGACHAHERN 

101 TAEKSRARAV FYV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 977>: 
m250. seq 

1 ATGCACACCT TCCCCGCATA ACGAATTTAT ACGCGGCATC AAAGAAAGTT 

51 CGCCTATGCT GATTGGGCTG CTGCCTTGGG CATTAATACT CGGTATGCAG 

101 GGCGGACAAA AAGGCATGAG CTGGCTGGAA ATGTTGTTGA TGACCAGTAT 

151 GAACTTCGCC GGCGGCTCCG AGTTTGCCAC GGTCAACCTG TGGGCsGAAC 

201 CTCTGCCGAT ACTGCTTATC GCCACCGTAA CCTTTATGAT TAATTCTCGG 

251 CATATCCTGA T.GGGGGCGG CGCTTGCCCC GCACCTGAAA GGAaTACCGC 

301 TGAAAAAAGC CGTGCCCGCA CTGTTTTTTA TGTGTGA 

This corresponds to the amino acid sequence <SEQ ID 978; ORF 250>: 

m250.pep 

1 MHTPSPHNEF IRGIK ESSPM LIGLLPWALI LGM QGGQK3M SKLEMIiLMTS 

51 MNFAGGSEFA TVNLWAEPLP ILLI ATVTFM INSRHILMGG GA CPAPERNT 

101 AEKSRARTVF YV* 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 250 shows 91.0% identity over a 1 1 1 aa overlap with a predicted ORF (ORF 250.ng) 
from N. gonorrhoeae: 

m250/g250 

10 20 30 40 50 59 

m250.pep MHTPS PHNEFIRGI KESS PMLIGLLPWALI IX3MCX3GQKGMS WLEMLLMTSMNFAGGSEF 

II l|::|llllllimillMlllilllli|lllll|: I I I I I I I = I I I I I I I I I 
g250 MTHT AS PRDE F I RG I KES S PML I GLL PW AL I LGMQGGQKGMGRLEMLLMTGMN FAGGS E F 

10 20 30 40 50 60 

60 70 80 90 100 110 

m2 50 . pep ATVNLW AE PLP I LLI ATVT FMI NSRH I LMGGGACP A PERNTAEKS RARTVF YVX 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ri : 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 i ii ii him hi in 

g250 ATVNLWAEPLPI LLI ATI TFMI NSRH I LMGGGACHAHERNTAEKS RARAVFYV 

70 80 90 100 110 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 979>: 
a250.seq 

1 AT G AC AC AC A TAAGCTCGCC CCGTAACGAA TTTATACGCG GCATCAAAGA 

51 AAGTTCGCCC ATGCTGATCG GGCTTTTGCC TTGGGCATTA ATACTCGGTA 

101 TGCAGGGTGG ACAAAAAGGC ATGAGCTGGC TGGAAATGTT GTTGATGACC 

151 GGTATGAACT TCGCCGGCGG CTCCGAGTTT GCCACGGTCA ACCTGTGGGC 

201 GGAACCTCTG CCGATACTGC TTATCGCCAC CGTAACCTTT ATGATTAATT 

251 CTCGGCATAT CCTGATGGGG G . CGGCACTT GCCCCGCACC TGAAAGAAAT 

301 ACCGCTGAAA AAAGCCGTGC CCGCACTGTT TTTTATGTGT GA 

This corresponds to the amino acid sequence <SEQ ID 980; ORF 250.a>: 

a250.pep 

1 MTHISSPRNE FIRGIKE SSP MLIGLLPWAL ILGM QGGQKG MSWLEMLLMT 

51 GMNFAGGSEF ATVN LWAEPL PILLIATVTF MI NSRHILMG XGTCPAPERN 

101 TAEKSRARTV FYV* 

m250/a250 94.6% identity in 1 1 1 aa overlap 

10 20 30 40 50 

59 

m250.pep 

MHTPS PHNEFIRGI KESSPMLIGLLPWALILGMQGGQKGMSWLEMLLMTSMNFAGGSEF 
I 

f t : I I I I I I t I I I I I I I I I I 1 I I I I It I I I I I I I 1 I I I I I I I I I I :| I I t ! I I 1 I 

a250 

MTHISSPRNEFI RGIKESSPMLIGLLPWALILGMQGGQKGMSWLEMLLMTGMNFAGGSEF 

10 20 30 40 50 

60 

60 70 80 90 100 110 

m250.pep ATVNLWAEPLPILLIATVTFMINSRHILMGGGACPAPERNTAEKSRARTVFYVX 
I I I I I I I I I I I I M I 1 I I I I ! I I I I I I I I I I :l I I I S I I I I I I 1 I I I I I I I I I 
a250 ATVNLWAEPLPILLIATVTFMINSRHILMGXGTCPAPERNTAEKSRARTVFYVX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 981>: 

g251 . seq 

1 acgcctgacc caatagggat tcttttcgct gccgtcgggg ttgatttttt 

51 tgccgttgtt ttgagggggc gttttcaacg aataggcgcg gttggcatgt 

101 tgataataat aatcctgatg gcggaggtcg gaaccaaaac ggtcgtaacc 
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151 gaggttgacg ctcaggttgt ggcggatttt ggcggtatcg aaggattttt 

201 tgaatgccgc ctgcaagagc ctgtggcttt ccccgtaaat cacgcggtcg 

251 gatttgtagt aggaagacgg cttgtcggca ctcgggcggc aatatttgtc 

30i cgaaccgtcg gcggaacagt gcgtctgctg aaaatgattg tccaaaccga 

351 tgccctgccg gtcgtaagag aggcgggcat aatccgccca agtgtcttta 

4 01 tcggcattgg tatagacata tzccaaaccg tagcggcttt tggtgtgcgt 

4 51 ctcgtcgtaa aacacgcccg taccgtattc cgcgcccacc tccgcaccgt 

501 tttcaccgtt ggtaatcagc ccgctgtatt tgcggccgcc cgcgtatttg 

551 ccgtagcctc ttatcgatcc gtatttttta ttttcatcaa aaaccgcctt 

601 ggtcaggaat gccggaaccg tcatatcgcg cgtgtcgaaa gcttgctgcg 

651 tgcgttcgag tatgccgccg atgtagtgcc gtttgttttc aaaacgaaaa 

701 cccgggcgga acagccacga ccggctttcg tatga 

This corresponds to the amino acid sequence <SEQ ID 982; ORF 25 1 .ng>: 

g251 .pep 

1 MPDPIGILFA AVGVDFFAW LRGRFQRIGA VGMLIIIILM A EVGTKTWT 

51 EVDAQWADF GGIEGFFECR LQEPVAFPVN HAVGFWGRR LVGTRAAIFV 

101 RTVGGTVRLL KMIVQTDALP WREAGIIRP S VFIGIGIDI FQTVAAFG VR 

151 L»WKHARTVF RAHLRTVFTV GNQPAVFAAA RVFAVASYRS V7FI FIKNRL 

201 GQECRNRHIA RVESLLRAFE YAADWPFVF KTKTRAEQPR PAFV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 983>: 

ra251 . seq 

1 ATGCGTGCTG CGGTAGTCGT AGCGGAAGCC CGCGCCGACA TCCGCCCACC 

51 TGCCCAAACG GACATTGTCC CGAACTGCCG CGTAATAGCT TTTACCGTTG 

101 ATGCTGCGCG GCGTGCAGTC CGTATAAGTA TTGTTGCCCA AGCGGCAGAT 

151 TTGCCCCGTA ACGACATTTC CCCTGCCTAT GGTGACCCAA TAGGGGCTGG 

201 TTTCACTGCC GTTGGGGCTG ATTTTTTTGC CGT TGTTTTG AGGGGGCGTG 

251 TTCGACGAAT AGGCGCGGTT GGCATGTTGA TAATAATAAT CCTGATGGCG 

301 GAGATTAGAG CCAAAGCGGT CAAACCCGAG ATTCACGCTC AGGTTGTGGC 

351 GGATTTTGGC GGTATCGAAG GATTTTTTGA ATGCCGCCTG CAAGAGCCTG 

4 01 TGGCTTTCCC CGTAAATCAC GCGATCGGAT TTGTAATAGG AAAACGGCTT 

451 GTCGGCACTC GGGCGGCAAT ATTTGTCCGA ACCGTCGGCA GAACAGTGCG 

501 TCTGCTGAAA ATGATTATCC AAACCGATGC CCTGCCGGTC GTAAGAGAGG 

551 CGGGCATAAT CCGCCCAAGT GTCTTTATCG GCATTGGTAT AGACATATTC 

601 CAAAC CGTAG CGGCTTTTGG TGTGCGTCTC GTCGTAAAAC ACGCCCGTAC 

651 CGTATTCCGC GCCCACCAGC GCACCGTTTT CGCCGTTGGT AAACAGTCCG 

701 CCGTATTTGT GGTTGCCCGC GTATTTGCCG TTACCGGGCA AAGAACCCGC 

751 CTGTTTTTTA TTTGCATCAA AAACCGCCTT GGTCAGGAAT GCCGGAACCG 

801 TCATATCGCG CGTGTCGAAA GTTTGTTGCG TGTGTTCGAG TATGCCGCCG 

851 ATGTAGTGCC GCTTATTCTC AAAACGAAAA CCCGGGCGGA ACAGCCACGA 

901 CCGGCTTTCG TATGA 

This corresponds to the amino acid sequence <SEQ ID 984; ORF 25 1>: 

m251 .pep 

1 MRAAVWAQA RADIRPPAQT DIVPNCRVIA FTVDAARRAV RISIVAQAAD 

51 LPRNDISPAY GD P I GAG FT A VGADFFAWL RG RVRRIGAV GMLIIIILMA 

101 EIRAKAVKPE IHAQWADFG GIEGFFECRL QEPVAFPVNH AIGFVIGKRL 

151 VGTRAAI FVR TVGRTVRLLK MIIQTDALPV VREAGIIRPS VFIGIGIDIF 

201 QTVAAFGVRL WKHARTVFR AHQRTVFAVG KQSAVFWAR V FAVTGQRTR 

251 LFFICIKNRL GQECRNRHIA RVESLLRVFE YAADWPLIL KTKTRAEQPR 

301 PAFV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 251 shows 85.2% identity over a 243 aa overlap with a predicted ORF (ORF 25Lng) 
from N. gonorrhoeae: 

m251/g251 



40 50 60 70 80 90 

m251 . pep TVDAARRAVR I S I VA0AADL PRND I S PA YGDP I GAGFTAVGAD FF A WLRGRVRR I GAVG 

llll I :|l Mill II I I II =111111 
9251 MPDPIGI LFAAVGVDFFAWLRGRFQRIGAVG 

10 20 30 
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100 110 120 130 140 150 

m251.pep MLIIIILMAEIRAKAVKPErHAOWADFGGIEGFFECRLOEPVAFPVNHAIGFVIGKRLV 

I I I I I I I I I I ' '\'\ h I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I h I I I I 
g251 ML I IIILMAEVGTKTWTEVDAQWADFGGIEGFFECRLQEPVAFPVNHAVGFWGRRLV 

40 50 60 70 80 90 



160 170 180 190 200 210 

m2 51 . pep GTRAAI FVRTVGRTVRLLKM 1 1 QTDALP WREAGI IRPSVFIGIGIDI FQTVAAFGVRLV 

illinium iiiiiiihiiiiiiiiiiimmiiiiMiiiiiiniiimi 

g2 5 1 GTRAAI FVRTVGGTVRLLKM I VQTDALP WREAGI IRPSVFI GIG IDI FQTVAAFGVRLV 

100 110 , 120 130 140 150 



220 230 240 250 260 270 

m2 51 . pep VKHARTVFRAHQRTVFAVGKQSAVFWARVFAVTGQRTRLFFICI KNRLGQECRNRHI AR 

IIIIIMIIII Mlhlhl I I I r r | 1 | | | | r , |: :||| | | | | t ( I I I I I M I M 
g251 VKHARTVFRAHLRTVFTVGNQPAVFAAARVFAVAS YRS - VF F I F I KNRLGQECRNRHI AR 

160 170 180 190 200 210 



280 290 300 

m251 .pep VESLLRVFEYAADWPLILKTKTRAEQPRPAFVX 

I I I I I I : I I I I I I I I h : = I I I I I I I I I I I I I I I 

g251 VESLLRAFEYAADWPFVFKTKTRAEQPRPAFVX 
220 230 240 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 985>: 

a251.seq 

1 ATGCGTGCTG CGGTAGTCGT AGCGCAACCC CGCGCCGACA TCCGCCCACC 

51 TGCCCAAACG GACATTGTCC CGAACTGCCG CGTAATAGCT TTTGCCGTTG 

101 ATGCTGCGCG GCGTGCAGTC CGTATAAGTA TTGTTGCCCA AGCGGCAGAT 

151 TTGCCCCGTA ACCACATTTC CCCTGCCTAT GCTGACCCAA TAGGGTTGGT 

201 CCTTGCCGCC GTTGGGGTTG GCGGTTTTAG GGGGCGTTTT CGACGAATAG 

251 GCGCGGTTGG CATGTTGATA ATAATAATCC TGATGGCGGA GATTAGAGTC 

301 AAAGCGGTCA AAACCGAGAT TCACGCTCAG GTTGTGGCGG ATTTTGGCGG 

351 TATCGAAGGA TTTTTTGAAT GCCGCCTGCA AGAGCCTGTG GCTTTCCCCG 

4 01 TAAATCACGC GGTCGGATTT GTAGTAGGAA AACGGCTTGT CGGCACTCGG 

451 GCGGCAATAT TTGTCCGAAC CGTCGGCAGA ACAGTGCGTC TGCTGAAAAT 

501 GATTGTCCAA ACCGATGCCC TGCCGGTCGT AAGAGAGGCG GGCATAATCC 

551 ACCCAAGTGT CTTTATCGGC ATTGGTATAG ACATATTCCA AACCGTAGCG 

601 GCTTTTGGTG TGCGTCTCGT CGTAAAACAC GCCCGTACCG TATTCCGCGC 

651 CCACCAGCGC ACCGTTTTCG CCGTTGGTAA ACAGACCGCC GTATTTGTGG 

701 TCGCCCGCGT ATTTGCCGTT GCCTCTTATC GGTCCGTATT TTCTATTTTC 

751 ATCAAAAACC GCCTTGGTCA GGAATGCCGG AACCGTCATA TCGCGCGTGT 

801 CGAAAGTTTG TTGCGTGTGT TCGAGTATGC CGCCGATGTA GTGCCGTTTG 

851 TTTTCAAAAC GAAAACCCGG GCGGAACAGC CACGATCGGC TTTCGTATGA 

This corresponds to the amino acid sequence <SEQ ID 986; ORF 251.a>: 

a251.pep 

1 MRAAVWAQP RADIRPPAQT DIVPNCRVIA FAVDAARRAV RISIVAQAAD 

51 LPRNHISPAY ADPIGLVLAA VGVGGFRGRF R RIGAVGMLI IIILMAEI RV 

101 KAVKTEIHAQ WADFGGIEG FFECRLQEPV AFPVNHAVGF WGKRLVGTR 

151 AAIFVRTVGR TVRLLKMIVQ TDALPWREA GIIHPS VFIG IGIDIFQTVA 

201 AFGVRLWKH ARTVFRAHQR TVFAVGKQTA VFWARVFAV ASYRSVF SIF 

251 IKNRLGQECR NRHIARVESL LRVFEYAADV VPFVFKTKTR AEQPRSAFV* 

m25 l/a25 1 88.5% identity in 304 aa overlap 

10 20 30 40 50 60 

m251 . pep M RAA VWAQARAD I R P P AQT D I VPN CR V I AFTVDAARRAVR I S I VAQAAD L PRN CIS PAY 

I I I I I I I I I I I I I ! I I II I I I I I I I 1 I I I I : I I I t I I I I I I j I I I I I I I I I I I 1 I I I I 
a251 MRAAVWAQ PRADI R P P AQT D I V PN CR V I AFAVD AAR RAVR I S I V AQAA D L PRN H I S PAY 

10 20 30 40 50 60 



70 80 90 100 110 120 

ra2 51 . pep GDPIGAGFTAVGADFFAWLRGRVRRIGAVGMLI 1 1 1 LMAE I RAKAVK PE I HAQ WADFG 

s I II I : : I I I : I III I II I I I II II I I II I I I I I : I I I I I I I I I I I I I I I 
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a251 ADPIGLVLAAVGVGGF RGR FRR IGAVGML III I LMAEI RVKAVKTE I HAQWADFG 

70 80 90 ICO 110 

130 140 150 160 170 180 

m251 . pep GIEGFFECRLQEPVAFPV^HAIGFVIGKRLVGTRAAIFVRTVGRTVRLLKMIIQTDALPV 
I M I I I I 1 1 I I I I I I I I I I I I : | | I : | | | | | | | | | | | | | | | | | | | | | | | | | | : | | | | | | | 
a251 GIEGFFECRLQEPVAFPVNUAVGFWGKRLVGTRAAIFVRTVGRTVRLLKMIVQTDALPV 
120 130 140 150 160 170 

190 200 210 220 230 240 

m251 . pep VREAGI IRPSVFIGIGI DI FQTVAAFGVRLVVKHARTVFRAHCRTVFAVGKQSAVFVVAR 

M I 1 1 1 I : t I t I I I I I M I I I i I I I 1 1 M 1 1 1 1 I 1 1 I 1 1 I I M I I I I I 1 1 1 1 : i 1 1 I I I I 
a2 5 1 VREAGI 1HPSVFIG IG I DI FQTVAAFGVRLWKHARTVFRAHQRTVFAVGKQTAVFWAR 

180 190 200 210 220 230 

250 260 270 2B0 290 300 

m251 . pep VFAVTGQRTRLFFICIKNRLGQECRNRHIARVESLLRVFEYAADWPLILKTKTRAEQPR 
UN- h :| I I I I I i I I I I I I I I I I M I I I I j I Mi I | | I I | ::: 1 | [ M I i I M 
a251 VFAVASYRS-VFSIFII^NRLGQECRNRHIARVESLLRVFEYAADWPFVFKTKTRAEQPR 
240 250 260 270 280 290 

m251.pep PAFVX 
II I I 

a251 SAFVX 
300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 987>: 

g253 . seq 

1 atgatcgaca gggaccgtat gttgcgggac acgttggaac gtgtgcgtgc 

51 ggggtcgttc tggttatggg tggtggtggc atcgatgatg tttaccgccg 

101 gatttccagg cacttatctt ctgatggaca atcaggggct gaatttcttt 

151 ttagttttgg cgggagtgtt gggcatgaat acgctgatgc tggcagtatg 

201 gttggcaacg ttgttcctgc gcgtgaaagt gggacggttt ttcagcagtc 

251 cggcgacgtg gtttcggggc aaaggccctg taaatcaggc ggtgttgcgg 

301 ctgtatgcgg accsgtggcg gcaaccttcg gtacgatgga aaataggcgc 

351 aacggcgcac agcttgtggc tctgcacgct gctcggaatg ctggtgtcgg 

401 tattgctgct gcttttggtg cggcaatata cgttcaactg ggaaagcacg 

451 ctgttgagca atgccgcttc ggtacgcgcg gtggaaatgt tggcatggct 

501 gccgtcgaaa ctcggtttcc ctgtccccga tgcgcgggcg gtcatcgaag 

551 gtcgtctgaa cggcaatatt gccgatgcgc gggcttggtc ggggctgctg 

601 gtcggcagta tcgtctgcta cggcatcctg ccgcgcctct tggcttgggt 

651 agtgcgtaaa atccttttga aaacaagcga aaacggattg gatttggaaa 

701 aaacctatta tcaggcggtc atccgccgct ggcagaacaa aatcaccgat 

751 gcggatacgc gtcgggaaac cgtgtccgcc gtttcgccga aaatcgtctt 

801 gaacgatgcg ccgaaatggg cgctcatgct ggagaccgag tggcaggacg 

851 gccaatggtt cgagggcagg ctggcgcagg aatggctgga taagggcgtt 

901 gccgccaatc gggaacaggt tgccgcgctg gagacagagc tgaagcagaa 

951 accggcgcaa ctgcttatcg gcgtacgcgc ccaaactgtg ccggaccggg 

1001 gcgtgctgcg gcagattgtg cggctttcgg aagcggcgca gggcggcgcg 

1051 gtggtgcagc ttttggcgga acaggggctt tcagacgacc tttcggaaaa 

1101 gctggaacat tggcgtaacg cgctgaccga atgcggcgcg gcgtggcttg 

1151 agcctgacag ggtggcgcag gaaggccgtt tgaaagacca ataa 

This corresponds to the amino acid sequence <SEQ ID 988; ORF 253.ng>: 

g253.pep 

1 MIDRDRMLRD TLERVRAGS F WLWVWASMM FTAGFS GTYL LMDNQGLNFF 

51 LVLAGVLGMN TLMLAVW LAT LFLRVKVGRF FSSPATWFRG KGPVNQAVLR 

101 LYADQWRQPS VRWKIGATAH SLW LCTLLGM LVSVLLLLLV RQYTFNWEST 

151 LLSNAASVRA VEMLAWLPSK LGFPVPDARA VIEGRLNGNI ADARAWSGLL 

201 VGSIVCYGIL PRLLA WWCK ILLKTSENGL, DLEKTYYQAV IRRWQNKITD 

251 ADTRRETVSA VSPKIVLNDA PKWALMLETE WQDGQWFEGR LAQEWLDKGV 
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301 AANREQVAAL ETELKQKPAQ LLIGVRAQTV PDRGVLRQIV RLSEAAQGGA 
351 WQLLAEQGL SEDLSEKLEH WRNALTEQ3A AWLEPDRVAQ EGRLKDQ* 

The following partial DNA sequence was identified inN. meningitidis <SEQ ID 989>: 
m253 . seq 

1 ATGATTGACA GGAACCGTAT GCTGCGGGAG ACGTTGGAAC GTGTGCGTGC 

51 GGGGTCGTTC TGGTTGTGGG TGGTGGCGGC GACGTTTGCA TTTTTTACCG 

101 GTTTTTCAGT CACTTATCTT CTAATGGACA ATCAGGGTCT GAATTTCTTT 

151 TTGGTTTTGG CGGGCGTGTT GGGCATGAAT ACGCTGATGC TGGCAGTATG 

201 GTTGGCAATG TTGTTCCTGC GTGTGAAAGT GGGGCGTTTT TTCAGCAGTC 

251 CGGCGACGTG GTTTCGGGGC AAAGACCCTG TAAATCAGGC GGTGTTGCGG 

301 CTGTATGCGG ACGAGTGGCG GCAACCTTCG GTACGTTGGA AAATAGGCGC 

351 AACGTCGCAC AGCCTGTGGC TCTGCACGCT GCTCGGAATG CTGGTGTCGG 

401 TATTGTTGCT GCTTTTGGTG CGGCAATATA CGTTCAACTG GGAAAGCACG 

451 CTGTTGAGCA ATGCCGCTTC GGTACGCGCG GTGGAAATGT TGGCATGGCT 

501 GCCGTCGAAA CTCGGTTTCC CTGTCCCCGA TGCGCGGGCG GTCATCGAAG 

551 GCCGTCTGAA CGGCAATATT GCCGATGCGC GGGCTTGGTC GGGG CTGCTG 

601 GTCGGCAGTA TCGCCTGCTA CGGCATCCTG CCGCGCCTGC TGGCTTGGGT 

651 AGTGTGTAAA ATCCTTTTGA AAACAAGCGA AAACGGATTG GATTTGGAAA 

701 AG CCCTATTA TCAGGCGGTC ATCCGCCGCT GG C AG AACAA AATCACCGAT 

751 GCGGATACGC GTCGGGAAAC CGTGTCCGCC GTTTCACCGA AAATCATCTT 

801 GAACGATGCG CCGAAATGGG CGGTCATGCT GGAGACCGAG TGGCAGGACG 

851 GCGAATGGTT CGAGGGCAGG CTGGCGCAGG AATGGCTGGA TAAGGGCGTT 

901 GCCACCAATC GGGAACAGGT TGCCGCGCTG GAGACAGAGC TGAAGCAGAA 

951 ACCGGCGCAA CTGCTTATCG GCGTGCGCGC CCAAACTGTG CCGGACCGCG 

1001 GCGTGTTGCG GCAGATTGTC CGACTCTCGG AAGCGG CGCA GGGCGGCGCG 

1051 GTGGTGCAGC TTTTGGCGGA ACAGGGGCTT TCAGACGACC TTTCGGAAAA 

1101 GCTGGAACAT TGG CGTAACG CGCTGGCCGA ATGCGGCGCG GCGTGGCTTG 

1151 AGCCTGACAG GGCGGCGCAG GAAGGGCGTT TGAAAGACCA ATAA 

This corresponds to the amino acid sequence <SEQ ID 990; ORF 253>: 

m253 .pep 

1 MIDRNRMLRE TLERVRAGS F WLWWAATFA FFTGFSV TYL LMDNQGLNFF 

51 LVLAGVLGMN TLMLAVW LAM LFLRVKVGRF FSSPATWFRG KDPVNQAVLR 

101 LYADEWRQPS VRWKIGATSH SLW LCTLLGM LVSVLLLLLV RQYTFNWEST 

151 LLSNAASVRA VEMLAWLPSK LGFPVPDARA VIEGRLNGNI ADARAWSGLL 

201 VGSIACYGIL PRLLA WWCK ILLKTSENGL DLEKPYYQAV IRRWQNKITD 

251 ADTRRETVSA VSPKIILNDA PKWAVMLETE WQDGEWFEGR LAQEWLDKGV 

301 ATNREQVAAL ETELKQKPAQ LLIGVRAQTV PDRGVLRQIV RLSEAAQGGA 

351 WQLLAEQGL SEDLSEKLEH WRNALAECGA AWLEPDRAAQ EGRLKDQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 253 shows 94.7% identity over a 397 aa overlap with a predicted ORF (ORF 253.ng) 
from N. gonorrhoeae: 

m253/g253 

10 20 30 40 50 60 

m2 53 . pep MI DRNRMLRETLERVRAGS F^^VAATAATFAFFTGFS VTYLLMDNQGLNFFLVLAGVLGMN 

liihllihllllllllllllllhl:: I Mil lllllllllimilllllllll 
g253 MIDRDRMLRDTLERVRAGSFWLWVWASMMFTAGFSGTYLLMDNQGLNFFLVLAGVLGMN 
10 20 30 40 50 60 



70 80 90 100 110 120 

m2 5 3 . pep TLMLAVWLAKLFLRVKVGRFFSSPATWFRGKDPVNQAVLRLYADEWRQPSVRWKI GATSH 

I II I II III II II 111 MINI II III III II III I II lllh Mill Ml II I I hi 
g253 TLMLAVWLATLFLRVKVGRF FS S PATWFRGKG P VNQ AVLRL YADQWRQ P S VRWKI GAT AH 

70 80 90 100 110 120 



130 140 150 160 170 180 

m2 53 . pep SLWLCTLLGKLVSVLLLLLVRQYTFNWESTLLSNAAS\^VEMIJVWLPSKLGFPVPDARA 

I Mill II IN III I II MINIM II! II MIMII III Mill lllll III III i K I 

g2 53 SLWl^TLLGKLVSVLLLLLVRQYTFNWESTLLSNAASVRAVEMLAWLPSKLGFPVPDARA 
130 140 150 160 170 180 
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190 200 210 220 230 240 

VI EGRLNGNI ADARAWSGLLVGS I ACYGI LPRLLAWWCKI LLKTSENGLDLEKP YYQAV 

II II llllll M III III till Ihl MINI (II II III Ml I III II II Ml Illll 

VIEGRLNGNI ADARAWSGLLVGS IVCYGI LPRLLAWWCKI LLKTSENGLDLEKTYYQAV 
190 200 210 220 230 240 

250 260 270 280 290 300 

IRRWQNKITDADTRRETVSAVSPKIILNDAPKWAVMLETEWQDGEWFEGRLAQEWLDKG 1 / 

llllllllllllllllll IMIM hlllllllhllllllllhlllMIIII III III 

IPJWQNKITDADTRRETVSAVSPKIVLNDAPKJJALNTLE^ 

250 260 270 200 290 300 

310 320 330 340 350 360 

ATbn^OVAALETELKOKPAQLLIGVRAOWPDRGVMOIVRLSEAAQGGAVVQLLAEQGI, 

MINIMI II III III III III M III III IMIIIIIIII II! Mill II I II I III 

AANREQVAALETELKQKPAQLLIGVRAQTVPDRGVLROIVRLSEAAQGGAWQLLAEQGL 
310 320 330 340 350 360 

370 380 390 

S DDLS E KLEHWRNALAECGAAWLE PDRAAQEGRL KDQX 

I III I II III I IIIMI II I Mill hi I II INI 1 1 

SDDLSEKLEHWRNALTECGAAWLEPDRVAQEGRLKDQX 
370 380 390 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 99 1>: 

a253. seq 

1 ATGATCGACA GGAACCGTAT GCTGCGGGAG ACGTTGGAAC GTGTGCGTGC 

51 GGGGTCGTTC TGGTTGTGGG TGGCGGCGGC GACGTTTGCG TTTTTTACCG 

101 GTTTTTCAGT TACTTATCTT CTAATGGACA ATCAGGGTCT GAATTTCTTT 

151 TTGGTTTTGG CGGGCGTC-TT GGGCATGAAT ACGCTGATGC TGGCAGTATG 

201 GTTGGCAATG TTGTTCCTGC GCGTGAAAGT GGGGCGTTTT TTCAGCAGTC 

251 CGGCGACGTG GTTTCGGGGC AAAGACCCTG TCAATCAGGC GGTGTTGCGG 

301 CTGTATGCGG ACGAGTGGCG GCAACCTTCG GTACGTTGGA AAATAGGCGC 

351 AACGTCGCAC AGC3TGTGGC TCTGCACGCT GCTCGGAATG CTGGTGTCGG 

401 TATTGTTGCT GCTTTTGGTG CGGCAATATA CGTTCAACTG GGAAAGCACG 

451 CTGTTGGGCG ATTCGTCTTC GGTACGGCTG GTGGAAATGT TGGCATGGCT 

501 GCCTGCGAAA CTGGGTTTTC CCGTGCCTGA TGCGCGGGCG GTCATCGAAG 

551 GTCGTCTGAA CGGCAATATT GCCGATGCGC GGGCTTGGTC GGGGCTGCTG 

601 GTCGGCAGTA TCGCCTGCTA CGGCATCCTG CCGCGCCTCT TGGCTTGGGC 

651 GGTATGCAAA ATCCTTTTGA AAACAAGCGA AAACGGCTTG GATTTGGAAA 

701 AGCCCTATTA TCAGGCGGTC ATCCGCCGCT GGCAGAACAA AATCACCGAT 

751 GCGGATACGC GTCGGGAAAC CGTGTCCGCC GTTTCGCCGA AAATCGTCTT 

801 GAACGATGCG CCGAAATGGG CGGTCATGCT GGAGACCGAA TGGCAGGACG 

851 GCGAATGGTT CGAGGGCAGG CTGGCGCAGG AAT3GCTGGA TAAGGGCGTT 

901 GCCGCCAATC GGGAACAGGT TGCCGCGCTG GAGACAGAGC TGAAGCAGAA 

951 ACCGGCGCAA CTGCTTATCG GCGTGCGCGC CCAAACTGTG CCCGACCGCG 

1001 GCGTGTTGCG GCAGATCGTC CGACTTTCGG AAGCGGCGCA GGGCGGCGCG 

1051 GTGGTGCAGC TTTTGGCGGA ACAGGGGCTT TCAGACGACC TTTCGGAAAA 

1101 GCTGGAACAT TGGCGTAACG CGCTGACCGA ATGCGGCGCG GCGTGGCTGG 

1151 AACCCGACAG AGCGGCGCAG GAAGGCCGTC TGAAAACCAA CGACCGCACT 

1201 TGA 

This corresponds to the amino acid sequence <SEQ ID 992; ORF 253.a>: 

a253 .pep 

1 MIDRNRMLRE TLERVRAGSF WLWVAAATFA FFTGFSVTYL LMDNQGLNFF 

51 LVLAGVLGMN TLMLAVW LAM LFLRVKVGRF FSSPATWFRG KDPVNQAVLR 

101 LYADEWRQPS VRWKIGATSH SLW LCTLLGM LVSVLLLLLV RQYTFNWEST 

151 LLGDSSSVRL VEMLAWLPAK LGFPVPDARA VIEGRLNGNI ADARAWSGLL 

201 VGSIACYGIL PRLLA WAVCK ILLKTSENGL DLEKPYYQAV IRRWQNKITD 

251 ADTRRETVSA VSPKIVLNDA PKWAVMLETE WQDGEWFEGR LAQEWLDKGV 

301 AANREQVAAL ETELKQKPAQ LLIGVRAQTV PDRGVLRQIV RLSEAAQGGA 

351 WQLLAEQGL SDDLSEKLEH WRNALTECGA AWLEPDRAAQ EGRLKTNDRT 

401 * 



m253 -pep 
g253 

m2 5 3 . pep 
g253 

m253 .pep 
g253 

m253 .pep 
g253 
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m253/a253 97.2% identity in 395 aa overlap 

10 20 30 40 50 60 

m253.pep MI DRNRMLRET LE R VRAG S FW LWW AAT FAF FTG F S V T Y L LM DN QG LN FFL VLAG V LGMN 

I I I M I I I I M I I I 1 1 I I I I I ! I I : I I I f 1 I I II I I 1 1 I i I I I I i I I ! I 1 1 | | 1 1 | M | | 
a253 MI DRNRMLRETLERVRAGS FV7LWAAATFAFF?GFSVTYLU^DNQGLNFFLVLAGVLGMN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m253.pep T LMLAVWLAM L FLR V KVG R FFS S P AT W FRGK D P VN QA V LR L YADEWRQ PS VRWK I GAT S H 

II I I IIIMI KM II III I II I I II MM II I M I I I I I II I I I I I ! 1 1 ! I I I II I I I 
a253 T LML AVW LAf 4 L FLR VKVG R FFS S P ATW FR GK D PVN QA V LR L Y A DEWRQ P S VRWK I GAT S H 

70 80 90 100 110 120 

130 140 150 160 170 180 

m253 . pep SLWLCTLLGMLVSVLLLLLVRQYTFNWESTLLSNAASVRAVEMLAWLPSKLGFPVPDARA 

I M I I I I I I I I I II II I I I I H I I I I I I I I I I :::: I M I II I II I I : I I I I I I I I I II 
a253 SLWLCTLLGMLVSVLLLLLVRQYTFNWESTLLGDSSSVRLVEMLAWLPAKLGFPVPDARA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m253 . pep VIEGRLNGNIADARAWSGLLVGSIACYGILPRLLAWVVCKILLKTSENGLDLEKPYYQAV 
I M I I II II II II I I I I I II I II I I II I I I I I I I I I : I I I I I II I I I I I III I II I II II 
a253 VIEGRLNGNIADARAWSGLLVGSIACYGILPRLLAWAVCKILLKTSENGLDLEKPYYQAV 

190 200 210 220 230 240 

250 260 270 280 290 300 

m253 . pep I RRWQNKI TDADTRRETVSAVS PKI ILNDAPKWAVMLETEWQDGEW FEGRLAQEWLDKGV 

I I M I M M M I I M I M M I I M I : II I II I f I II I I M I II I I Ml I I II I I II I I II 

a253 I RRWQNKITDADTRRETVSAVSPK I VLNDAPKWAVMLETEWQDGEW FEGRLAQEWLDKGV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m253 . pep ATNREQVAALETELKQKPAQLLIGVRAQTVPDRGVLRQIVRLSEAAQGGAWQLLAEQGL 
I : I M I M M I I M I I I I II I I I I II I I I I I I I I I I I I I I II I I I I M I I I I I II I I I II 
a253 AANREQVAALETELKQKPAQLLIGVRAQTVPDRGVLRQIVRLSEAAQGGAWQLLAEQGL 
310 320 330 340 350 360 

370 380 390 

m253 .pep SDDLSEKLEHWRNALAECGAAWLEFDRAAQEGRLKDQX 

I I I M I M M I II I I : II II I II I M M II I I I I I 
a253 SDDLSEKLEHWRNALTECGAAWLEPDRAAQEGRLKTNDRTX 

370 380 390 J00 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 993>: 

g254 .Beg 

l atgtatgcag gcgaacgctt caatacttac agccatttga gcggtttgat 

51 tctggcggcg gcaggtttga tgctgatgct gctgaaaacc ataggacacg 

iOl gggacggata ccgtatcttc agcgtatcgg tttacggcat cagccttctt 

151 ctgctctatt tgagttcctc gctgtaccac ggaattgcag ccggaaaact 

201 gaaaagcatt ttgaaaaaaa ccgaccactg catgatttat gtgctgattg 

251 ccggaagcta cacaccgttt gcactggttt ctttgagaaa cgggccgggc 

301 tggacggtat tttcactgtc ctggctgctg gcggctgcag gaatcgcaca 

351 agaactcacc atcggacgga aaagcgaaaa acgtctgctg tctattgcga 

401 tttatatcgt aatgggctgg atggtcttgg cggtaatgaa atccctgaca 

4 51 gcctcactcc cgccggcagg actggcttgg ctggcggcag gcggtatgct 

501 gtacagcgtc ggcatttact ggtttgtaaa cgatgaaaaa atccgacacg 

551 ggcacggaat ctggcatctg ttcgtattgg gcggcagcat aacccaattt 

601 gtcagcgtgt acggttatgt aatctga 

This corresponds to the amino acid sequence <SEQ ID 994; ORF 254.ng>: 

g254.pep 

1 MYAGERFNTY SHLSGLILAA AGLMLMLLKT IGHGDGYRIF SVSVYGISLL 
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51 LLYLS SSLYH GIAAGKLKSI LKKTDHCMIY VLIAGSYTPF ALVSLRNGPG 
101 WTVFSLS WLL AAAGIAQELT IGRKS EKRLL SIAIYIVMGW MVLAVMKSLT 
151 ASLPPAGLAW LAAGGMLYSV GIYWFVNDEK IRHGHGIWHL FVLGGSITQF 
201 VSVYGYVI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 995>: 



m254 . seq 


(partial) 










1 


. . GTATCGGTTT 


ACGGCATCAG 


CCTTCTTCTG 


CTCTATTTGA 


GTTCCTGGCT 


51 


GTACCACGGA 


ATTGCAGCCG 


GAAAACTGAA 


AAGCATTTTG 


AAAAAAACCG 


101 


ACCACTGCAT 


GATTTATGTG 


CTGATTGCCG 


GAAGCTACAC 


ACCGTTTGCA 


151 


CTGGTTTCTT 


TGAGAAACGG 


GCCGGGCTGG 


ACGGTATTTT 


CACTGTCCTG 


201 


GCTGCTGGCG 


GCTGCAGGAA 


TCGCACAAGA 


ACTCACCATC 


GGACGGAAAA 


251 


GCGAAAAACG 


TCTGCTGTCT 


ATTGTGATTT 


ATGTCGTCAT 


GGGTTGGATG 


301 


GTCTTGGCGG 


TAATGAAATC 


CCTGACAGCC 


TCACTCCCGT 


CGGCAGGACT 


351 


GGCTTGGCTG 


GCGGCAGGCG 


GTATGCTGTA 


CAGTGTCGGC 


ATTTACTGGT 


401 


TTGTAAACGA 


TGAAAAAATC 


CGACACGGGC 


ACGGAATCTG 


GCATCTGTTC 


451 


GTATTGGGCG 


GCAGCATCAC 


CCAATTTGTC 


AGCGTGTACG 


GTTACGTAAT 


501 


CTGA 











This corresponds to the amino acid sequence <SEQ ID 996; ORF 254>: 

m254.pep (partial) 

1 . . VSVYGISLLL LYLS SWLYHG I AAGKLKS I L KKTDHCMIYV LIAGSYTPFA 

51 LVSLRNGPGW TVFSLSW LLA AAGIAQELTI GRKS EKRLLS IVIYWMGWM 

101 VLAVMKSLTA SLPSAGLAWL AAGGMLYSVG IYWFVNDEKI RHGHGIWHLF 

151 VLGGSITQFV SVYGYVI* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 254 shows 97.6% identity over a 167 aa overlap with a predicted ORF (ORF 254.ng) 
from TV. gonorrhoeae: 

m254/g254 

10 20 30 

m254 . pep VSVYGISLLLLYLSSWLYHGIAAGKLKSIL 

lllllllllllllll I II I M I I I I I I II 
g2 5 4 HLSGLILAAAGLMLMLLKTIGHGDGYRI FS VS V YG I S LLLL YLS S S L YHG I AAGKLKS I L 

20 30 40 50 60 70 



40 50 60 70 80 90 

m254 . pep KKTDHCMIYVLIAGSYTPFALVSLRNGPGWTVFSLSWLLAAAGIAQELTI GRKSEKRLLS 

I IIIIIIIMI Ml I Ml II 1 1 1 1 1 1 II III III II I II III II MM II I II llll II I 

g2 54 KKTDHCMIYVLIAGSYTPFALVSLRNGPGWTVFSLSWLLAAAGIAQELTIGRKSEKRLLS 
80 90 100 110 120 130 



100 110 120 130 140 150 

m2 54 . pep I VI YVVMGWWIjAVMKS LTASLP SAGLAWLAAGGML YS VG I YWFVNDE KI RHGHG I WHL F 

IMhIMIIIilllMMIIM ! 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M II II II 1 1 1 1 1 

g254 I AI Y I VMGWMVLAVMKS LTASL P PAGLAWLAAGGML YS VG I YW FVNDE KI RHGHG I WHL F 

140 150 160 170 180 190 



160 

m254 .pep VLGGSITQFVSVYGYVIX 
I I I I I I I I 1 I I I I I I I I I 
g254 VLGGSITQFVSVYGYVIX 
200 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 997>: 

a254 . seq 

1 ATGTATACAG GCGAACGCTT CAATACTTAC AGCCATTTGA GCGGTTTGAT 

51 TCTGGCGGCG GCAGGTTTGG CGCTGATGCT GCTGAAAACC ATAGGACACG 

101 GGGACGGCTA CCGTATCTTC AGCGTATCGG TTTACGGCAT CAGCCTTCTT 

151 CTGCTCTATT TGAGTTCCTC GCTGTACCAC GGAATTGCAG CCGGAAAACT 

201 GAAAAGCATT TTGAAAAAAA CCGACCACTG CATGATTTAT GTGCTGATTG 

251 CCGGAAGCTA CACACCGTTT GCACTGGTTT CTTTGAGAAA CGGGCCGGGC 
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301 TGGACGGTAT TTTCACTGTC CTGGCTGCTG GCGGCTGCAG GAATCGCACA 

351 AGAACTCACC ATTGGACGGA AAAGCGAAAA ACGACTGCTG TCTATTGCGA 

4 01 TTTATATCGT AATGGGCTGG ATGGTCTTGG CGGTAATGAA ATCCCTGACA 

4 51 GCCTCACTCC CGCCGGCAGG ACTGGCTTGG CTGGCGGCAG GCGGTATGCT 

501 GTACAGCGTC GGCATTTACT GGTTTGTAAA CGATGAAAAA ATCCGACACG 

551 GGCACGGAAT CTGGCATCTG TTCGTATTGG GCGGCAGCAT CACCCAATTT 

601 GTCAGCGTGT ACGGTTACGT AATCTGA 

This corresponds to the amino acid sequence <SEQ ID 998; ORF 254.a>: 
a254 .pep 

1 MYTGERFNTY SHLSGLILAA AGLALMLLKT IGHGDGYRIF SVSVYGISLL 

51 LLYLSSSLYH G IAAGKLKSI LKKTDH CMIY VLIAGSYTPF ALV SLRNGPG 

101 WTVFSLSWLL AAAGIAQELT IGRKSEK RLL SIAIYIVMGW MVLAVM KSLT 

151 ASLPPAGLAW LAAGGMLYSV GTYWFVNDEK IRHGHGIWHL FVLGGSITQF 

201 VSVYGYVI* 

m254/a254 97.6% identity in 1 67 aa overlap 

10 20 

30 

m254 .pep 

VSVYGISLLLLYLSSWLYHGIAAGKLKSIL 

I I I I I I I I ! I I I I ! I 

I I I I I II I I i I I I I 

a254 

HLSGLILAAAGLALMLLKTIGHGDGYRIFSVSVYGISLLLLYLSSSLYHGIAAGKLKSIL 

20 30 40 50 60 

70 

40 50 60 70 30 

90 

m254 .pep 

KKT DHCMI YVL I AGSYTPFALVSLRNGPGWTVFSLSWLLAAAG I AQELTI GRKSEKRLLS 

! I M I I I II I I t I I I I 1 1 I I 1 1 I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

a254 

KKTDHCMIYVLIAGSYTPFALVSLRMGPGWTVFSLSWLLAAAGIAQELTI GRKSEKRLLS 

80 90 100 110 120 

130 

100 110 120 130 140 

150 

m254 .pep 

IVIYWMGWMVlAVMKSLTASLPSAGLAWIjyVGGMLYSVGIYWFVNDEKIRHGHGIWHLF 

I : I I: I MM M I II! I I II II I 
M I I I M I I 1 1 I I I I I I I I I I I I I I I I I I I II I 1 1 I 
a254 

IAIYIVMGWMVLAVMKSLTASLPPAGLAWLAAGGMLYSVGIYWFVNDEKIRHGHGIWHLF 

140 150 160 170 180 

190 

160 

m254 .pep VLGGSITQFVSVYGYVIX 
I I I I I I I I I I I I I I I I I I 
a254 VLGGSITQFVSVYGYVIX 
200 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 999>: 
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g255.seq 

1 atggttggac aggaagcctt gcggggtcag ttcgtcgccg tgttcgctgc 

51 cgcgttgcgt tacgctgtca aaacctgcgc cgatttccac gcctttgacg 

101 gcgttgatgc ccatcatcgc gtaggcgatt tcggcatcga ggcggtcgaa 

151 aacgggttcg cccaaaccga cggggacgtt ggcggcttcg atatgcagtt 

201 tcgcgccgac ggaatccaag gatttgcgca caccgtccat atagtgttcc 

251 agttcggcga tttggctttg gttggcggca aaaaaaggat tbtgggaaat 

301 gtgttcgctg ccttcaaacc ggattttttt ttcgccgact tgggtaacgt 

351 aggcggtgat ttccgtgccg aatttttctt tcagccattt tttggcaacg 

401 gctccggcgg caacgcgggc tgcggtttcg cgggcggaac tcctgccgcc 

4 51 gccccggtag tcgcgcgtac cgtatttgtg ccaataggta tagtcggcgt 

501 gtccggggcg gaaggcggtg gcgatgtcgc cgtagtcttc gctgcgctgg 

551 tcggtgttgc ggattag 

This corresponds to the amino acid sequence <SEQ ID 1000; ORF 255.ng>: 

g255 .pep 

1 MVGQEALRGQ FVAVFA AALR YAVKTCADFH AFDGVDAHHR VGDFGIEAVE 

51 NGFAQTDGDV GGFDMQFRAD GIQGFAHTVH IVFQFGDLAL VGGKKRILGN 

101 VFAAFKPDFF FADLGNVGGD FRAEFFFQPF FGNGSGGNAG CGFAGGTPAA 

151 APWARrVFV PIGIVGV SGA EGGGDVAWF AALVGVAD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1001>: 

m255 . seq 

1 GTGGTTGGAC AGGAAGCCTT GCGGGGTCAG TTCGTCGCCG TGTTCGCTGC 

51 CGCGTTGCGT TACGCTGTCA AAACCTGCGC CGATTTCCAC GCCTTTGACG 

101 GCGTTGATGC CCATCATCGC GTAGGCGATT TCGGCATCGA GGCGGTCAAA 

151 AACAGGTTCG CCCAAGCCGA CAGGGACATT GGCTGCTTCG ATATGCAGCT 

2 01 TCGCGCCGAC GGAATCCAAG GATTTGCGCA CGCTGTCCAT ATAGTTTTCC 

251 AGCTCGGCAA TTTGGCTATG GTTGGCGGCA AAAAAAGGAT TTTGGGAAAT 

301 GTGTTCGCAG CCTTCAAACC GGATTTCTTT TTCGCCGACT TGGGTAACGT 

351 AGGCGGTGAT TTCCGTGCCG AATTTTTCTT TCAACCATTT TTTGGCAACG 

401 GCTCCGGCAG CAACGCGGGC GGCGGTTTCA CGGGCGGAGC TCCTGCCGCC 

4 51 GCCGCGGTAG TCGCGCGTGC CGTATTTGTG CCAATAGGTA TAGTCGGCGT 

501 GGCCGGGGCG GAAGCTGGTG GCGATGTTGC CGTAGTCTTT GCTGCGCTGG 

551 TCGGTATTGC GGATTAA 

This corresponds to the amino acid sequence <SEQ ID 1002; ORF 255>: 
ra255 .pep 

1 WGQEALRGQ FVAVFAAALR YAVKTCADFH AFDGVDAHHR VGDFGIEAVK 
51 NRFAQADRDI GCFDMQLRAD GIQGFAHAVH IVFQLGNLAM VGGKKRILGN 
101 VFAAFKPDFF FADLGNVGGD FRAEFFFQPF FGNGSGSNAG GGFTGGAPAA 
151 AAWARAVFV PIGIVGVAGA EAGGDVAWF AALVGIAD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 255 shows 88.8% identity over a 188 aa overlap with a predicted ORF (ORF 255.ng) 
from TV. gonorrhoeae: 

m255/g255 

10 20 30 40 50 60 

WGQEALRGQ FVAVFAAALR YAVKTCADFHAFDGVDAHHRVGDFG I EAVKNRFAQADRD I 

= I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I = I llhl h 
MVGQEALRGQ FVAVFAAALR YAV KTCADFHAFDGVDAHHRVGDFG I EAVENG FAQTDGD V 
10 20 30 40 50 60 

70 80 90 100 110 120 

GCFDMQLRADGI QGFAHAVH IVFQLGNLAMVGGKKRI LGNVFAAFKPDF FFADLGNVGGD 

I lllhill II llllhlllllhhl HI III IMIIMNMI III IIMIIIII 

GG FDMQFRADGI QGF AHTVH I VFQ FGDLALVGG KKR I LGNVFAAF KPD F FFADLGNVGGD 
70 80 90 100 110 120 

130 140 150 160 170 180 

FRAEFFFQPF PGNGSGSNAGGGFTGGAPAAAAWARAVFVP I G I VGVAGAEAGGDVAWF 

lllllllllllllllhlll Ihlhllll III HII III Nihil Hi III III 
FRAE FF FQP F FGNGSGGNAGCG FAGGT P AAAP WARTV FV P I G I VGVS GAEGGGD VAWF 



m255 .pep 
g255 

m255 -pep 
g255 

m255 .pep 
g255 
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130 140 150 160 170 180 



189 

m2 5 5 . pep AALVGIADX 

Illlhlll 
g255 AALVGVADX 



The following partial DNA sequence was identified inN. meningitidis <SEQ ID 1003>: 



a255.seq 




1 


GTGGTTGGAC 


51 


CGCGTTGCGT 


101 


GCGTTGATGC 


151 


TACGGGTTCG 


201 


TCGCGCCGAC 


251 


AGCTCGGCAA 


301 


GTGTTCGCAG 


351 


AGGCGGTGAT 


401 


GCTCCGGCGG 


451 


GCCCCGGTAG 


501 


GGCCGGGGCG 


551 


TCGGTATTGC 



'TCAAACC GGATTTCTTT TTCGCCGACT TGGGTAACGT 
:CGTGCCG AATTTTTCTT TCAACCATTT TTTGGCAACG 
ICGCGGGC GGCGGTTTCG CGGGCGGAAC TCCTGCCGCC 
;CGCG?GC CGTATTTGTG CCAATAGGTA TAGTCGGCGT 
LGCTGGTG GCGATGTTGC CGTAGTCTTT GCTGCGCTGG 
tTTAA 

This corresponds to the amino acid sequence <SEQ ID 1004; ORF 255.a>: 

a255 .pep 

1 WGQEALRGE FVAVFA AALR YAVKTCADFH AFDGVDAHHG VGDFGIEAVE 

51 YGFAQADGDV GGFNMQLRAD GIQGFAHAVH IVFQLGNLAM VGGKKRILGN 

1C1 VFAAFKPDFF FADLGNVGGD FRAEFFFQPF FGNGSGGNAG GGFAGGTPAA 

151 AP WARAVFV PIGIVGVAG A E AGGDVAVVF AALVGIAD + 

m255/a255 93.1% identity in 188 aa overlap 

10 20 30 40 50 60 

m255 . pep WGQEALRGQ FVAV FAAALR YAVKTCADFHAFDGVDAHHRVG D FG I EAVKNRFAQADRD I 

I I I I I I I I I : I I I I I I I I II I I 1 I I I ! I I I I II I I I M I II I I I I I 1 I : I I I I 1 I: 
a255 WGQEALRGE FVAVFAAALRYAVKTCADFHAFDGVDAHHG VG DFG I EAVE YG FAQADG DV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m255 . pep GCFDMQLRADGIQGFAHAVHIVFQLGNLAMVGGKKRILGNVFAAFKPDFFFADLGNVGGD 
I I : I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I ! I I II II I I I I I I I I II I I I I I I 
a255 GGFNMQLRADGIQGFAHAVHIVFQLGNLAMVGGKKRILGNVFAAFKPDFFFADLGNVGGD 

70 80 90 100 110 120 

130 140 150 160 170 180 

m255 . pep FRAE FFFQPFFGNGSGSNAGGGFTGGAPAAAAWARAVFVPIGIVGVAGAEAGGDVAVV? 

I I I I I II I I I II I II 1:1 II I II: I I: I I I I I I I I I ! I I I I I I I I I I I I II I I I I I M I 
a255 FRAE FFFQPFFGNGSGGNAGGGFAGGTPAAAPWARAVFVPIGIVGVAGAEAGGDVAWF 

130 140 150 160 170 180 

189 

m2 5 5 . pep AALVGIADX 
I I I I I I I I I 
a255 AALVGIADX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1005>: 

g256 .seq 

1 atgctcgcgg tacgcaatcg gggttggcac ggcgcagtcg tccatttccg 

51 cagctgcggc ggcgtagcga acaccgcccc ggtgttctac cacttgggtg 

101 ataccgccga aatcgccttt gctttggaca cgctcaccgc gcgttaccgt 

151 gaaatatacg ccgtcggcgt atcgctgggc ggcaacgcgc cggcaaaata 

2 01 tttgggcgaa cagggcaaaa aggcattgcc gcacgcctcg gccgccgtat 

2 51 ccgcccccgt tgatgcagag gcggcaggca gccgcttcga cagcggcatc 

3 01 acgcggctgc tctacacgcg ctacttcctc cgcacactga tacccaaagc 
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351 acgttcgctc caaggttttc agacggcatt tgccgcaggg tgcaaaacac 

4 01 tgggcgagtt tgacgaccgt ttcaccgcac cgctgcacgg ctttgccgac 

4 51 cggcacgact actaccgcca aacttcctgc aaaccgctgc tcaaacacgt 

501 tgccaaaccg ctgctcctgc tcaatgccgc caacgacccc ttcctgccgc 

551 ccgaagccct gccccgtgca gacgaagcgt ccgaagccgt taccctgttc 

601 caacctgcac acggcgggca cgccggcttt gtcagcagca ccggcggcag 

651 gctgcacctg caatggctgc cgcagaccgt cctgtcctat tttgacagct 

701 tccgcacaaa caggcgttaa 

This corresponds to the amino acid sequence <SEQ ID 1006; ORF 256.ng>: 

g256 -pep 

1 MLAVRNRGWH GAWHFRSCG GVANTAPVFY HLGDTAEIAF ALDTLTARYR 

51 EIYAVGVSLG GNAPAKYLGE QGKKALPHAS AAVSAPVDAE AAGSRFDSGI 

101 TRLLYTRYFL RTLIPKARSL QGFQTAFAAG C KTLGEFDDR FTAPLHGFAD 

151 RHDYYRQTSC KPLLKHVAKP LLLLNAANDP FLPPEALPRA DEASEAVTLF 

201 QPAHGGHAGF VSSTGGRLHL QWLPQTVLSY FDSFRTNRR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1007>: 

ra256 . seq 

1 ATGCTTGCGG TACGCGATCG GGGTTGGCAC 

51 CAGCTGCGGC GGCATTGCCA ACACCGCTCC 

101 ATACCGCCGA AATCGCCTTT ACTTTGGACA 

151 GAAAtATACG CCGTCGGCGT ATCGCTGGGC 

201 TTTGGGCGAA CAGGGCAAAA AGGCATTGCC 

251 CCGCCCCCGT CGATGCAGAG GCGGCAGGCA 

301 ACGCGGCTGC TCTACACGCG CTACTTCCTC 

351 AAAATCGCTC CAAGGTTTTC AGACGGCATT 

401 TGGGCGAGTT TGACGACCGC TTCACCGCAC 

451 CGGCACGACT ACTACCGCCA AACTTCCTGC 

501 TGCCAAACCG CTGCTCCTGC TCAATGCCGT 

551 CCGAAGCCCT GCCCCGCGCA GACGAAGTAT 

601 CAGCCGGCAT ATGGTGGTCA TGTCGGCTTT 

651 GCTGCACCTG CAATGGCTGC CGCAGACCGT 

701 TCCGCACAAA CAGGCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1008; ORF 256>: 

m256 .pep 

1 MLAVRDRGWH GVWHFRSCG GIANTAPVFY XLGDTAEIAF TLDTFAARYR 

51 EIYAVGVSLG GNALAKYLGE QGKKALPQAA AVISAPVDAE AAGRRFDSGI 

101 TRLLYTRYFL RTLI PKAKSL QGFQTAFAAG C KTLGEFDDR FTAPLHGFAD 

151 RHDYYRQTSC KPLLKHVAKP LLLLNAVNDP FLPPEALPRA DEVSEAVTLF 

201 QPAYGGHVGF VSSTGGRLHL QWLPQTVLSY FDSFRTNRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 256 shows 92.9% identity over a 239 aa overlap with a predicted ORF (ORF 256.ng) 
from AT. gonorrhoeae: 

m256/g256 

10 20 30 40 50 60 

MLAVRDRGWHGVWHFRS CGGI ANTAPVFYHLGDTAE I AFTLDTFAARYRE I YAVGVSLG 

UN h Mil MINIMI Mill III II III II III hi ll-ll I IMIMIIMI 

MIAVRNRG WHGAVVHFRS CGGVANTAPVFYHLGDTAE IAFALDTLTAR YRE I YAVGVSLG 
10 20 30 40 50 60 

70 80 90 1O0 110 120 

GNALAKYLGEQGKKALPQAAAVISAPVDAEAAGRRFDSGI TRLLYTRYFLRTLI PKAKSL 

III llilllllllllhhh:|||llillll II II MM MIIIIMIIIMIIMI 

GNAP7VKYLGEQGKKALPHASAAVSAPVDAEAAGSRFDSG I TRLLYTRYFLRTLI PKARSL 
70 80 90 100 110 120 

130 140 150 160 170 .180 

QGFX?TAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAVNDP 

MINI Ml Ml III I II II MINI II I N III III M Ml III III III NNIM II 

QX3FQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAANDP 



GGCGTAGTCG 
GGTGTTCTAC 
CGTTCGCCGC 
GGCAACGCGC 
GCAAGCCGCT 
GACGCTTCGA 
CGCACCCTGA 
TGCCGCAGGG 
CGCTGCACGG 
AAACCGCTGC 
CAACGACCCC 
CCGAAGCCGT 
GTCAGCAGCA 
CCTGTCCTAT 



TCCATTTCCG 
CA.CTtGGCG 
GCGTTACCGT 
TGGCAAAATA 
GCCGTCATCT 
CAGCGGCATC 
TACCCAAAGC 
TGCAAAACAC 
CTTTGCCGAC 
TCAAACACGT 
TTCCTGCCGC 
TACCCTGTTC 
CCGGCGGCAG 
TTCGACAGCT 



m256 .pep 
g256 

m256 -pep 
g256 

m256 .pep 
g256 
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130 140 150 160 170 180 

190 200 210 220 230 240 

m2 5 6 . pep F L P PEALPRADEVS E AVTLFQP A YGGHVG FV S STGGRLHLQWLPQTVLS YFDS FRTNRRX 

IIIMMMII|:||I!IMIII:|| hlllllMIIIIIIII M lllllllllll llll 

9256 FLPPEALPRADEASEAVTLFQPAHGGHAGFVSSTGGRUlI^WLPQTVLSYFDSFimn^ 
190 200 210 220 230 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1009>: 

a256. seq 

1 ATGCTCGCGG TACGCGATCG GGGTTGGAAC 

51 CAGCTGCGGC GGCGTAGCGA ACACCGCCCC 

101 ATACCGCCGA AATTGCCTTT ACTTTGGACA 

151 GAAATATACG CCGTCGGCGT ATCGCTGGGC 

201 TTTGGGCGAA CAGGGCGAAA ACGCGCTGCC 

251 CCGCACCCGT CGATGCAGAG GCGGCAGGCA 

301 ACACGGCTGC TCTACACGCG CTACTTCCTC 

351 ACGGTCGCTC CAAGGTTTTC AGACGGCATT 

401 TGGGCGAGTT TGACGACCGT TTCACCGCAC 

451 CGGCACGACT ACTACCGCCA AACTTCCTGC 

501 TGCCAAACCG CTGCTCCTGC TCAATGCCGT 

551 CCGAAGCGCT GCCCCGCGCA GACGAAGTGT 

601 CAGCCGACAC ACGGTGGTCA TGTCGGCTTT 

651 GCTGCACCTG CAATGGTTGC CGCAGACCGT 

701 TCCGCACAAA CAGGCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1010; ORF 256.a>: 

a256.pep 

1 MLAVRDRGWN GVWHFRSCG GVANTAPVFY HLGDTAEIAF TLDTLAARYR 

51 EIYAVGVSLG GNALAKYLGE QGENALPQAA AVISAPVDAE AAGNRFDSGI 

101 TRLLYTRYFL RTLIPKARSL QGFQTAFAAG CKTLGEFDDR FTAPLHGFAD 

.151 RHDYYRQTSC KPLLKHVAKP LLLLNAVNDP FLPPEALPRA DEVSEAVTLF 

201 QPTHGGHVGF VGSTGGRLHL QWLPQTVLSY FDSFRTNRR* 

m256/a256 95.4% identity in 239 aa overlap 

10 20 30 40 50 60 

m2 5 6 . pep MLAVRDRGWHGVWHFRSCGGIANTAPVFYXLGDTAEIAFTLDTFAARYREIYAVGVSLG 
I I M I I I I I : I I I I I I I I I I I : | I I | | I | | I I I I M I I I I I I I : I I I I 1 I t I I | | I | M 
a2 5 6 MLAVRURGWNGVVVHFRSCGGVANTAPVFYHLGDTAEIAFTLDTLAARYREIYAVGVSLG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 56 . pep GNALAKYLGEQGKKAL PQAAAVI SAPVDAEAAGRRFDSG ITR1LYTRYFLRTLI PKAKS L 

I I I I N I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I | | | | | : I | 
a 2 5 6 GNALAK YLGEQGENAL PQAAAVI SAP VDAEAAGNRFDSG ITRLLYTRYFLRT LI PKARSL 

"70 80 90 100 110 120 

130 140 150 160 170 180 

m2 56 . pep QGFQTAFAAGCKTLGE7DDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAVNDP 
I II 1 1 I I I I I I I I I II I 1 1 1 I I I II II I I II M I M I 1! I II II I I I 1 1 I 1 1 1 | I I I | | | 
a256 QGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCKPLLKHVAKPLLLLNAVNDP 

130 140 150 160 170 180 

190 200 210 220 230 240 

m256 . pep FLP PEALPRADEVSEAVTLFQPA YGGHVG FVSSTGGRLHLQWLPQTVLS YFDS FRTNRRX 

I II I I 1 I I I I I I I I I I I I I I I I :: I I I I I | I : | I I I I I I I I I I I I I I I I I I I I I I I I I I 
a256 FL P PEALPRADE V S E AVT L FQPT HGG HVG FVG ST GGRLHLQW LPQTVL S YFDS FRTNRRX 

190 200 210 220 230 240 



GGCGTAGTCG 
GGTGTTCTAC 
CGCTCGCCGC 
GGCAACGCGC 
GCAAGCCGCC 
ACCGCTTCGA 
CGCACACTGA 
TGCCGCAGGG 
CGCTGCACGG 
AAACCGCTGC 
CAACGACCCC 
CCGAAGCCGT 
GTCGGCAGCA 
CCTGTCCTAT 



TCCATTTCCG 
CACTTGGGCG 
GCGTTACCGT 
TGGCAAAATA 
GCCGTCATCT 
CAGCGGCATC 
TACCCAAAGC 
TGCAAAACAC 
CTTTGCCGAT 
TCAAACACGT 
TTCCTGCCGC 
TACCCTGTTC 
CCGGCGGCAG 
TTCGACAGCT 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 101 1>: 

g256-l.seq 

1 ATGATTTTGA CACCGCCGGA CACGCCCTTT TTCCTCCGCA ACGGCAATGC 
51 CGACACGATT GCCGCCAAAT TCCTGCAACA CCCCGCACCC GCATACCGCC 
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101 GCGAGATGCT TCCCGACAGC ACGGGTAAAA CCAAAACCGC CTACGACTTT 

151 TCAGCAGGCG GCATTTCGCC CGATGCGCCG CTGGTCGTGC TGTTTCACGG 

201 TTTGGAAGGA AGCAGCCGCA GCCATTACGC GGTCGAACTG ATGCTCGCGG 

251 TACGCAATCG GGGTTGGCAC GGCGCAGTCG TCCATTTCCG CAGCTGCGGC 

301 GGCGTAGCGA ACACCGCCCC GGTGTTCTAC CACTTGGGTG ATACCGCCGA 

351 AATCGCCTTT GCTTTGGACA CGCTCACCGC GCGTTACCGT GAAATATACG 

401 CCGTCGGCGT ATCGCTGGGC GGCAACGCGC CGGCAAAATA TTTGGGCGAA 

451 CAGGGCAAAA AGGCATTGCC GCACGCCTCG GCCGCCGTAT CCGCCCCCGT 

501 TGATGCAGAG GCGGCAGGCA GCCGCTTCGA CAGCGGCATC ACGCGGCTGC 

551 TCTACACGCG CTACTTCCTC CGCACACTGA TACCCAAAGC ACGTTCGCTC 

601 CAAGGTTTTC AGACGGCATT TGCCGCAGGG TGCAAAACAC TGGGCGAGTT 

651 TGACGACCGT TTCACCGCAC CGCTGCACGG CTTTGCCGAC CGGCACGACT 

701 ACTACCGCCA AACTTCCTGC AAACCGCTGC TCAAACACGT TGCCAAACCG 

7 51 CTGCTCCTGC TCAATGCCGC CAACGACCCC TTCCTGCCGC CCGAAGCCCT 

801 GCCCCGTGCA GACGAAGCGT CCGAAGCCGT TACCCTGTTC CAACCTGCAC 

851 ACGGCGGGCA CGCCGGCTTT GTCAGCAGCA CCGGCGGCAG GCTGCACCTG 

901 CAATGGCTGC CGCAGACCGT CCTGTCCTAT TTTGACAGCT TCCGCACAAA 

951 CAGGCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1012; ORF 256-1. ng>: 

g2 56-1. pep 

1 MILTPPDTPF FLRNGNADTI AAKFLQHPAP AYRREMLPDS TGKTKTAYDF 

51 SAGGISPDAP LVVLFHGLEG SSRSHYAVEL MLAVRNRGWH GAWHFRSCG 

101 GVANTAPVFY HLGDTAEIAF ALDTLTARYR EIYAVGVSLG GNAPAKYLGE 

151 QGKK ALPHAS AAVSAPVDAE AAGSRFDSGI TRLLYTRYFL RTLIPKARSL 

201 QGFQTAFAAG CKTLGEFDDR FTAPLHGFAD RHDYYRQTSC KPLLKHVAKP 

251 LLLLNAANDP FLPPEALPRA DEASEAVTLF QPAHGGHAGF VSSTGGRLHL 

301 QWLPQTVLSY FDSFRTNRR* 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 101 3>: 

m256-l. soq 

1 ATGATTTTAA CACCGCCGGA 

51 CGACACGATT GCCGCCAAAT 

101 GAGAGCTGCT TCCCGACAGC 

151 TCAGACGGCA TTTCGCCCGA 

201 GGAAGGAAGC AGCCGCAGCC 

251 GCGATCGGGG TTGGCACGGC 

301 ATTGCCAACA CCGCTCCGGT 

351 CGCCTTTACT TTGGACACGT 

401 TCGGCGTATC GCTGGGCGGC 

451 GGCAAAAAGG CATTGCCGCA 

501 TGCAGAGGCG GCAGGCAGAC 

551 ACACGCGCTA CTTCCTCCGC 

601 GGTTTTCAGA CGGCATTTGC 

651 CGACCGCTTC ACCGCACCGC 

701 ACCGCCAAAC TTCCTGCAAA 

751 CTCCTGCTCA ATGCCGTCAA 

801 CCGCGCAGAC GAAGTATCCG 

851 GTGGTCATGT CGGCTTTGTC 

901 TGGCTGCCGC AGACCGTCCT 

951 GCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 1014; ORF 256- 1>: 

m2 56-1 .pop 

1 MILTPPDTPF FLRNGNADTI AAKFLQRPAP AYRRELLPDS TGK7KVAYDF 

51 SDGISPDAPL WLFHGLEGS SRSHYAVELM LAVRDRGWHG WVHFRSCGG 

101 IANTAPVFYH LGDTAEIAFT LDTFAARYRE IYAVGVSLGG NALAKYLGEO 

151 GKKALPQAAA VISAPVDAEA AGRRFDSGIT RLLYTRYFLR TLIPKAKSLQ 

201 GFQTAFAAGC KTLGEFDDRF TAPLHGFADR HDYYRQTSCK PLLKHVAKEL 

251 LLLNAVNDPF LPPEALPRAD EVSEAVTLFQ PAYGGHVGFV SSTGGRLHLQ 

301 WLPQTVLSYF DSFRTNRR* 

m256-l/g256-l 93.1% identity in 319 aa overlap 

10 20 30 40 50 59 

m256-l . pep MILTPPDTPFFLRNGNADTIAAKFLQRPAPAYRRELLPDSTGKTKVAYDFS-DGISPDAP 
I I I II M U I I I I I I I I I I I I I I I I I : I I I i I I I I : j I I I I I I I I :| | | | | i II I I II 
g256-l MILTPPDTPFFLRNGNADTIAAKFLQHPAPAYRREMLPDSTGKTKTAYDFSAGGISPDAP 
10 20 30 40 50 60 



CACGCCCTTT ttcctccgca ACGGCAATGC 
TCCTGCAACG CCCCGCGCCC GCATACCGCC 
ACGGGTAAAA CCAAAGTCGC CTACGACTTT 
TGCGCCGCTG GTCGTGCTGT TTCACGGTTT 
ATTACGCGGT CGAACTGATG CTTGCGGTAC 
GTAGTCGTCC ATTTCCGCAG CTGCGGCGGC 
GTTCTACCAC TTGGGCGATA CCGCCGAAAT 
TCGCCGCGCG TTACCGTGAA ATATACGCCG 
AACGCGCTGG CAAAATATTT GGGCGAACAG 
AGCCGCTGCC GTCATCTCCG CCCCCGTCGA 
GCTTCGACAG CGGCATCACG CGGCTGCTCT 
ACCCTGATAC CCAAAGCAAA ATCGCTCCAA 
CGCAGGGTGC AAAACACTGG GCGAGTTTGA 
TGCACGGCTT TGCCGACCGG CACGACTACT 
CCGCTGCTCA AACACGTTGC CAAACCGCTG 
CGACCCCTTC CTGCCGCCCG AAGCCCTGCC 
AAGCCGTTAC CCTGTTCCAG CCGGCATATG 
AGCAGCACCG GCGGCAGGCT GCACCTGCAA 
GTCCTATTTC GACAGCTTCC GCACAAACAG 



60 70 80 90 100 110 119 

m256-l .pep LWLFHGLEGSSRSHYAVELMLAVRDRGWHGVWHFRSCGGIANTAPVFYHLGDTAEIAF 
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I 1 II I lllll II II IMIIII I It 1:11 II 1:1 It tl ! II !:ll III I lllllt II Mil 
g25 5-1 LWLFHGLEGSSRSHYAVELMLAVRNRGWHGAWHFRSCGGVANTAPVFYHLGDTAEIAF 
"70 80 90 100 110 120 

120 130 140 150 160 170 179 

m256-l . pep TLDTFAARYRETYAVGVSr.GGNALAKYLGEQGKKALPQAAAVISAPVDAEAAGRRFDSGI 
: I I I :: I I I I I I I I I I I 11 I I I I I i I I I I I I I I I I I : I : I :: I I I I I I I I I I I I I I I I 
g25 6-1 ALDTLTARYREI YAVGVSLGGNAPAKYLGEQGKKALPHASAAVSAPVDAFAAGSRFDSGI 

130 140 150 160 170 180 

180 190 200 210 220 230 239 

m256-l .pep T RL L Y TR Y FLRTL I P KA KS LQGFQT A F AAGC KTLG EF DDRFTA PLHG FA DRHD Y YRQTS C 
HIM llllltlll IICtlilMIIII IIMMIIIIIMI Itllllllltllllllll 
g256-l TRLLYTRYFLRTLIPKARSLQGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSC 
190 200 210 220 230 240 

240 250 260 270 280 290 299 

m25 6- 1 . pep KPLLKHVAKPLLLLNAVNDPFLPPEALPRADEVSEAVTLFQPAYGGHVGFVSSTGGRLHL 
I I I II M II Ml II I I Ml II I !l 11 It II MM III MM I I: III: I M M I MM II 
g25 6- 1 KPLLKHVAKPLLLLKAANDPFLPPEALPRADEASEAVTLFQPAHGGHAGFVSSTGGRLHL 
250 260 270 280 290 300 

300 310 319 

m25 6-l.pep QWLPQTVLSYFDSFRTNRRX 

I M M 1 1 Mill 1 1 1 I Ml I 
g2 5 6- 1 QWLPQT VLS Y FDS FRTNRRX 

310 320 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1015>: 

a256-l.seq 

1 ATGATTTTGA CACCGCCGGA CACACCCTTT TTCCTCCGCA ACGGCAATGC 

51 CGACACGATT GCCGCCAAAT TCCTGCAACG CTCCGCACCT GCATACCGCC 

101 GCGAGCTGCT TCCCGACAGC ACGGGTAAAA CCAAAACCGC CTACGACTTr 

151 TCAGACGGCA TTTCGCCCGA T3CGCCGCTG GTCGTGCTGT TTCACGGTTT 

201 GGAGGGCGGC AGTGGCAGCC ATTACGCGGT CGAACTGATG CTCGCGGTAC 

251 GCGATCGGGG TTGGAACGGC GTAGTCGTCC ATTTCCGCAG CTGCGGCGGC 

301 GTAGCGAACA CCGCCCCGGT GTTCTACCAC TTGGGCGATA CCGCCGAAAT 

351 TGCCTTTACT TTGGACACGC TCGCCGCGCG TTACCGTGAA ATATACGCCG 

401 TCGGCGTATC GCTGGGCGGC AACGCGCTGG CAAAATATTT GGGCGAACAG 

451 GGCGAAAACG CGCTGCCGCA AGCCGCCGCC GTCATCTCCG CACCCGTCGA 

501 TGCAGAGGCG GCAGGCAACC GCTTCGACAG CGGCATCACA CGGCTGCTCT 

551 ACACGCGCTA CTTCCTCCGC ACACTGATAC CCAAAGCACG GTCGCTCCAA 

601 GGTTTTCAGA CGGCATTTGC CGCAGGGTGC AAAACACTGG GCGAGTTTGA 

651 CGACCGTTTC ACCGCACCGC TGCACGGCTT TGCCGATCGG CACGACTACT 

701 ACCGCCAAAC TTCCTGCAAA CCGCTGCTCA AACACGTTGC CAAACCGCTG 

751 CTCCTGCTCA ATGCCGTCAA CGACCCCTTC CTGCCGCCCG AAGCGCTGCC 

801 CCGCGCAGAC GAAGTGTCCG AAGCCGTTAC CCTGTTCCAG CCGACACACG 

851 GTGGTCATGT CGGCTTTGTC GGCAGCACCG GCGGCAGGCT GCACCTGCAA 

901 TGGTTGCCGC AGACCGTCCT GTCCTATTTC GACAGCTTCC GCACAAACAG 

951 GCGTTAA 

This corresponds to the amino acid sequence <SEQ ED 1016; ORF 256-1. a>: 

a2 56-1. pep 

1 MILTPPDTPF FLRNGNADTI AAKFLQRSAP AYRRELLPDS TGKTKTAYDF 

51 SDGISPDAPL VVLFHGLEGG SGSHYAVELM LAVRDRGWNG WVHFRSCGG 

101 VANTAPVFYH LGDTAEIAFT LDTLAARYRE IYAVGVSLGG NALAKYLGEQ 

151 GENALPQAAA VISAPVDAEA AGNRFDSGIT RLLYTRYFLR TLIPKARSLQ 

201 GFQTAFAAGC KTLGEFDDRF TAPLHGFADR HDYYRQTSCK PLLKHVAKPL 

251 LLLNAVNDPF LPPEALPRAD EVSEAVTLFQ PTHGGHVGFV GSTGGRLHLO 

301 WLPQXVLSYF DSFRTNRR* 

a256-l/m256-l 95.6% identity in 318 aa overlap 

10 20 30 40 50 60 

a256- 1 . pep MILTPPDTPFFLRNGNADTI AAKFLQRSAPAYRRELLPDSTGKTKTAYDF3DGI SPDAPL 
M ! I I I I I I M M M I M M I M M I I 1 1 I I I I I I I I I 1 1 I 1 1 1 : I I I I I 1 1 I I I I I I I 
m256-l MILTPPDTPFFLRNGNADTIAAKFLQRPAPAYRRELLPDSTGKTKVAYDFSDGISPDAPL 

10 20 30 40 50 60 

70 80 90 100 110 120 

a256-l .pep WLFHGLEGGSGSHYAVELMLAVRDRGWNGVVVHFRSCGGVANTAPVFYHLGDTAEIAFT 
MlilllilM IMIIIMMMIIIIMMMlllllhliMIMMIMMIMM 
m256-l VVLFHGLEGSSRSHYAVELM^AVRDRGWHGWVHFRSCGGIANTAPVFYHLGDTAEIAFT 
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70 80 90 100 110 120 

130 140 150 160 170 180 

a256-l . pep LDTLAARYREI YAVGVSLGGNALAKYLGEQGENALPQAAAVISAPVDAEAAGNRFDSGIT 
I I I: I I I I I I I I I I I I I I I I I I I I I I I I I i I :: I I I I I I I I I I I ! I ! I I I I I I I I I I I I 
m256-l LDTFAARYREIYAVGVSLGGNALAKYLGEQGKKALPQAAAVISAPVDAEAAGRRFDSGIT 

130 140 150 160 170 180 

190 200 210 220 230 240 

a256-l.pep RLL YT RY FLRTL I PKARS LQC FQT A FAAGC KT LGEFDDRFTAPLHG F ADRH D Y Y ROT S C K 
I II I I I I I I I I I I I I I ! I I I I i I I I I I I I I I I I I I I I II I I I I ! I I I I I I I I II I I I I I t 
m256~ 1 RLLYTRYFLRTLIPKAKSLQGFQTAFAAGCKTLGEFDDRFTAPLHGFADRHDYYRQTSCK 

190 200 210 220 230 240 

250 260 270 280 290 300 

a25 6- 1 . pep PLLKHVAKPLLLLNAVNDPFLPPEALPRADEVSEAVTLFQPTHGGHVGFVGSTGGRLHLQ 
I I I I I I t i I I II I I I i i I I I I I I I II I! I I I I i I I I I I I I I :: I I I I I I I : I I I II I I I I 
m256-l PLLKHVAKPLLLLNAVNDPFLPPEALPRADEVSEAVTLFQPAYGGHVGFVSSTGGRLHLQ 

250 260 270 280 290 300 

310 319 
a25o-l .pep KLPQTVLSYFDSFRTNRRX 
I I I II Mill II II lllll 
m255-l WLPQTVLSYFDSFRTNRRX 

310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1017>: 

g257 . seq 

1 atgggcaggc atttcgggcg cagacgtttt ctgacggctg ccgccgttgc 

51 tgtggccggt gcggcggttt cttttttgcc gaatcctttt gccgccggcg 

101 gcgaaaaacg caacatggat aaaaaacgcg atgaaaatgt gtttttctgg 

151 aaaggtgtcg cgctgggttc cggcgcggag ctgcgcctgt tcggcgtgga 

201 cgacagacag gcggcggatt tggtcaataa ggttttggcg gaagtggcgc 

251 gtttggaaaa aatgttcagc ctttaccgtg aagacagcct gatcagccgt 

301 ctgaaccgcg acggttatct gacttcgcct ccggcggatt ttttggaact 

3 51 gttgagcctg gccgcgatat tcacgcgctg a 

This corresponds to the amino acid sequence <SEQ ED 1018; ORF 257.ng>: 

g257 -pep 

1 MGRHFGRRRF LTAAAVAVAG AA VSFLPNPF AAGGEKRNMD KKRDENVFFW 

51 KGVALGSGAE LRIjFGVDDRQ AADLVNKVLA EVARLEKMFS LYREDSLISR 

101 LNRDGYLTSP PADFLELLSL AAIFTR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 101 9>: 
m257 . seq 

1 ATGGGCAGGC ATTTCGGGCG . CAGCGTTTT CTGACGGTTG CCGCCGTTGC 

51 GGCGGGGaC. GCGGcGGTTT CTTTCCTGCC GAATCCTTTT GCCGCCGATG 

101 ATGAAAAACG CAAcGGGGAT GAAAAACGCA ATGAAAATGT GTTTTTCTGG 

151 AAAGGTGTCG CACTGGGTTC CGGTGCGGa . CTCCGTCTGT TCGGTGTGGA 

201 CGACAGGCGT GCGGCGGATT TGGTCAACAA GGTTTTGGCG GAAGTGGCGC 

251 GTTTGGAAAA ATTGTTCAGC CTTTACCGTG AAGACAGCCT GATCAGCCGC 

301 CTGAACAGGG ACGGTTATCT GACTTCGCCG TCGGCGGATT TTTTGGAACT 

351 GkTGAGCCTG GCCGCGATAT TCACGCkCTG A 

This corresponds to the amino acid sequence <SEQ ID 1 020; ORF 25 7>: 

ra257.pep 

1 MGRHFGXQRF LTVAAVAAGX AA VSFLPNPF AADDEKRNGD EKRNENVFFW 

51 KGVALGSGAX LRLFGVDDRR AADLVNKVLA EVARLEKLFS LYREDSLISR 

101 LNRDGYLTSP SADFLELXSL AAIFTX* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 257 shows 88.0% identity over a 125 aa overlap with a predicted ORF (ORF 257.ng) 
from K gonorrhoeae: 

m257/g257 

10 20 30 40 50 60 
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m257 -pep MGRHFGRQRFLTVAAVAAGTAAVS FLPN PFAADDEKRNGDEKRNENVFFWKGVALGSGAD 

llllllhlllhllll:: llllllllllll I I I I hlhMMIMIIIIIIIh 
g2 5 7 MGRHFGRRRFLTAAAVAVAGAAVS FLPNPFAAGGEKRNMDKKRDENVFFWKGVALGSGAE 

10 20 30 40 50 60 



70 80 90 100 110 120 

m2 5 7 . pep LRLFGVDDRRAADLVNKVLAEVARLEKLFS LYREDS L I SRLNRDGYLTS P SADFLELXS L 

iiiimihiMiiimmiiiiMtiimiiiiiiiiiMiiii nun n 

g257 LRLFGVDDRQAADLVMKVLAE VARLEKMFS LYRE DS L I SRLNRD GYLTS P PADFLE LLS L 

70 80 90 100 110 120 



m257.pep AAIFTXX 
Mill I 

g257 AAIFTRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 102 1>: 

a257 . seq 

1 ATGGGCAGGC ATTTCGGGCG CAGGCGTTTT TTGACAGTTG CCGCCGTTGC 

51 GGCGGCGGGC GCGGCGGTTT CTTTCCTGCC GAATCCTTTT GCCGCCGATG 

101 ATGAAAAACG CAATAAAGAT GAAAAACGCA ATGAAAATGT GTTTTTCTGG 

151 AAAGGTGTCG CACTGGGTTC CGGTGCGGAG CTCCGTCTGT TCGGTGTGGA 

201 CGACAGGCGT GCGGCGGATT TGGTCAACAA GGTTTTGGCG GAAGTGGCGC 

251 GTTTGGAAAA AATGTTCAGC CTTTACCGTG AAGACAGCCT GATCAGCCGT 

301 CTGAACCGTG ACGGTTATTT GACTTCGCCG CCGGCGGATT TTTTGGAACT 

351 GTTGAGCCTG GCCGTGATAT TCACGCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 1022; ORF 257.a>: 

a257.pep 

1 MGRHFGRRRF LTVAAVAAAG A AVSFLPNPF AADDEKRNKD EKRNENVFFW 

51 KG VALGSGAE LRLFGVDDRR AADLVNKVLA EVARLEKMFS LYREDSLISR 

101 LNRDGYLTSP PADFLELLSL AVIFTR* 

m257/a257 92.0% identity in 125 aa overlap 

10 20 30 40 50 

60 

m257 .pep 

MGRHFGXQRFLTVAAVAAGXAAVSFLPNPFAADDEKRNGDEKRNENVFFWKGVALGSGAX 

I I I I I I : I 1 I I I I I I I I: I I I 1 I I I 1 I 1 I I I I I I I I 

I I I I I I I I I I I I I I I I I I I I 
a257 

MGRH FGRRRFLTVAAVAAAGAAVS FL PN P FAADDEKRNKDEKRNEN V FFW KG VALGSGAE 

10 20 30 , 40 50 

60 

70 80 90 100 110 

120 

m257 .pep 

LRLFGVDDRRAADLVNKVLAEVARLEKLFSLYREDSLISRLNRDGYLTSPSADFLELXSL 

i I I I M I I I I I I I I I I I I I 1 I I I I I I I : I I I I I I i I i I I I I I I I I | I I I I I I I I I I 

I I 

a257 

LRLFGVDDRRAADLVNKVLAEVARLEKMFSLYREDSLISRLNRDGYLTSPPADFLELLSL 

70 80 90 100 110 

120 



m257.pep AAIFTXX 
1:111 

a257 AVIFTRX 
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The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 1023>: 

g258 . seq 

1 atgcgccgct tcctaccgat cgcagccata tgcgccgtcg tcctgctgta 

51 cggattgacg gcggcgaccg gcagcaccag ttcgctggcg gattatttct 

101 ggtggatagt ctcgttcagc gcaatgctgc tgctggtgtt gtccgccgtt 

151 ttggcacgtt atgtcatatt gctgttgaaa gacaggcgca acggcgtgtt 

201 cggttcgcag attgccaaac gcctttccgg gatgttcacg ctggtcgccg 

251 tactgcccgg cttgttcctg ttcggcattc ccgcgcagtt tatcaacggc 

301 acgattaatt cgtggttcgg caacgacacc cacgaagccc tcgaacgcag 

351 ccttaatttg agcaagtccg cactggattt ggcggcacac aatgccgtca 

401 gcaacgccgt tcccgtacag atagacctca tcggcaccgc ctccctgtcg 

451 ggcaatiatgg gcagtgtgct ggaacactac gccggcagcg gttttgccca 

501 gcttgccctg tacaatgccg caagcgggaa aatcgaaaaa agcatcaatc 

551 cgcaccaatt cgaccagccg cttcccgaca aagaacattg ggaacagatt 

601 cagcagaccg gttcggttcg gagtttggaa agcataggcg gcgtattgta 

651 cgcgcaggga tggttgtcgg caggtacgca caacgggcgc gattacgcgc 

701 tgttcttccg ccagccgatt cccgaaaatg tggcacagga tgccgttctg 

751 attgaaaagg cgcgggcgaa atatgccgaa ttgagttaca gcaaaaaagg 

801 tttgcagacc ttttttctgg taaccctgct gattgcctcg ctgctgtcga 

851 tttttcttgc gctggtaatg gcactgtatt ttgcccgccg tttcgtcgaa 

901 cccattctgt cgcttgccga gggcgcaaag gcggtggcgc agggtgattt 

951 cagccagacg cgccccgtat tgcgcaacga cgagttcgga cgtttgacca 

1001 agctgttcaa ccatatgacc gagcagcttt ccatcgccaa agaagcagac 

1051 gaacgcaacc gccggcgcga ggaagccgcc cgtcactacc tcgagtgcgt 

1101 gttggatggg ttgactaccg gtgtggtggt ctcntacccc ctctcttgtt 

1151 gccgtaccgc ggtgttttcc acttgtcatt cctcccctct ttcttatttc 

1201 taa 

This corresponds to the amino acid sequence <SEQ ID 1024; ORF 258.ng>: 

g258.pep 

1 MRRFLPIAAI CAWLLYGLT AATGSTSSLA DYFWWIVSFS AMLLLVLSAV 

51 LARYVILLLK DRRNGVFGSQ IAKRLS GMFT LVAVLPGLFL FGI SAQFING 

101 TINSWFGND? HEALERSLNL SKSALDLAAD NAVSNAVPVQ IDLIGTASLS 

151 GNMGSVLEHY AGSGFAQLAL YNAASGKIEK SINPHQFDQP LPDKEHWEQI 

201 QQTGSVRSLE SIGGVLYAQG WLSAGTHNGR DYALFFRQPI PENVAQDAVL 

251 IEKARAKYAE LSYSKKGLQT FFLVT LLIAS LLSIFLALVM AL YFARRFVE 

301 PILSLAEGAK AVAQGDFSQT RPVLRNDEFG RLTKLFNHMT EQLSIAKEAD 

351 ERNRRREEAA RHYLECVLDG LTTGVWSYP LSCCRTAVFS TCHSSPLSYF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1025>: 

m258. seq 

1 ATGCGCCGT7 TTCTACCGAT CGCAGCCATA TGCGCCGTCG TCCTGTTGTA 

51 CGGACTGACG GCGGCAACCG GCAGCACCAG TTCGCTGGCG GATTATTTCT 

101 GGTGGATTGT TGCGTTCAGC GCAATGCTGC TGCTGGTGTT GTCCGCCGTT 

151 TTGGCACGTT ATGTCATATT GCTGTTGAAA GACAG GCGCG ACGGCGTATT 

201 CGGTTCGCAG ATTGCCAAAC GCCTTTCTGG GATGTTTACG CTGGTTGCCG 

251 TACTGCCCGG CGTGTTTCTG TTCGGCGTTT CCGCACAGTT CATCAACGGC 

301 ACGATTAATT CGTGGTTCGG CAACGATACC CACGAGGCGC TTGAACGCAG 

351 CCTCAATTTG AGCAAGTCCG CATTGAATTT GGCGGCAGAC AACGCCCTCG 

401 GCAACGCCGT CCCCGTGCAG ATAGACCTCA TCGGCGCGGC TTCCCTGCCC 

451 GGGGATATGG GCAGGGTGCT GGAACATTAC GCCGGCAGCG GTTTTGCCCA 

501 GCTTGCCCTG TACAATGCCG CAAGCGGCAA AATCGAAAAA AGCATCAACC 

551 CGCACAAGCT CGATCAGCCG TTTCCAGGTA AGGCGCGTTG GGAAAAAATC 

601 CAACGGGCGG GTTCGGTCAG GGATTTGGAA AGCATAGGCG GCGTATTGTA 

651 CGCGCAGGGC TGGCTGTCGG CGGGTACGCA CAACGGGCGC GATTACGCCT 

701 TGTTTTTCCG TCAGCCGGTT CCCAAAGGCG TGGCAGAGGA TGCCGTCTTA 

751 ATGGAAAAGG CAAGGGCGAA ATATGCTGAG TTGAGTTACA GCAAAAAAGG 

801 TTTGCAGACC TTTTTCCTGG CAACCCTGCT GATTGCCTCG CTGCTGTCGA 

851 TTTTTCTTGC ACTGGTCATG GCACTGTATT TCGCCCGCCG TTTCGTCGAA 

901 CCCGTCCTAT CGCTTGCCGA GGGGGCGAAG GCGGTGGCGC AAGGCGATTT 

951 CAGCCAGACG CGCCCCGTGT TGCGCAACGA CGAGTTCGGA CGCTTGACCA 
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1001 AGTTGTTCAA CCACATGACC GAGCAGCTTT CCATCGCCAA AGAAGCAGAC 

1051 GAGCGCAACC GCCGGCGCGA GGAAGCCGCC AGGCATTATC TTGAATGCGT 

1101 GTTGGAGGGG CTGACCACGG GCGTGGTGGT GTTTGACGAA CAAGGCTGTC 

1151 TGAAAACsTT CAACAAAGCG GCGGAACAGA TTyTGGGGAT GCCGCTTACC 

1201 CCCcTGTGGG GCAGCAGCCG GCACGGTTGG CACGGCGTTT CGGCGCAGCA 

1251 GTCCCTGCTT GCCGAAGTGT TTGCCGCCAT CGGCGCGGCG GCAGGTACGG 

1301 ACAAACCGGT CCATGTGAAA TATGCCGCGC CGGACGATGC CAAAATCCTG 

1351 CTGGGCAAGG CAACCGTCCT GCCCGAAGAC AACGGCAACg GCGTGGTAAT 

1401 GGTGATTGAC GACATCACCG TTTTGATACA CGCGCAAAAA GAAGCCGCGT 

1451 GGGGCGAAgT GGCGaAgCGG CTGGCACACG AAATCCGCAA TCCGCTCACG 

1501 CCCATCCAGC TTTCCGCCGA ACgGsTGGCG T kGAAATTG G GCGGGAAGCT 

1551 GGATGAGCAO GATGCGCAAA TCCTGACGCG TTCGACCGAC ACCATCGTCA 

1601 AACAGGTGGC GGCATTGAAG GAAATGGTCG AAGCATTCCG CAATTATGCG 

1651 CGTTCCCCTT CGCTCAAATT GGAAAATCAG GATTTGAACG CCTTAATCGG 

1701 CGATGTGTTG GCATTGTATG AAGCCGGTCC GTGCCGGTTT GCGGCGGACT 

1751 TGCCGGCGAA CCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 1026; ORF 25 8>: 

m258 -pep 

1 MRRFLPIAAI CAWLLYGLT AATGSTSSLA DYFWWIVAFS AM LLLVLSAV 

51 LARYVILLLK DRRDGVFGSQ IAKRL SGMFT LVAVLPGVFL FGV SAQFING 

101 TINSWFGNDT HEALERS LNL SKSALNLAAD NALGNAVPVQ IDLIGAASLP 

151 GDMGRVLEHY AGSGFAQLAL YNAASGKIEK SINPHKLDQP FPGKARWEKr 

201 QRAGSVRDLE SIGGVLYAQG WLSAGTHNGR DYALFFRQPV PKGVAEDAVL 

251 IEKARAKYAE LSYSKKGLQT FFLAT LLIAS LLSIFLALVM AL YFARRFVE 

301 PVLSLAEGAK AVAQGDFSQT RPVLRNDBFG RLTKLFNHMT EQLSIAKEAD 

351 ERNRRREEAA RHYLECVLEG LTTGVWFDE QGCLKTFNKA AEQILGMPLT 

401 PLWGSSRHGW HGVSAQQSLL AEVFAAIGAA AGTDKPVHVK YAAPDDAKIL 

451 LGKATVLPED NGNGWMVID DITVLIHAQK EAAWGEVAKR LAHEIRNPLT 

501 PIQLSAERXA XKLGGKLDEQ DAQILTRSTD TIVKQVAALK EMVEAFRNYA 

551 RSPSLKLENQ DLNALIGDVL ALYEAGPCRF AADLPANR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 258 shows 90.9% identity over a 386 aa overlap with a predicted ORF (ORF 258.ng) 
from N. gonorrhoeae: 

m258/g258 

10 20 30 40 50 60 

MRRFLPIAAI CAWLLYGLTAATGSTSSLADYFWWIVAFSAMLLLVLSAVLARYVILLLK 

II Mil III III III II IIIIIIIIMI III lllllhl III HUM IIM1IIIII 1 1 

MRRFLPIAAI CAVVLLYGLTAATGSTSSLADYFTfWIVSFSAMLLIiVLSAVLARYVILZiliK 
10 20 30 40 50 60 

70 80 90 100 110 120 

DRRDGVFGSQ I AKRLSGMFTLVAVLPGVFLFGVSAQF I NGTI NSWFGNDTHEALERSLNL 

M hi I III II MM I III II II I! I|:|ll hi II I !i III IN 1 1 1 III II lllllll 

DRRNGVFGSOIAKRLSGMFTLVAVLPGLFLFGISAOFINGTINSWFGNDTHEALERSLNL 
70 80 90 100 110 120 



m258 -pep 
g258 

m258 . pep 
g258 



130 140 150 160 170 180 

m2 5 8 . pep S KS ALNLAADNAIX3NAVPVQ I DLIGAAS LPGDMGRVLEH YAGS G FAQLAL YNAAS GKI E K 

MllhllMlhMIIIMIIIIhlll MM llllllllllllllll lllllllll 

g258 SKSALDLAADNAVSNAVPVQIDLIGTASLSGNMGSVLEHYAGSGFAQLALYNAASGKIEK 
130 140 150 160 170 180 

190 200 210 220 230 240 

m2 5 8 . pep SINPHKLIX)PFPGKARWEKIQRAGSVRDLESIGGVLYAQGWjSAGTHNGRDYALFFRQPV 

MMMMIMI | :||:|h:||||:||MIIIIIIIIIIMIIIIIIIIIIIIII|: 
g2 5 8 S INPHQFDOPLPDKEHWEOI OOTGSVRSLES I GGVLYAQGWLS AGTHNGRDYALFFRQP I 

190 2O0 210 220 230 240 
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250 260 270 280 290 300 

PKGVAEDAVLI EKAJUUTCAELS YSKKGLQT^ 

l-lhlll IIIIIIMIIIIIIIIIII III Ihlll IIIIIIIIIIIIIIIIIIIHM 

PENVAQD AVL I EKARAKYAELS YSKKGLQTFFLVTLLI ASLLS I FLALVMAL Y FARRFVE 
250 260 270 280 290 300 

310 320 330 340 350 360 

PVLSLAEGAKAVAQGDFSQTOPVLRNDEFGRLTKLFNHMTEQLSIAKEADERNRRREEAA 

II I IIIIMI II I llllillll III III II I II III HIM 1 1| I III HIM! 

PILSLAEGAKAVAQGDFSQTRPVLiyroEFGRLTKLFNHMTEQLSIAKEADERNIU^EEAA 
310 320 330 340 350 360 

370 380 390 400 410 420 

RHYLECVLEGLTTGVWFDEQGCLKTFNKAAEQ^ 
MINIM: II HUM :| :\ 

RHYLECVLDGLTTGVWS YPLSCCRTAVFSTCHSS PLS YFX 
370 380 390 400 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1027>: 

a258.seq 

1 ATGCGCCGTT TTCTACCGAT CGCAGCCATA TGCGCCGTCG TCCTGTTGTA 

51 CGGACTGACG GCGGCAACCG GCAGCACCAG TTCGCTGGCG GATTATTTCT 

101 GGTGGATTGT TGCGTTCAGC GCAATGCTGC TGCTGGTGTT GTCCGCCGTT 

151 TTGGCACGTT ATG7CATATT GCTGTTGAAA GACAGGCGCG ACGGCGTATT 

201 CGGTTCGCAG ATTGCCAAAC GCCTTTCCGG GATGTTTACG CTGGTTGCCG 

251 TACTGCCCGG CGTGTTTCTG TTCGGCGTTT CCGCACAGTT TATCAACGGC 

301 ACGATTAATT CGTGGTTCGG CAACGATACC CACGAGGCGC TTGAACGCAG 

351 CCTCAATTTG AGCAAGTCCG CATTGAATCT GGCGGCAGAC AACGCCCTTG 

401 GCAACGCCAT CCCCGTGCAG ATAGACCTCA TCGGCGCGGC TTCCCTGCCC 

451 GGGGATATGG GCAGGGTGCT GGAACATTAC GCCGGCAGCG GTTTTGCCCA 

501 GCTTGCCCTG TACAATGCCG CAAGCGGCAA AATCGAAAAA AGCATCAACC 

551 CGCACAAGCT CGATCAGCCG TTTCCAGGTA AGGCGCGTTG GGAAAAAATC 

601 CAACAGGCGG GTTCGGTCAG GGATTTGGAA AGCATAGGCG GCGTATTGTA 

651 CGCGCAGGGC TGGCTGTCGG CAGGTACGCA CAACGGGCGC GATTACGCCT 

701 TGTTTTTCCG TCAGCCGGTT CCCAAAGGCG TGGCAGAGGA TGCCGTCTTA 

751 ATCGAAAAGG CAAGGGCGAA ATATGCTGAG TTGAGTTACA GCAAAAAAGG 

801 TTTGCAGACC TTTTTCCTGG CAACCCTGCT GAT7GCCTCG CTGCTGTCGA 

851 TTTTTCTTGC ACTGGTCATG GCACTGTATT TCGCCCGCCG TTTCGTCGAA 

901 CCCGTCCTAT CGCTTGCCGA GGGGGCGAAG GCGGTGGCGC AAGGCGATTT 

951 CAGCCAGACG CGCCCCGTGT TGCGCAACGA CGAGTTCGGA CGCTTGACCA 

1001 AGTTGTTCAA CCACATGACC GAGCAGCTTT CCATCGCCAA AGAAGCAGAC 

1051 GAGCGCAACC GCCGGCGCGA GGAAGCCGCC AGACATTATC TCGAATGCGT 

1101 GTTGGAGGGG CTGACCACGG GCGTGGTGGT GTTTGACGAA CAAGGCTGTC 

1151 TGAAAACCTT CAACAAAGCG GCGGAACAGA TTTTGGGGAT GCCGCTTACC 

1201 CCCCTGTGGG GCAGCAGCCG GCACGGTTGG CACGGCGTTT CGGCGCAGCA 

1251 GTCCCTGCTT GCCGAAGTGT TTGCCGCCAT CGGCGCGGCG GCAGGTACGG 

1301 ACAAACCGGT CCATGTGAAA TATGCCGCGC CGGACGATGC CAAAATCCTG 

1351 CTGGGCAAGG CAACCGTCCT GCCCGAAGAC AACGGCAACG GCGTGGTAAT 

1401 GGTGATTGAC GACATCACCG TTTTGATACA CGCGCAAAAA GAAGCCGCGT 

1451 GGGGCGAAGT GGCAAAACGG CTGGCACACG AAATCCGCAA TCCGCTCACG 

1501 CCCATCCAGC TTTCTGCCGA ACGGCTGGCG TGGAAATTGG GCGGGAAGCT 

1551 GGACGAGCAG GACGCGCAAA TCCTGACACG TTCGACCGAC ACCATCATCA 

1601 AACAAGTGGC GGCATTAAAA GAAATGGTCG AGGCATTCCG CAATTACGCG 

1651 CGTTCCCCTT CGCTCAAATT GGAAAATCAG GATTTGAACG CCTTAATCGG 

1701 CGATGTGTTG GCATTGTACG AAGCTGGTCC GTGCCGGTTT GCGGCGGAAC 

1751 TTGCCGGCGA ACCGCTGATG ATGGCGGCGG ATACGACCGC CATGCGGCAG 

1801 GTGCTGCACA ATATTTTCAA AAATGCCGCC GAAGCGGCGG AAGAAGCCGA 

1851 TGTGCCCGAA GTCAGGGTAA AATCGGAAGC GGGGCAGGAC GGACGGATTG 

1901 TCCTGACAGT TTGCGACAAC GGCAAGGGGT TCGGCAGGGA AATGCTGCAC 

1951 AATGCCTTCG AGCCGTATGT AACGGACAAA CCGGCTGGAA CGGGATTGGG 

2001 ACTGCCCGTG GTGAAAAAAA TCATTGAAGA ACACGGCGGC CGCATCAGCC 

2051 TGAGCAATCA GGATGCGGGC GGCGCGTGTG TCAGAATCAT CTTGCCAAAA 

2101 ACGGTAGAAA CTTATGCGTA G 



m258 -pep 
g258 

m258 .pep 
9258 

m258,pep 
9258 
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This corresponds to the amino acid sequence <SEQ ID 1028; ORF 258. a>: 



a258. pep 

1 


MRRFLPIAAI 


CAWLLYGLT 


AATGSTSSLA 


DYFWWIVAFS 


AMLLLVLSAV 


51 


LARYVILLLK 


DRRDGVFG3Q 


IAKRLSGMFT 


LVAVLPGVFL 


FGVSAQFING 


101 


TINSWFGNDT 


HEALER SLNL 


SKSALNLAAD 


NALGNAIPVQ 


IDLIGAASLP 


151 


GDMGRVLEHY 


AGSGFAQLAL 


YNAASGKIEK 


SINPHKLDQP 


FPGKARWEKI 


201 


QQAGSVRDLE 


SIGGVLYAQG 


WLSAGTHNGR 


DYALFFRQPV 


PKGVAEDAVL 


251 


IEKARAKYAE 


LSYSKKGLQT 


FFLATLLIAS 


LLSIFLALVM 


ALYFARRFVE 


301 


PVLSLAEGAK 


AVAQGDFSQT 


RPVLRNDEFG 


RLTKLFNHMT 


EQLSIAKEAD 


351 


ERNRRREEAA 


RHYLECVLEG 


LTTGWVFDE 


QGCLKTFNKA 


AEQILGMPLT 


401 


PLWGSSRHGW 


HGVSAQQSLL 


AEVFAAIGAA 


AGTDKPVHVK 


YAAPDDAKIL 


451 


LGKATVLPED 


NGNGWMVID 


DITVLIHAQK 


EAAWGEVAKR 


LAHEIRNPLT 


501 


PIQLSAERLA 


WKLGGKLDEQ 


DAQILTRSTD 


TIIKQVAALK 


EMVEAFRNYA 


551 


RSPSLKLENQ 


DLNALIGDVL 


ALYEAGPCRF 


AAELAGEPLM 


MAADTTAMRQ 


601 


VLHNIFKNAA 


EAAEEADVPE 


VRVKSEAGQD 


GRIVLTVCDN 


GKGFGREMLH 


651 


NAFEPYVTDK 


PAGTGLGLPV 


VKKIIEEHGG 


RISLSNQDAG 


GACVRIILPK 


701 


TVETYA* 











m258/a258 99.0% identity in 584 aa overlap 

10 20 30 40 50 60 

^RFLPIAAICAVVLLYGLTAATGSTSSLADYFWWIVAFSAMLLLVLSAVLARYVILLLK 
I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I 
MRRFLPIAAICAWLLYGLTAATGSTSSLADYFWWIVAFSAMLLLVLSAVLARYVrLLLK 
10 20 30 40 50 60 

70 80 90 100 110 120 

DRRDGVFGSQIAKRLSGMFTLVAVLPGVFLFGVSAQFINGTINSWFGNDTHEALERSLNL 
I I I I I I I I I I I II I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
DRRDGVFGSQIAKRL3GMFTLVAVLPGVFLFGVSAQFINGTINSWFGNDTHEALERSLNL 
70 80 90 100 110 120 

130 140 150 160 170 180 

SKSALNLAADNALGNAVPVQIDLIGAASLPGDMGRVLEHYAGSGFAQLAL YNAASGKIEK 
I II I I I I I I I I I I I I I: I I I ! I I I I I I I II I I I I I I I I I I I | I I I | I | | I | | | | | || | | | 
SKSALNLAADNALGNAIPVQIDLIGAASLPGDMGRVLEHYAGSGFAQLAL YNAASGKIEK 
130 140 150 160 170 180 

190 200 210 220 230 240 

SINPHKLDQPFPGKARWEKIQRAGSVRDLESIGGVLYAOGWLSAGTHNGRDYALFFRQPV 
I II I I I I I I I I I! I I I I I I II :l I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
S IN PHKLDQPFPGKARWEKI QQAGSVRDLE SIGGVLYAQGWLSAGTHNGRDYALFFRQPV 
190 200 210 220 230 240 

250 260 270 280 290 300 

PKGVAEDAVLIEKARAKYAE LSYSKKGLQT FFLATLLIAS LLSIFLALVMALYFARRFVE 
I 1 1 1 I I I I I I I 1 1 I I I I I I I I I I I I I I M I I 1 1 I 1 1 1 1 1 I I I I I I I I I I I I I II I 1 I I I I 
PKGVAEDAVLIEKARAKYAELSYSKKGLQT FFLATLLIAS LLSIFLALVMALYFARRFVE 
250 260 270 280 290 300 

310 320 330 340 350 360 

PVLSLAEGAKAVAQGDFSQTRPVLRNDEFGRLTKLFNHMTEQLSIAKEADERNRRREEAA 
I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I 
PVLSLAEGAKAVAQG D FS QTRPVLRN DE FGRLTKLFNHMTEQLS I AKEADERNRRREE AA 
310 320 330 340 350 360 

370 380 390 400 410 420 

RHYLECVLEGLTTGV\T/FDEQGCLKTFNKAAEQILGMPLTPLWGSSRHGWHGVSAQQSLL 
I II I I I I I I II I I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I I I II I I I I | I I I I I | 
RHYLECVLEGLTTGW^/FDEQGCLKTFNKAAEQILGMPLTPLWGSSRHGWHGVSAQQSLL 
370 380 390 400 410 420 

430 440 450 460 470 480 

AEVFAAIGAAAGTDKPVHVKYAAPDDAKILLGKATVLPEDNGNGWMVIDDITVLIHAQK 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AE V FAAIGAAAGT DK PVH VKYAAPD DAKI LLGKATVLPEDNGNG WMV I DD I TVL I HAQK 



m258.pep 
a258 

m258 .pep 
a258 

m258 .pep 
a258 

m258 .pep 
a258 

ro258.pep 
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m258 .pep 
a258 

m258 .pep 
a258 

m258.pep 
a258 
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430 440 450 4 60 470 480 

490 500 510 520 530 540 

EAAWGEVAKRLAHEIRNPLTPIQLSAERXAXKLGGKLDEQDAQILTRSTDTIVKQVAALK 
I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 
E AAWGE VAK R LAHE I RN P LT P I QL S AE R LAWK LGG KL DE QDAQ I LT R ST DT 1 1 KQ V AA1K 

490 500 510 520 530 540 

550 560 570 580 589 

EMVEAFRNYARSPSLKLENQDLNALIGDVLALYEAGPCRFAADLPANRX 
I I I t II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I It I: I 

EMVEAFRNYARSPSLKLENQDLNALIGDVLALYEAGPCRFAAELAGEPLMMAADTTAMRQ 
550 560 570 580 590 600 

VLHNIFKNAAEAAEEADVPEVRVKSEAGQDGRIVLTVCDNGKGFGREMLHNAFEPYVTDK 
610 620 630 640 650 660 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1029>: 

g259.seq 

1 acgatgatgc 

51 tttgattttc 

101 aagcctatac 

151 gcgctggtgt 

201 ggcagaaaag 

251 ccgtcctgca 

301 gccctgctgg 

351 caaggcggaa 

401 agttggcaaa 

451 ggcgaaacct 

501 tttggaaagg 

551 aaaaacatct 

601 cgcgctcaat 

651 a 

This corresponds to the amino acid sequence <SEQ ID 1030; ORF 259.ng>: 

g259 .pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYTEEL PPLLS ALSAV 

51 ALVWLAWAFV SVR SKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 

151 GET YGRV FAD IFELSAALER RAFKG I LKLT AEYKKHLRRC LPFGNGVGFG 

201 RAQSGVEGNL ENAGKA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 103 1>: 



m259.seq 


(partial) 










1 


ATGATGATGC 


ACGCTTCTGT 


CCAAAGCCGT 


TTCGCACCGA 


TACTTTATGT 


51 


TTTGATTTTC 




TTTTGACCGC 


GCAAATCTGG 


TTCAATCAGA 


101 


AAGCCTATAC 


TGAAGAGCTG 


CCTCCGCTTC 


TGTCCGCATT 


GTCCGCCGTC 


151 


GCGCTGGTGT 


GGCTGGCGTG 


GGCGTTCGTG 


TCGGCGCGTT 


CAAAGGCCAA 


201 


GGCGGAAAAG 


TTCTACCGCG 


AAAAAATGAT 


ACAGAACGAA 


AGCATACACC 


251 


CCGTCsTGCA 


CGCCTCTTTG 


CAACACTTGG 


AACACAAGCC 


GCAAATACTC 


301 


GCCCTGCTGG 


TCAAAAACCA 


CGGCAAAGGG 


ATGGCGGAAC 


AGGTCAGGTT 


351 


CAAGGCGGAA 


GTGCTGCCCG 


ACGACGAAGA 


CGCGCGCACG 


ATTGCCGCCG 


401 


AGTTGGCAAA 


AATGGATATG 


TTCGCATTGG 


GGACkGACGC 


GGTCGCCTCG 


451 


GGCGAAACCT 


ATGGACGCGT 


GTTCGCCGAT 


ATTTTCGAGT 


TGTCGGmGGC 


501 


TTTGGAAGGG 


CGCGCGTTCA 


AAGGAATGTT 


GAAACTGACG 


GCGGAATATA 


551 


AA.AACATCT 


TCGGmGATGC 


CTGCCGTTCG 


GAAACGGCGT 


TGGAGTTGGG 


601 


CGCACTCAAT 


CAGGCGTTGC 


AGGAGATTTC 


AAAAACATCC 


GG. . 



This corresponds to the amino acid sequence <SEQ ID 1032; ORF 25 9>: 

m259.pep (partial) 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYT EEL PPLLS ALSAV 

51 ALVWLAWAFV SARSKAKAEK FYREKMIQNE SIHPVXHASL QHLEHKPQIL 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGTDAVAS 



m258 .pep 
a258 

m258.pep 
a258 

a258 



acgcttctgt ccaaagtcgt 
tttgccggtt ttttgaccgc 
tgaagagctg cctccgcttc 
ggctggcgtg ggcgttcgtg 
ttctaccgcg aaaaaatgat 
cgcttctttg caacacttgg 
tcaaaaacca cggcaaaggc 
gtgctgcccg acgacgaaga 
aatggatatg ttcgcattgg 
atgggcgcgt gttcgccgat 
cgcgcgttca aagggatact 
tcggcgatgc ctgccgttcg 
caggcgttga gggaaatctc 



ttcgcaccga tactttatgt 
gcaaatctgg ttcaatcaga 
tgtccgcatt gtccgccgtc 
tcggtgcgtt caaaggctaa 
acagaacgaa agcatacacc 
aacacaagcc gcaaatgctc 
atggcggaac aggtcaggtt 
cgcgcgcacg attgccgccg 
ggacggacgc ggtcgcctcg 
attttcgagt tgtcggcggc 
gaaactgacg gcggaatata 
gaaacggcgt tggatttggg 
gaaaacgccg gaaaagccta 
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151 GETYGRVFAD I FELSXALEG RAFKGMLKLT AEYKXHLRRC LPFGNGVGVG 
201 RTQSGVAGDF KNIR. . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 259 shows 94.3% identity over a 212 aa overlap with a predicted ORF (ORF 259.ng) 
from N. gonorrhoeae: 

m259/g259 

10 20 30 40 50 60 

m259 .pep MMMHASVQSRFAPILYVIjIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 

Mill II Ml MIIIM III III IIMIM Ml III III II! Illllil Ml MINI II 

g259 MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 
10 20 30 40 50 60 

70 80 90 100 110 120 

m259 .pep SARSKAKAE KFYREKMI QNES I HPVXHASLQHLEHKPQ I LALLVKNHGKGMAEQVRFKAE 

IMMIIIIIIIIIIIiniMMI I M 1 M 1 1 1 1 1 1 ^ 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 1 f i M 
g2 5 9 svrskakaekj^rekmiqnesihpvlhaslqhlehkpqmlai^^ 

70 80 90 100 110 120 

130 140 150 160 170 180 

m259 .pep VL PDDEDARTI AAELAKMDM F ALGTDAVAS G ET YGRV FAD I FE L SXALEGRAFKGMLKLT 

1 1 1 1 F 1 1 i 1 1 1 1 1 1 1 1 J I f f ! f 1 1 1 i 1 1 1 1 ) f 1 1 1 1 1 1 1 1 1 1 f f i III IMIIMMI 

g2 5 9 VLPDDEDARTIAAELAKMDMFALGTDAVASGETYGRVFADIFELSAALERRAFKGI LKLT 

130 140 150 160 170 180 

190 200 210 

m259 .pep AEYKKHLRRCL PFGNGVGVGRTQS GVAGDF KNI R 

Illlllllllilllllll IIMIM |:«:| 
g2 5 9 AEYKKHLRRCLPFGNGVGFGRAOSGVEGNLENAGKAX 
190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1033>: 

a259.seq (partial) 

1 ATGATGATGC ACGCTTCTGT CCAAAGCCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGCGCGTT CAAAGGCTAA 

201 GGCGGAAAAG TTCTACCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCTTCTTTG CAACACTTGG AACACAAGCC GCAAATGCTC 

301 GCCCTGCrGG TCAAAAACCA CGGCAAAGGG ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

401 AGTTGGCAAA AATGGATATG TTTGCATTGG GGACGGACGC GGTCGCCTCG 

4 51 GGCGAAACCT ATGGACGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 

501 TTTGGAAGGG CGCGCGTTCA AAGGAATGTT GAAACTGACG GCGGAATATA 

551 AAAA.CATCT TCGGCGATGC CTGCCGTTCG GAAACGGCGT TGGAGTTGGG 

601 CGCGCTCAAT CAGGCGTTGC AGGAGATTTC AAAAACATCG GAAAAGTCCA 

651 A 

This corresponds to the amino acid sequence <SEQ ID 1034; ORF 259.a>: 

a259.pep (partial) 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYT EEL PP LLSALSAV 

51 ALVW LAW AFV SARSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART I AAELAKMDM FALGTDAVAS 

151 GETYGRVFAD IFELSAALEG RAFKGMLKLT AEYKXHLRRC LPFGNGVGVG 

201 RAQSGVAGDF KNIGKVQ 

m259/a259 98. 1% identity in 21 3 aa overlap 

10 20 30 40 50 60 

m259 . pep MMMHASVQSR FAP I LYVL I FFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 

I I I f 1 1 I I I I I I I f I I I I I I I I I I I I I I I t I I I I I I I I I 1 I I I I I I I 1 I I I I I t I I I I I | 
a259 MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m259 . pep SARSKAKAEKFYREKMIQNESIHPVXHASLQHLEHKPQILALLVKNHGKGMAEQVRFKAE 
I I I I I 1 I I I II I I I I I I I I I I I I I I I I I I I i I I I I I 1 : 1 I I I M I I I I t I I I I I I I I I 1 
a 2 5 9 SARSKAKAEKFYREKMI QNES I H P VLHAS LQHLE HKPQMLALLVKNHGKGMAEQVRFKAE 

70 80 90 100 110 120 

130 140 150 160 170 180 

rc25 9 . pep VLPDDEDARTIAAELAKM»1FALGTDAVASGETYGRVFADIFELSXALEGRAFKGMLKLT 
I ! I I I I ! II I 1 1 I I I I I I I I I I I I II I I I I I I I I II I I I I I I ! I I I I I I I I I I I I I I ! I 
a259 VLPDDEDARTIAAELAKMDMFALGTDAVASGETYGRVFADIFELSAALEGRAFKGMLKLT 

130 140 150 160 170 . 180 

190 200 210 

m259 . pep AEYKXHLRRCLPFGNGVGVGRTQSGVAGDFKNIR 

I I II I I I I I I I I I I I I I I I I I : i I I I I I I II I I 
a2 5 9 AEYKXHLRRCLPFGNGVGVGRAQSGVAGDFKNIGKVQ 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1035>: 

g259-l.seq 

1 ATGATGATGC ACGCTTCTGT CCAAAGTCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGTGCGTT CAAAGGCTAA 

201 GGCAGAAAAG TTCTACCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCTTCTTTG CAACACTTGG AACACAAGCC GCAAATGCTC 

301 GCCCTGCTGG TCAAAAACCA CGGCAAAGGC ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

401 AGTTGGCAAA AATGGATATG TTCGCATTGG GGACGGACGC GGTCGCCTCG 

451 GGCGAAACCT ATGGGCGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 
501 TTTGGAA 

This corresponds to the amino acid sequence <SEQ ID 1036; ORF 259-1 .ng>: 

g259-l.pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYTEEL PPLLSALSAV 

51 ALVWLAWAFV SVRSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGT DAVAS 
151 GETYGRVFAD IFELSAALE 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1037>: 

m259-l.seq 

1 ATGATGATGC ACGCTTCTGT CCAAAGCCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGCGCGTT CAAAGGCCAA 

201 GGCGGAAAAG TTCTACCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCCTCTTTG CAACACTTGG AACACAAGCC GCAAATACTC 

301 GCCCTGCTGG TCAAAAACCA CGGCAAAGGG ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

401 AGTTGGCAAA AATGGATATG TTCGCATTGG GGACGGACGC GGTCGCCTCG 

4 51 GGCGAAACCT ATGGACGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 

501 TTTGGAAGGG CGCGCGTTCA AAGGAATGTT GAAACTGACG GCGGAATATA 

551 AAAACATCTT CGGCGATGCC TGCCGTTCGG AAACGGCGTT GGAGTTGGGC 

601 GCACTCAATC AGGCGTTGCA GGAGATTTCA AAAACATCGG AAAAGTCCAA 
651 ACGGATATTT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 1038; ORF 259-l>: 

m259-l .pop 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYTEEL PPLLSALSAV 

51 ALVWLAWAFV SARSKAKAEK FYREKMI QNE SIHPVLHASL QHLEHKPQIL 

101 ALLVKNHGKG MAEQVRFKAE VLPDDEDART IAAELAKMDM FALGT DAVAS 

151 GETYGRVFAD IFELSAALEG RAFKGMLKLT AEYKNIFGDA CRSETALELG 
201 ALNQALQEIS KTSEK5KRIF Y* 

g259-l/m259-l 98.8% identity in 169 aa overlap 
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10 20 30 40 50 SO 

g259-l.pep MMMHASVQSRFAPILYVLIFFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVVfLAWAFV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I 
m 259-l MMMHASVQSRFAPILYVLI FFAGFLTAQIWFNQKAYTEELPPLL SALS AVAL VW LA WAFV 

10 20 30 40 50 60 



70 80 90 100 110 120 

g259-l . pep SVRSKAKAEKFYREKMIQNESIHPVLHASLQHLEHKPQMLALLVKNHGKGMAEQVRFKAE 
I : 1 1 | I I II I II I I I I I I I I i I I I I I I I I I I I I I I I I I : I I! I I I I I I M I I I I I I I I I I 
m2S9-l SARSKAKAEKFYREKMIQMESIHPVLHASLQHLEHKPQILALLVKNHGKGMAEQVRFKAE 

70 80 90 100 110 120 



130 140 150 160 169 

g259-l.pep VLPDDEDARTI AAELAKMDMFALGTDAVASGET YGRVFADI FELSAALE 
Mil III I I I II II il Mllllll! Ill IIIMMIMI II Ml Hit: 

m259-l VLPDDEDARTI AAELAKMDMFALGTDAVASGETYGRVFAD I FELSAALEGRAFKGMLKLT 

130 140 150 160 170 180 



The following partial DNA sequence was identified in N: meningitidis <SEQ ID 1039>: 

a259-l . seq 

1 ATGATGATGC ACGCTTCTGT CCAAAGCCGT TTCGCACCGA TACTTTATGT 

51 TTTGATTTTC TTTGCCGGTT TTTTGACCGC GCAAATCTGG TTCAATCAGA 

101 AAGCCTATAC TGAAGAGCTG CCTCCGCTTC TGTCCGCATT GTCCGCCGTC 

151 GCGCTGGTGT GGCTGGCGTG GGCGTTCGTG TCGGCGCGTT CAAAGGCTAA 

201 GGCGGAAAAG TTCTAGCGCG AAAAAATGAT ACAGAACGAA AGCATACACC 

251 CCGTCCTGCA CGCTTCTTTG CAACACTTGG AACACAAGCC GCAAATGCTC 

301 GCCCTGCTGG TCAAAAACCA CGGCAAAGGG ATGGCGGAAC AGGTCAGGTT 

351 CAAGGCGGAA GTGCTGCCCG ACGACGAAGA CGCGCGCACG ATTGCCGCCG 

401 AGTTGGCAAA AATGGATATG TTTGCATTGG GGACGGACGC GGTCGCCTCG 

451 GGCGAAACCT ATGGACGCGT GTTCGCCGAT ATTTTCGAGT TGTCGGCGGC 

501 TTTGGAAGGG CGCGCGTTCA AAGGAATGTT GAAACTGACG GCGGAATATA 

551 AAAACATCTT CGGCGATGCC TGCCGTTCGG AAACGGCGTT GGAGTTGGGC 

601 GCGCTCAATC AGGCGTTGCA GGAGATTTCA AAAACATCGG AAAAGTCCAA 

651 ACGGATATTT TATTGA 



This corresponds to the amino acid sequence <SEQ ID 1040; ORF 259-1. a>: 

a259-l.pep 

1 MMMHASVQSR FAPILYVLIF FAGFLTAQIW FNQKAYTEEL PPLLSALSAV 
51 ALVWLAWAFV SARSKAKAEK FYREKMIQNE SIHPVLHASL QHLEHKPQML 
101 ALLVKNHGKG MAEQVRFKAE VLP DDE DART IAAELAKMDM FALGTDAVAS 
151 GETYGRVFAD I FELSAALEG RAFKGMLKLT AEYKNIFGDA CRSETALELG 
201 ALIJQALQEIS KTSEKSKRIF Y* 

a259-l/m259-l 99.5% identity in 221 aa overlap 



10 20 30 40 50 60 

a259- 1 . pep MMMHASVQSRFAPILYVL1 FFAGFLTAQIWFNQKAYTEELPPLLSALSAVALVWLAWAFV 
I I I I I I I I I I I I I I 11 I I I I 1 I I I I I I I II I I I I II I I I I I I I I I I II I I I I I I I I I I I I 
m259- 1 MMMHASVQSRFAPILYVLI FFAGFLTAQ I WFNQKAYTEELP PLLSALS A V ALVWLAWAFV 

10 20 30 40 50 60 

70 80 90 100 110 120 

a259- 1 . pep SARSKAKAEKFYREKMIQNESIHPVIjHASLQHLEUIKPQMIALLVKNHGKGMAEQVRFKAE 
I I I I I I I I I I I I I I II I I I I I II I I I I I I I I 1 II II I I: I I I II I I II II I I I i I I I I II 

m259- 1 sarskakaekfyrekmiqnesihpvlhaslqhlehkpqi lallvknhgkgmaeqvrfkas 

70 80 90 100 110 120 



130 140 150 160 170 180 

a25 9- 1 . pep VLPDDEDARTI AAELAKMDMFALGTDAVASGET YGRVFADI FELSAALEGRAFKGMLKLT 
II I I I I I I 1 I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 1 I I I I I I I I II I I I I I 
m259-l VLPDDEDARTI AAELAKMDMFALGTDAVASGET YGRVFADI FELSAALEGRAFKGMLKLT 

130 140 150 160 170 180 

190 200 210 220 

a259-l . pep AEYKNIFGDACRSETALELGALNOjALQEISKTSEKSKRI FYX 
I I I i i I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
IH259-1 AEYKNI FGDACRSETALELGALNQALQEISKTSEKSKRI FYX 

190 200 210 220 



The following partial DNA sequence was identified in AT. gonorrhoeae <SEQ ID 1041>: 

g260.seq 
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1 atgggtgcgg gtgtagtatt cgttgtcttt cagccgttct tcagcctgtt 

51 tcgagcgttg ttcgagggcg gagtcggtat agtcgaggga gcgcacgatg 

101 ccgctgaatg cgacttcttg tccgaggaat ttacccgtat ccggatcggt 

151 gatgttttta ttgattcggt aggtcagata acggcccggt tctttcaggc 

501 ctttggtgta aaccctggcg cctttggtgt acagcagcct gccttccggg 

251 cccgagagca ggcgcggcgc ggcagcggtt tctttgcggg aaacgatttg 

301 cgggtgctgc ataaagacgc ggtagaagtt gacatcgatg gcgggaatac 

3 51 cgtatccgga cacttcctta tccggactga ctttgacgac ggggatgccg 
401 tctgtctgtt ccaagccgag gcgcggttcg ccgccaacgt agcgcaacac 

4 51 caatacctgg cccggataaa tcaggtcggg attgtggatt tgatcccggt 
501 tcgcgcccca caggggggga ccattgccac gggctgtaca ggtatttgcc 
551 cgaaataccc cacagggtgt cgccctgttt ga 

This corresponds to the amino acid sequence <SEQ ID 1042; ORF 260.ng>: 
g260.pep 

1 MGA GWFWF QPFFSLFRAL F EGGVGIVEG AHDAAECDFL SEEFTRIRIG 
51 DVFIDSVGQI TARFFQAFGV NPGAFGVQQP AFRAREQARR GSGFFAGNDL 
101 RVLHKDAVEV DIDGGNTVSG HFLIRTDFDD GDAVCLFQAE ARFAANVAQH 
151 QYLARINQVG IVDLIFVRAP QGGTIATGCT GICPKYPTGC RFV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1043>: 

m260. seq 

1 ATGGGTGCGG GTATGGTATT CGTTGTCTTT CGGCCGTTCT CCAGCCTGTT 

51 TCGAGCGTTG TTCGAGGACA GAGTCGGTAT AGTCGAGGGA GCGCACGATG 

101 CCGCTGAATG CGACTTCCTG CCCGAGGAAT TTACCCGTAT CCGGATCGGT 

151 GATGTTTTTA TTGATTCGGT AGG TCAGGTA GCGGCCCGGC TCTTTCAGGC 

201 CTTTGGTGTA AACCCTGGTG CCTTTGGTGT ACAGCAGCCT GCCTTCCGGG 

251 CCCGAGwrCA sGCGCGGyGC GGCAGCGGTT TCTTTGCGGG AAACGATTTG 

301 CGGATGCCGC ATAAAGATGC GGTAGAAGTT GACATCGATG GCGGGAATAC 

351 CGTATCCGGA CACTTCCTTA TCCGGACTCA TTTTGACGAC GGGGATGCCG 

401 TCTGTCTGTT CCAAGCCGAG GCGCGGTTCG CCGTCAACGT GGCGCAACAC 

451 CAATACCTGG TCCGGATAAA TCAGGTCGGG ATTGTGGATT TGATCCCGGT 

501 TCGCGTyCCA CAG 

This corresponds to the amino acid sequence <SEQ ID 1044; ORF 260>: 

m260.pep 

1 MGAG MVFWF RPFSSLFRAL F EDRVGIVEG AHDAAECDFL PEEFTRIRIG 

51 DVFIDSVGQV AARLFQAFGV NPGAFGVQQP AFRARXXARX GSGFFAGNDL 

101 RMPHKDAVEV DIDGGNTVSG HFLIRTHFDD GDAVCLFQAE ARFAVNVAQH 

151 QYLVRINQVG IVDLIPVRVP Q 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 260 shows 89.5% identity over a 171 aa overlap with a predicted ORF (ORF 260.ng) 
from N. gonorrhoeae: 

m260/g260 



10 20 30 40 50 60 

m260 . pep MGAGMVFWFRPFSSLFRALFEDRVGI VEGAHDAAECDFLPEEFTRI RIGDVFI DSVGQV 

III hill II: II llllllll llllllllllllllll I III III! I I II III I Ih 
g260 MGAGWFWFQPFFSLFRALFEGGVGIVEGAHDAAECDFLS5EFTRIRIGDVFIDSVGQI 
10 20 30 40 50 60 



70 80 90 100 110 120 

m260 . pep AARLFQAFGWPGAIXJVQQPAFRARXXARXGSGFFAGNDIiRMPHKDAVEVDIDGGNTVSG 

:|hlllllll!IIIIIIIIIIMI II llllllllllh lilllllllllilllll 
g260 TARFFQAFGVNPGAFGVQQPAFRAREQARRGSGFFAGNDLRVLHKDAVEVDIDGGNTVSG 
70 80 90 100 110 120 



130 140 150 160 170 

m260 . pep HFL I RTH FDDGDAVCL FQAEAR FAVNVAQHQ YLVRI NQVG I VDL I PVRVPQ 

llllll llllllllllllllllhlllllllhlllllllllllllhll 

g 2 6 0 HFLI RTDFDDGDAVCLFOAEARFAANVAQHOYLARINQVGITOLI PVRAPQGGTI ATGCT 

130 140 150 160 170 180 
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g260 GICPKYPTGCRPV 
190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1045>: 
a260.seq 

1 ATGGGTGCGG GTATGGTATT CGTTGTCTTT CGGCCGTTCT CCAGCCTGTT 
51 TCGAGCGTTG TTCGAGGACA GAGTCGGTAT AGTCGAGGGA GCGCACGATG 
101 CCGCTGAATG CGACTTCCTG CCCGAGGAAT TTACCCGTAT CCGGATCGGT 
151 GATGTTTTTA TTGATTCGGT AGGTCAGGTA GCGGCCCGGC TCTTTCAGGC 
201 CTTTGGTGTA AACCCTGGTG CCTTTGGTGT ACAGCAGCCT GCCTTCCGGG 
251 CCCGAGAGCA GGCGCGGCGC GGCAGCGGTT TCTTTGCGGG AAACGATTTG 
301 CGGGTGCCGC ATAAAGATGC GGTAGAAGTT GACATCGATG GCGGGAATAC 
351 CGTATCCGGA CACTTCCTTA TCCGGACTCA TTTTGACGAC GGGGATGCCG 
4 01 TCTGTCTGTT CCAAGCCGAG GCGCGGTTCG CCGTCAACGT GGCGCAACAC 
4 51 CAATACCTGG TCCAGATAAA TCAGGTCGGG ATTGTGGATT TGATCCCGGT 
501 TCGCGTCCCA CAGGCGGCC. GCATTGCCAC GGGCTGTACA GGTATTTGCC 
551 CGAAATGCCC CACAGGGTGT CGCCCTGTTT GA 

This corresponds to the amino acid sequence <SEQ ID 1046; ORF 260.a>: 

a260.pep 

1 MGAGMVFVVF RPFSSLFRAL FEDRVGIVEG AHDAAECDFL PEEFTRIRIG 

51 DVFIDSVGQV AARLFQAFGV NPGAFGVQQP AFRARECARR GSGFFAGNDL 

131 RVPHKDAVEV DI DGGNTVSG HFLIRTHFDD GDAVCLFQAE ARFAVNVAQH 

151 QYLVQINQVG IVDLIPVRVP QAAXIATGCT GICPKCPTGC RPV* 

m260/a260 97. 1 % identity in 1 7 1 aa overlap 

10 20 30 40 50 . 60 

MGAGMVFWFRPFSSLFRALFEDRVGIVEGAHDAAECDFLPEEFTRIRIGDVFIDSVGQV 

1 1 1 1 I I i I i I i t t I I 1 1 1 1 I I I 1 1 I J I f 1 1 t 1 1 I I I I I I I I I I I I 1 1 I I I I I t I I f I I I I 

MGAGMVFWFRPFSSLFRALFEDRVGIVEGAHDAAECDFLPEEFTRIRIGDVFIDSVGQV 
10 2C 30 40 50 60 

70 8C 90 100 110 120 

AARLFQAFGVNPGAFGVQQPAFRARXXARXGSGFFAGNDLRMPHKDAVEVDI DGGNTVSG 
I I I I I I M I I I t M I I I I I I I I I I I II I II I F I I I M I : I II M I I I I I I I I I I I I f 
AARLFQAFGVNPGAFGVQQPAFRAREQARRGSGFFAGNDLRVPHKDAVEVDI DGGNTVSG 
70 80 90 100 110 120 

130 140 150 160 170 

H FL I RT H FD DGDAVCL FQAE AR FAVN VAQHQY L VR I NQVG I V D L I P VRV PQ 
I I I I I I I I I I I I I II I I M ! I 1 I I I II I I I I I I I: M I i I I I I I I I I I I II 
H FL I RT H FD DGDAVCL FQAE AR FAVN VAQIIQ YL VQ I NQVG I V DL I P VRV PQAAX I ATG CT 
130 140 150 160 170 160 

GICPKCPTGCRPVX 
190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1047>: 

g261 . seq 

1 atggagcttg ggcatatcgt attccttgtg ctttgcgcgc cttcagacgg 

51 cctttttact ttccagacat cccgccagcc cgcgttcgcg caagatacag 

101 ctcgggcatt cgcggcagcc gccgacgata cccttgtagc aggtgtgggt 

151 ctgttcgcgg atgtagtcca acacgcccat ttcgtccgcc aacgcccacg 

201 tttgcgcctt ggtcaggtac atcagcggcg tgtggatttg aaaatcgtag 

251 tccatcgcca gattaagggt aacgttcatg gatttgacga acacgccgcg 

3 01 gcagtcggga tagcccgaaa aatcggtttc gcacacgccc ccgatgatgt 

3 51 gccggatacc ctgccctttg gcaaaaatgg cggcgtaaag caggaaaagc 

4 01 gcgttacgcc cgtccacaaa ggtattggga acgccgttgt cggcggtttc 

4 51 gatggcggcg gtttcgatgg cggcggtttc gtccatcagg gcgttgtgcg 
501 taatctgccg catcaggctc aaatcgagta cggtttgact gacacccaaa 

5 51 tcctgcgcga tccactctgc gcgttccagc tcgacggcat ggcgttgccc 
601 gtatcggaag gtgatggctt ggacgttttc gcgcccgtag gtttggattg 



m2 60 . pep 
a2 60 

m260.pep 
a2 60 

m260.pep 
a260 

a260 
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651 cctgaatcag gcaggtcgtc gaatcctgac cgcccgagaa gatgaccaag 
701 gctttttggt ttga 

This corresponds to the amino acid sequence <SEQ ID 1048; ORF 261. ng>: 

g261 -pep 

1 MELGHIVFLV LCARSDGLFT FQTFRQPAFA QDTARAFAAA ADDTLVAGVG 

51 L FAD WO HAH FVRQRPRLRL GQVHQRRVDL KIWHRQIKG NVHGFDEHAA 

101 AVGIARKIGF AHARDDV PDT LPFGKNGGVK QEKRVTPVHK GIGNAWGGF 

151 DGGGFDGGGF VHQGWRNLP HQAQIEYGLT DTQILRDPLC AFQLDGMALP 

201 VSEGDGLDVF APVGLDCLNQ AGGRI LTARE DDQGFLV* 

The following partial DNA sequence was identified in A/, meningitidis <SEQ ID 1049>: 



m261 . seq 




1 


ATGGAGCTTG 


51 


CCTTTTTACT 


101 


CTCGGGCATT 


151 


TTGCTCGCGG 


201 


TTTGCGCCTT 


251 


TCCATCGCCA 


301 


GCAGTCGGGA 


351 


GCCGTATCCC 


401 


gCGTTGCGGC 


451 


GATGGCGGCG 


501 


GgCTcAAGTC 


551 


CGGGCACGTT 


601 


GGCTTGGACG 


651 


TGGTCGAATC 



This corresponds to the amino acid sequence <SEQ ID 1050; ORF 26 1>: 

m261 .pep 

1 MELGHIVFLM VCACSDGLFT FQIFRQPAFA QDTARAFAAA ADDAVIAGVG 

51 LLADIVQHAH FVRQRPRLRL GQIHQRRVDL KIIVHRQIKG NVHRFDKHVA 

101 AVGIAGEVGF AHARDDVPYP LPFGVNRGIE QEKRVAAVYK GIRNAVFGSF 

151 DGGGWHQGI VRNLPHQAQV EYGLFDAQIL RNPAGTFQLD GMALPVLESN 

201 GLDVFAPVGL 0CLNQAGGRI LTARKDDQGL LV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORP from N. gonorrhoeae 

ORF 261 shows 79.7% identity over a 237 aa overlap with a predicted ORF (ORF 261. ng) 
from AT. gonorrhoeae: 

m26l/g261 

10 20 30 40 50 60 

m2 6 1 . pep MELGH I VFLMVCACSDGLFTFQI FRQPAFAQDTARAFAAAADDAVI AGVGLLAD I VQHAH 

lllllllll-M lllll I II II III III II III III I I h-l II I hi hill I I 
g 2 6 1 MELGH I VFLVLCARSDGLFTFQTFRQPAFAQDTARA FAAAADDTLVAGVGLFADWQHAH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m261 .pep FVRQRPRLRLGQI HQRRVDLKI I VHRQI KGNVHRFDKHVAAVGI AGEVGFAHARDDVPYP 

IIIIIMIIIIhlllllMlhllllllllll Ihhlhlll = E 1 1 1 ] 1 1 1 1 1 

g2 6 1 FVRQRPRLRLGQVHQRRVDLKIWHRQI KGNVHG FDEHAAAVG I ARK I GF AHARDDV PDT 

70 80 90 100 110 120 

130 140 150 ISO 170 

m261 . pep L P FGVNRG I EQEKRVAAVY KG I RN AVFGS FDGGGV VHQGIVRNLPHQAQVEYGLF 

1 1 II I h • 1 1 1 1 h hill III hlllll lllhllllllllhllll 

g261 LP FGKNGGVKQEKRVTPVHKGIGNAWGGFDGGG FDGGGFVHQGWRNLPHQAQ I E YGLT 

130 140 150 160 170 180 

180 190 200 210 220 230 

1X1261 . pep DAQI LRNPAGT FQLDGMAL PVLES NGLDVF APVG LDCLNQAGGR I LT ARKDDQG LLVX 

hlllhl =111111111 l-l lllllll III II I I I 1 t I I 1 I I = I I I 1 ^ I 1 I 
g 2 6 1 DTQI LRDPLCAFQLDGMALPVSEGDGLDVFAPVGLDCLNQAGGR I LTAREDDQGFLVX 

190 200 210 220 230 
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The following partial DNA sequence was identified in jY. meningitidis <SEQ ED 1051>; 

a261. seq 

1 ATGGAGCTTG GGC AT ATCGT ATTCCTTATG GTTTGCGCGT GTTCAGACGG 

51 CCTTTTTACT TTCCAGATAT TCCGCCAGCC CGCGTTCGCG CAAGATACAG 

101 CTCGGGCATT CGCGGCAGCC GCCGACGATG CCGTTATAGC AGGTGTGGGT 

151 TTGCTCGCGG ATATAGTCCA GCGCGCCCAT TTCGTCCGCC AACGCCCAAG 

201 TTTGCGCCTT GGTCAGATAC ATCAGCGGCG TGTGGATTTG AAAATCATAG 

251 TCCATCGCCA GATTAAGGGT AACGTTCATG GATTTGACAA ACACGTCACG 

301 GCAGTCGGGA TAGCCGGAGA AGTCGGTTTC GCACACGCCC GCGATGATGT 

351 GCCGTATCCC CTGCCCTTTG GCGTAAATCG CGGCATAGAG CAGGAAAAGC 

4 01 GCGTTGCGGC CGTCTACAAA 3GTATTCGGA ACGCCGTTTT CGGCAGTTTC 

4 51 GATGGCGGCG GTGTCGTCCA TCAGGGCATT GTGCGTAATC TGCCGCATCA 

501 GGCTCAAGTC GAGTACGGTT TGTTTGACGC CCAAATCCTG CGCAATCCAG 

551 CGGGCACGTT CCAGCTCGAC GGCATGGCGT TGCCCGTATT GGAAAGTAAT 

601 GGCTTGGACG TTTTCGCGCC .CGTAGGTTTG GATTGCCTGA ATCAGGCAGG 

651 TGGTCGAATC CTGACCGCCC GAAAAGATGA CCAAGGCTTT TTGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 1052; ORF 261 .a>: 

a2 61 . pep 

1 MELGHIVFLM VCACSDGLFT FQIFRQPAFA QDTAR AFAAA ADDAVIAGVG 

51 LLADIVQRAH FVRQRPSLRL GQIHQRRVDL KIIVHRQIKG NVHGFDKHVT 

101 AVGIAGEVGF AHARDDVPYP LPFGVNRGIE QEKRVAAVYK GIRNAVFGSF 

151 DGGGVVHQGI VRNLPHQAQV EYGLFDAQIL RNPAGTFQLD GMALPVLESN 

201 GLDVFAPVGL DCLNQAGGRI LTARKDDQGF LV* 

m261/a26 1 97.8% identity in 232 aa overlap 

10 20 30 40 50 60 

m261.pep MELGHIVFI^CACSDGLFTFQIFRQPAFAQDTARAFAAAADDAVIAGVGLLADIVQHAH 
t I I I I I I I I I If I I I I I I I I I I I I I I I I I I I I I I I I . I I I II I I I I I I II ! I I I I I I : I I 
a2 61 MELGHIVFLMVCACSDGLFTFQIFRQPAFAQDTARAFAAAADDAVIAGVGLLADIVQRAH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 61 . pep FVRORPRLRLGOIH0RRVDLKI I VHROI KGN VHRFDKH V AAV G I AGE VGFAHARDDVP Y P 

I I I I I I I I II I I I I I I I I I I I I I II I I I i I I I I If I I : I I I I t I 1 I I I I I I I I || I I I 
a2 61 FVRQRPSLRLGQIHQRRVDLKI I VHRQI KGNVHG FDKHVT AV G I AGE VGFAHARDDVP Y P 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 61 . pep LPFGVNRGIEQEKRVAAVYKGIRNAVFGS FDGGGWHQGIVRNLPHQAQVEYGLFDAQIL 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I | I I I | | | | 
a2 61 LPFGVNRGIEQEKRVAAVYKGIRNAVFGSFDGGGWHQGIVRNLPHQAQVEYGLFDAQIL 

130 140 150 160 170 130 

190 200 210 220 230 

m2 61 . pep RN PAG T FQL DGM AL P VLE S N GL D VFA PVG LD C LN Q AGG R I LT ARK D DQG LL VX 

I I I I I I I I I I I 1 I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I : I I I 
a2 61 RNPAGTFQLDGMALPVLESNGLDVFAPVGLDCLNQAGGRILTARKDDQGFLVX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1053>: 

g263 . seq 

1 atggcacgtt taaccgtaca caccctcgaa accgcccccg aagccgccaa 

51 accgcgcgta gaggccgtac ccaaaaacaa cggctttatc cccaacctca 

101 tcggcgtatt ggcaaacgcc cccgaagctt tggcgtttta ccaagaagtc 

151 ggcaagctca acgccgccaa cagcctgacc gccggcgaag ccgaagtgat 

201 ccggatcatc gccgtccgca ccaaccaatg cagcttctgc gtggcagggc 

251 acaccaaact cgcaaccctg aaaaaactcc tgtccgagca atccctcaat 

301 gccgcccgcg ctttggcggc aggtaaatct gacgatgcca aactcggcgc 

351 gcttgccgcc ttcacccaag ccgtaatggc gaaaaaaggc gcagtatccg 

401 acgacgaact caacgccttc ctcgaagcgg gctacaaccg gcagcaggca 
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4 51 gtcgaagtcg taatgggcgt agccttggca actttgtgca actacgccaa 
501 caacctcgcc eaaaccgaaa Lcaaccccaa attgcaggca tacgcctaa 

This corresponds to the amino acid sequence <SEQ ID 1054; ORF 263.ng>: 

g263 .pep 

1 MARLTVHTLE TAPEAAKPRV EAVPKNNGFI PNLIGVLANA PEALAFYQEV 

51 GKLNAANSLT AGEVEVIRII AVRTNQCSFC VAGHTKLATL KKLLSEQSLN 

101 AARALAAGKS DDAKLGALAA FTOAVMAKKG AVSDDELNAF LEAGYNRQQA 

151 VEWMGVALA TLCNYANNLA QTEINPKL QA YA + 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1055>: 

m263 . seq (partial) 

1 . . GCAGCAGGCG AATTTGACGA TGCCAAACTC GGCGCGCTCG CCGCCTTCAC 

51 CCAAGCCGTA ATGGCGAAAA AAGGCGCGGT ATCCGACGAG GAACTCAAAG 

101 CATTTTTCGA TGCGGGCTAC AACCAGCAGC AGGCAGTCGA AGTCGTGATG 

151 GGCGT.AsyC TgGCAACCCT GTGCAACTAC GTCAACAACC TCGGACAAAC 

201 CGAAATCAAC CCCGAATTGC AGGCTTACGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 1056; ORF 263>: 

m263.pep (partial) 

1 . . GCAGCAGGCG AATTTGACGA TGCCAAACTC GGCGCGCTCG CCGCCTTCAC 

51 CCAAGCCGTA ATGGCGAAAA AAGGCGCGGT ATCCGACGAG GAACTCAAAG 

101 CATTTTTCGA TGCGGGCTAC AACCAGCAGC AGGCAGTCGA AGTCGTGATG 

151 GGCGT.AsyC TgGCAACCCT GTGCAACTAC GTCAACAACC TCGGACAAAC 

201 CGAAATCAAC CCCGAATTGC AGGCTTACGC CTGA 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 263 shows 85.7% identity over a 77 aa overlap with a predicted ORF (ORF 263 Jig) 
from N. gonorrhoeae: 

m263/g263 

10 20 30 

m2 6 3 . pep AAGEFDDAKLGALAAFTQAVMAKKGAVSDE 

llh NIIMIIIIIIIIIIIIIIIMI: 
g2 63 Q CS FCVAGHT KLAT LKKLLS EQS LN AARALAAGKS DDAKLGALAAFTQAVMAKKGAVSDD 

B0 90 100 110 120 130 



40 50 60 70 

m2 6 3 . pep ELKAFFDAGYNC^OAVEVVMGVXIATLOnrVNNLGQTEINPELQAYAX 

Ihll-Mlhlll III llll llllilhllhll MINIMI 

g2 6 3 E LNAFLEAG YNRQQAVE WMGV ALATLCN YANN LAQTE I NPKLQAYAX 

140 150 160 170 180 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1057>: 

a263. seq 

1 ATGGCACGTT TAACCGTACA CACCCTCGAA ACCGCCCCCG AAGCCGCCAA 

51 AGCGCGCGTC GAGGCGGTAC TTCAAAACAA CGGCTTTATC CCCAACCTTA 

101 TCGGCGTATT ATCAAACGCC CCCGAAGCCT TGGCGTTTTA CCAAGAAGTC 

151 GGCAAGCTCA ACGCCGCCAA CAGCCTGACC GCCGGCGAAG TCGAAGTAAT 

201 CCAGATTATT GCCGCCCGCA CCAACCAATG CGGCTTCTGC GTGGCAGGGC 

251 ACACCAAACT CGCAACCCTG AAAAAACTCC TTTCCGAACA ATCCGTCAAA 

301 GCCGCGCGCG CTTTGGCGGC AGGCGAATTT GACGAT GCT A AACTCGGCGC 

351 GCTCGCCGCC TTTACCCAAG CCGTAATGGC AAAAAAAGGC GCGGTATCCG 

4 01 ACGAGGAACT CAAAGCATTT TTTGATGCGG GCTACAACCA GCAGCAGGCA 

4 51 GTCGAAGTCG TGATGGGCGT AGCCTTGGCA ACTTTGTGCA ACTACGTCAA 

501 CAACCTCGGA CAAACCGAAA TCAACCCCGA ATTGCAGGCT TACGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 1058; ORF 263. a>: 

a2 63.pep 

1 MARLTVHTLE TAPEAAKARV EAVLQNNGFI PNLIGV^SNA PEALAFYQEV 

51 GKLNAANSLT AGEVEVIQII AARTNQCGFC VAGHTKLATL KKLLSEQSVK 

101 AARALAAGEf DDAKLGALAA FTQAVMAKKG AVSDEELKAF FDAGYNQQQA 

151 VEWMGVALA TLCNYVl ^NLG QTEINPELQA YA* 
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m263/a263 97.4% identity in 77 aa overlap 

10 20 30 

m2 63 . pep AAGEFDCAKLGALAAFTQAVMAKKGAVSDE 

I I I I I I I I I I I I 1 i I I i I I I I I I I I I I I I I 
a2 63 QCGFCVAGHTKLATLKKLLSEQSVI<AARALAAGEFDDAKLGALAAFTQAVMAKKGAVSDE 
80 90 100 110 120 130 



40 50 60 70 

m263.pep ELKAFFDAGYNQQQAVEWKGXXLATLCNYVNNLGOTEINPELQAYAX 
I I I I I I I I I I II 1 I I I ! II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 
a2 63 ELKAFFDAGYNQQQAVEWMGVALATLCNYVNNLGQTEINPELQAYAX 
140 150 160 170 180 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1059>: 

g264 . seq 

1 ttgactttaa cccgaaaaac ccttttcctc ctcaccgccg cgttcggcac 

51 acactccctt cagacggcat ccgccgacgc agtggtcaag ccggaaaaac 

101 tgcacgcctc cgccaaccgc agctacaaag tcgccgaatt cacgcaaacc 

151 ggcaacgcct cgtggtacgg cggcaggttt cacgggcgca aaacttccgg 

2 01 cggagaccgc tacgatatga acgcctttac cgccgcccac aaaaccctgc 

251 ccatccccag ccatgtgcgc gtaaccaaca ccaaaaacgg caaaagcgtc 

301 atcgtccgcg tcaacgaccg cggccccttc cacggcaacc ccatcatcga 

351 cgtatccaaa gccgccgcgc aaaaattggg ctttgtcagc caagggacgg 

4 01 cacacgtcaa aatcgaacaa atcgtcccgg gccaatccgc accggttgcc 
451 gaaaacaaag acatctttat cgacttgaaa tctttcggta cggaacacga 

5 01 agcacaagcc tatctgaacc aagccgccca aaatttcgcc gcttcgtcat 
551 caagcccgaa cctctcggtt gaaaaacgcc gttacgaata cgttgtcaaa 
601 atgggcccgt ttgcctcgca ggaacgcgcc gccgaagccg aagcgcaggc 
651 acgcggtatg gttcgggcgg tactgacctc cggttga 

This corresponds to the amino acid sequence <SEQ ID 1060; ORF 264.ng>: 

g264 .pep 

1 LTLTRKTLFL LTAAFGTHSL QTASA DAWK PEKLHASANR SYKVAEFTQT 

51 GNASWYGGRF HGRKTSGGDR YDMNAFTAAH KTLPIPSHVR VTNTKNGKSV 

101 IVRVNDRGPF HGNRIIDVSK AAAQKLGFVS QGTAHVKIEQ IVPGQSAPVA 

151 ENKDIFIDLK SFGTEHEAQA YLNQAAQNFA ASSSSPNLSV EKRRYEYWK 

201 MGPFASQERA AEAEAQARGM VRAVLTSG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1061>: 
m264 .seq 

1 TTGACTTTAA CCCGAAAAAC CCTTTTCCTT CTCACCGCCG CATTCGGCAC 

51 ACACTCCCTT CAGACGGCAT CCGCCGACGC AGTGGTCAAG GCAGAAAAAC 

101 TGCACGCCTC CGCCAACCGC AGCTACAAAG TCGCCGGAAA ACGCTACACG 

151 CCGAAAAACC AAGTCGCCGA ATTCACGCAA ACCGGCAACG CCTCGTGGTA 

201 CGGCGGCAGG TTTCACGGGC GCAAAACTTC CGGCGGAGAA CGATACGATA 

251 TGAACGCCTT TACCGCCGCC CACAAAACCC TGCCCATCCC CAGCTATGTG 

301 CG CGTAACC A ATACCAAAAA CGGCAAAAGC GTCATCGTCC GCGTCAACGA 

351 CCGCGGCCCC TTCCACGGCA AC CGCATCAT CGACGTATCC AAAGCCGCCG 

401 CGCAAAAATT GGGCTTTGTC AACCAAGGGA CGGCACACGT CAAAATCGAA 

451 CAAATCGTCC CGGGCCAATC CGCACCGGTT GCCGAAAACA AAGACATCTT 

501 TATCGACTTG AAATCTTTCG GTACGGAACA CGAAGCACAA GCCTATCTGA 

551 ACCAAGCCGC CCAAAACTTC GCCGTTTCGT CATCGGGTAC GAACCTCTCG 

601 GTTGAAAAAC GCCGTTACGA ATACGTCGTC AAAATGGGAC CGTTTACCTC 

651 GCAGGAACGC GCCGCCGAAG CCGAAGCTCA GGCGCGCGGT ATGGTTCGGG 

701 CGGTATTGAC CGCCGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 1062; ORF 264>: 

m264 .pep 

1 LTLTRKTLFL LTAAFGTHSL QTASA DAWK AEKLHASANR SYKVAGKRYT 

51 PKNQVAEFTQ TGNASWYGGR FHGRKTSGGE RYDMNAFTAA HKTLPIPSYV 

101 RVTNTKNGKS VIVRVNDRGP FHGNRIIDVS KAAA0KL3FV NQGTAHVKIE 

151 QIVPGQSAPV AENKDIFIDL KSFGTBHEAQ AYLNQAAQNF AVSSSGTNLS 

201 VEKRRYEYW KMGPFTSQER AAEAEAQARG MVRAVLTAG* 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 264 shows 91 .6% identity over a 239 aa overlap with a predicted ORF (ORF 264 .ng) 
from N. gonorrhoeae: 

m264/g264 

10 20 30 40 50 60 

LTLTRKTLFLLTAAFGTHSLQTASADAVVKAEKLHASANRSYKVAGKRYTPKNQVAEFTQ 
IIIIIIIIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIII | | | | 

LTLTRKTLFLLTAAFGTHSLQTASADAWKPEKLHASANRSYKVA EFTQ 

10 20 30 40 



m264 .pep 
g264 



70 80 90 100 110 120 

m2 64 . pep TGNASWYGGRFHGRKTSGGERYDMNAFTAAHKTLPI P S YVRVTNT KNG KS V I V R VNDRG P 

I I I I I I I I I I I I I I t I I I h I I I I I I I I I I I I I I I I I I = I I I I I I I I I | I I I | | | | | | | | 
g2 64 TGN ASW YGGR FHGR KTS GGD R YDMNAFT AAHKTLP I P SHVRVTNT KNG KS V I VRVNDRGP 

50 60 70 80 90 100 



130 140 150 160 170 180 

m264 . pep FHGNRI IDVSKAAAQKLGFVNQGTAHVK1 EQI VPGQSAPVAENKDIFIDLKSFGTEHEAQ 

I lllllll lllllllllll|:||l Mil III III III llllllllll I MNIIIIIIM 

g2 6 4 FHGNRI I DVS KAAAQKLGFVSQGTAHVKI EQI VPGQS APVAENKD I FI DLKSFGTEHEAQ 

110 120 130 140 150 160 



190 200 210 220 230 240 

m2 64 - pep AYLNQAAQN FAVSS S GTNLS VE KRR YEYWKMGP FT S QERAAEAE AQARGMVRAVLTAGX 

1 1 1 1 1 1 ! 1 1 h 1 1 I •• MINI Mill II! II: hi llllll II II I II i INII Ml 

g2 64 AYLNQAAQNFAASSSSPNLSVEKRRYEYWKMGPFASQERAAEAEAOARGMVRAVLTSGX 
170 180 190 200 210 220 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1063>: 

a264 . seq 

1 TTGACTTTAA CCCGAAAAAC CCTTTTCCTC CTCACCGCCG CATTCGGCAT 

51 ACATTCCTTT CAGACGGCAT CCGCCGACGC AGTGGTCAGG GCAGAAAAAC 

101 TGCACGCCTC CGCCAACCGC AGCTACAAAG TCGCCGGAAA ACGCTACACG 

151 CCGAAAAACC AAGTCGCCGA ATTCACGCAA ACCGGCAACG CCTCGTGGTA 

201 CGGCGGCAGG TTTCACGGGC GCAAAACTTC CGGCGGAGAA CGATACGATA 

251 TGAACGCCTT TACCGCCGCC CACAAAACCC TGCCCATCCC CAGCTATGTG 

301 CGCGTAACCA AT AC C AAAAA CGGCAAAAGC GTCATCGTCC GCGTCAACGA 

351 CCGCGGCCCC TTCCACGGCA ACCGCATCAT CGACGTATCC AAAGCCGCCG 

401 CGCAAAAATT GGGCTTTGTC AACCAAGGGA CGGCGCACGT CAAAATCGAA 

451 CAAATCGTCC CGGGCCAATC CGCACCGGTT GCCGAAAACA AAGACATCTT 

501 CATCGACTTG AAATCTTTCG GTACGGAACA CGAAGCACAA GCCTATCTGA 

551 ACCAAGCCGC CCAAAACCTG GCTTCATCGG CATCAAACCC GAACCTCTCG 

601 GTTGAAAAAC GCCGTTACGA ATACGTCGTC AAAATGGGAC CGTTTGCCTC 

651 GCAGGAACGC GCCGCCGAGG CCGAAGCTCA GGCGCGCGGT ATGGTTCGGG 

701 CGGTATTAAC CGCCGGTTGA 



This corresponds to the amino acid sequence <SEQ ID 1064; ORF 264.a>: 

a264 .pep 

1 LTLTRKTLFL LTAAFGIHSF QTAS ADAWR AEKLHASANR SYKVAGKRYT 

51 PKNQVAEFTQ TGNASWYGGR ?HGRKTSGGE RYDMNAFTAA HKTLPIPSYV 

101 RVTNTKNGKS VIVRVNDRGP FHGNRI I DVS KAAAQKLGFV NQGTAHVKIE 

151 QIVPGQSAPV AENKDIFIDL KSFGTEHEAQ AYLNQAAQN L ASSASNPNLS 

201 VEKRRYEYW KMGPFASQER AAEAEAQARG MVRAVLTAG* 

m264/a264 96.2% identity in 239 aa overlap 

10 20 30 40 50 50 

m2 64 . pep LTLTRKTLFLLTAAFGTHSLQTASADAWKAEKLHASANRS YKVAGKRYT PKNQVAEFTQ 

I I I I I I I I f I I I t I I I I I : I I M II I I I : t I I I I I I || || | | | t | | | | | | | | | | | | | || 
a2 64 LTLT RKT L FLLT AA FG I H S FQT AS A D AV VRAE KL HAS AN R S Y K V AGKR YT PKNQ V AE FTQ 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m264 . pep TGNASWYGGRFHGRKTSGGERYDMNAFTAAHKTLPIPSYVRVTNTKNGKSVIVRVNDRGP 

I I I I M i I I I 1 1 1 1 1 I 1 1 I I ! 1 1 II I 1 1 I 1 1 1 1 1 I 1 1 I I I I I I I I 1 1 1 I I I 1 1 1 I I i I I I 
a264 TGNASWYGGRFHGRKTSGGERYDMNAFTAAHKTLPIPSYVRVTNTKNGKSVIVRVNDRGP 
70 80 90 100 110 120 

130 140 150 160 170 180 

m264 . pep FHGNRI I DVSKAAAQKLGtVNQGTAHVKIEQl V PG^SAPVAENKDI FI DLKSFGTEHEAQ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
a264 FHGNRI I DVSKAAAQKLGFVNQGTAHVKIEQ I VPGQSAPVAENKDIFI DLKSFGTEHEAQ 

130 .140 1.50 160 170 180 

190 200 210 220 230 240 

m2 64 . pep AYLNQAAQNFAVSSSGTNLSVEKRRYEYVVKMGPFTSQERAAEAEAQARGMVRAVLTAGX 
I i I I I I I I I : I 1:1: I II I I I I II I I I I I ! I I i : I I I I I I I I I I I I I I I I I I I I ! I I I 
a264 AYLNQAAQNLASSASNPNLSVEKRRYEYWKMGPFASQERAAEAEAQARGMVRAVLTAGX 
190 200 210 220 230 240 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1065>: 



m265 . seq 












1 


ATGTCGGTGA 


TTTTGCCGCC 


GACACGCGCC 


AACGCTGCTT 


TTTCGGCTTG 


51 


GGCGCGGCTG 


ATGATTTTGT 


CTTGTTTGTT 


GTGTTGGTGT 


GCGGCGTGTC 


101 


CGTGGTCGTC 


ATCGCCGTGT 


CCGTCGTGGT 


GGGCGAGCGC 


GGGGGCGGAA 


151 


ATGCTCAGCA 


GTGCGGTTGC 


GGCGGAGGTC 


AAGAGAAGGT 


3TTTGATGTT 


201 


CATAT . TTTT 


GCCTTTGTAA 


ATCGTGGGTT 


GGAAAATGTG 


3ATATTAATA 


251 


AGGTATCAAA 


TAACCGTCAG 


CCGGCGGTCA 


ATACCGCCCG 


AACCATACCG 


301 


CGCGCCTGAG 


CTTCGGCTTC 


GGCGGCGCGT 


TCCTGCGAGG 


TAAACGGTCC 


351 


CATTTTGACG 


ACGTATTCGT 


AA 







This corresponds to the amino acid sequence <SEQ ID 1066; ORF 265>: 

m265.pep 

1 MSVILPPTRA NAAFSAWARL MILSCLLCWC AACPWSSSPC PSWWASAGAE 

51 MLSSAVA AEV KRRCLMFIXF AFVNRGLENV DINKVSNNRQ PAVNTARTIP 

101 RAXASASAAR SCEVNGPILT TYS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 265 shows 88.6% identity over a 123 aa overlap with a predicted ORF (ORF 265.ng) 
from N. gonorrhoeae: 

m265/g265 

10 20 30 40 50 60 

m265 . pep MSVILPPTRANAAFSAWARLMILSCLLC^C^CPWSSSPCPSWWASAGAEMIiSSAVAAEV 

iiiiiiiiihiiiiiiiiiiiim minim iiiiiiiiiiiiii mm I 

g265 MSVILPPTRAQAAFSAWARLMILSCLPCWCAACPWSSSPCPSWWASAGAEMPNSAVAAAV 
10 20 30 40 50 60 



70 80 90 100 110 120 

m2 65 . pep KRRCbMFIXFAFVNRGLENVDINKVSNNRQPAVNTARTIPRAXASASAARSCEVNGPILT 

I II II III Ihlhlhl III II II II II hllMIIII lllllllllhllllll 

g2 65 KRRCLMFI - FALVNQGL KNGD I N KVSNNRQPEV STARTI PRACASASAARSCEANGPI LT 

70 80 90 100 110 



m265.pep TYSX 
Nil 

g265 TYSX 
12 0 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1067>: 
a265.seq 

1 ATGTCGGTGA TTTTGCCGCC GACACGCGCC AACGCTGCTT TTTCGGCTTG 

51 GGCGCGGCTG ATGATTTTGT CTTGTTTGCT GTGTTGGTGT GCGGCGTGTC 

101 CGTGGTCGTC ATCGCCGTGT CCGTCGTGGT GGGCGAGTGC GGGGGCGGAA 

151 ATGCCCATCA GTGCGGTTGC GGCGGCGGTC AAGAGAAGGC GTTTGAAGTT 

201 CATTTTTGCT CCTGCGAAGT ATCTGGT GGTGT TTGAAGGACG 

251 TAAAGGCGGG ACATCAACCG GCGGTTAATA CCGCCCGAAC CATACCGCGC 

301 GCCTGAGCTT CGGCCTCGGC GGCGCGTTCC TGCGAGGCAA ACGGTCCCAT 

351 TTTGACGACG TATTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 1068; ORF 265. a>: 
a2 65 . pep 

1 MSVILPPTRA NAAFSAWARL MILSCLLCWC AACPWSSSPC PSWWASAGAE 

51 MPISAVAA AV KRRRLKFIFA PAKYLX . .XC LKDVKAGHQP AVNTART I PR 

101 A*ASASAARS CEANGPII/TT YS* 



m265/a265 79.7% identity in 1 23 aa overlap 

10 20 30 40 50 60 

m265.pep MSVILPPTRANAAFSAWARLMILSCLLCWCAACPWSSSPCPSWWASAGAEMLSSAVAAEV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a2 65 MSVI LPPTRANAAFS AWARLMI LSCLLCWCAACPWS SS PCPSWWAS AGAEMP I SAVAAAV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m265 .pep KRRCLMFIXFAFVNRGLENVDINKVSNNRQPAVNTARTIPRAXASASAARSCEVNGPILT 
III ! II I : : : I : : : I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I t 

a265 KRRRLKFI FAPAKYLXXCLKDVKAGHQPAVNTARTIPRAXASASAARSCEANGP1LT 

70 80 90 100 110 



m265.pep TYSX 
II I I 

a265 TYSX 
120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1069>: 

g266 . seq 

l agttcagacg gcatcgccgc cgacaatgcc caaacagaaa gcccatcatg 

51 accgcatcca tgtacatcct tttggtcttg gcactcatct ttgccaacgc 

101 ccccttcctc acgaccagac tgttcggcgt ggccgcgctc aagcgcaaac 

151 atttcggaca ccacctgatc gagctggcgg caggtttcgc gctgaccgcc 

2 01 tctcttgcct acatcctcga atcccgtgcg ggagcggtac acaatcaggg 

251 ttgggagttt tacgccaccg tcgtctgcct gtacctcatt ttcgccttcc 

301 cgtgtttcgt gcggcggtat ttttggcaca cgcgcaacag ggaataa 

This corresponds to the amino acid sequence <SEQ ID 1070; ORF 266.ng>: 

g266 .pep 

1 MQFRRHRRRQ CPNRKPIMTA SMYILLVLA L IFANAPFLTT RLFGVAALKR 

51 KHFGHH LIEL AAGFALTASL AYIL ESRAGA VHNQGWEFY A TWCLYLI FA 

101 FPCFVRRYFW HTRNRE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1071 >: 

m266 . seq 

1 ATGCCGTTCC GCAACGCGtT cAGACGGCAT CGCCGCCGAC AACGCCTAAA 

51 CAGAAAG CCC ACCATGACCG CATCCATGTA CATCCTTTTG GTCTTGGCAC 

101 TCATCTTTGC CAACGCCCCC TTCCTCACGA CCAGACTGTT CGGCGTGGCC 

151 rCACTCAAGC GCAAACATTT CGGACACCAC ATGATCGAGC TGGCGGCAGG 

2 01 TTTCGCG CTG ACCGCCGTTC TTGCCTACAT CCTsGAATCC CGTGCAGGAT 
251 CGGTACACGA TCAGGGTTGG GAGTTTTATG CCACAGTCGT CTGCCTGTAC 

3 01 CTGATTTTTG CGTTTCCATG TTTTGTGTGG CGGTATTTTT GGCACACGCG 
351 CAACAGGGAA TAG 
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This corresponds to the amino acid sequence <SEQ ID 1072; ORF 266>: 

ra266 .pep 

1 MPFRNAFRRH RRRQRLNRKP TMTASMYILL VLA LI FANAP FLTTRLFGVA 

51 XLKRKHFGHH MIELAAGFAL TAVLAYILES RAGSVHDQGW SFY ATWCLY 

101 LIFAFPCFV W RYFWHTRNRE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 266 shows 92. 1% identity over a 1 14 aa overlap with a predicted ORF (ORF 266.ng) 
from N. gonorrhoeae: 

m266/g266 

10 20 30 40 50 60 

m266 .pep MPFRNAFRRHRRRQRLNRKPTMTASMYI LLVLALI FANAP FLTTRLFGVAXLKRKHFGHH 

II! II III III] Mill II II III III III III I Ml II 1 1 II II 1 1 III 

g266 MQFRRHRRRQCPNRKPIMTASMYI LLVLALI FANAP FLTTRLFGVAALKRKHFGHH 

10 20 30 40 50 

70 80 90 100 110 120 

m266 . pep MI ELAAGFALTAVLAYI LESRAGSVHDQGWE FYATWCLYLI FAFPCFVWRYFWHTRNREX 

M I M II 1 1 1 II II I II 1 1 1 1 h I h II I II II 1 1 1 II 1 1 II II 1 1 1 1 II 1 1 II II II I 

g266 LIELAAGFALTASLAYILESRAGAVHNQGWEFYATWCLYLIFAFPCFVRRYFWHTRNREX 
60 70 80 90 100 110 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1073>: 



a266. seq 






1 


ATGCCGTTCC 


GCAATGCGTT 


51 


CAGAAAGCCC 


GCCATGACCG 


101 


TGATTTTTGC 


CAACGCCCCC 


151 


CCGCTCAAGC 


GCAAACATTT 


201 


TTTCGCGCTG 


ACCGCCGTTC 


251 


CGGTACACGA 


TCAGGGTTGG 


301 


CTGATTTTTG 


CGTTTCCCTG 


351 


CAACAGGGAA 


TAG 



This corresponds to the amino acid sequence <SEQ ED 1074; ORF 266.a>: 

a266.pep 

1 MPFRNAFRRH RRRQCPNRKP AMTASMYILL LLALIFA NAP FLTTKLFGIV 

51 PLKRKHFGHH LIELAAGFAL TAVLAYI LES RAGAVHDQGW EFY ATWCLY 

101 LIFAFPCFVW RYFWHTRNRE * 

m266/a266 9 1 .7% identity in 1 20 aa overlap 

10 20 30 40 50 60 

m266.pep MPFRNAFRRHRRRQRLNRKPTMTASMYI LLVLALIFANAPFLTTRLFGV AX LKRKHFGHH 

I I I I I I I I I I I I I I I I I I : I I I I I I I I I : I I t I i I I I I I I I I : I I I : : I I I I i I I I I 
a266 M P FRN AFRRHRRRQC PNRK PAMT ASM Y I LL LLAL I FANAP FLTTKL FG I V PLKRKH FG HH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m266 . pep M I ELAAGFALTAVLAYI LESRAGSVHDQGWE FYATWCLYLI FAFPC FVWRYFWHTRNRE 

: I M I I I I I 1 1 1 1 I I I I I I I I I I: I I I I I I 1 I 1 II I I I I I I I I I I I I I I I I I I I I I I I I I 
a266 LIELAAGFALTAVLAY I LESRAGAVHDQGWEFYATVVCLYLI FAFPC FVWRYFWHTRNRE 

70 80 90 100 110 120 



m2 66.pep X 
I 

a266 X 



The following partial DNA sequence was identified in /V. gonorrhoeae <SEQ ID 1075>: 
g267.seq 
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1 atgcaagtcg ccttttttct cgccgtggta ttcaaaaata tgggtttcca 
51 caatcgcatc ggtcgggcag gcctcttcgc agaaaccgca gaagatgcac 
101 ttggtcaggt cgatgtcgta acgcttggtg cggcgggtgc cgtcttcgcg 
151 ttcttccgat tcgatgttga tcgccattgc cggacacacc gcctcgcaca 
201 atttacacgc gatgcagcgt tcctctccgt tcggaaaacg gcgttgcgcg 
251 rgcagaccgc ggaaacgcac ggattgcggc gttttctctt cgggaaaata 
301 aattgtgtct ttgcgggcaa aaaagttttt gagcgttacg cccatgcctt 
3 51 cgaccagttc gccaagcaga aaggttttta ctaa 
This corresponds to the amino acid sequence <SEQ ID 1076; ORF 267.ng>: 

g267 .pep 

1 MQVAFFLAW FKNMGFHNRI GRAGLFAETA EDALGQVDW TLGAAGAVFA 

51 FFRFDVDRHC RTHRLAQFTR DAAFLSVRKT ALRVQTAETH GLRRFLFGKI 

101 NCVFAGKKVF ERYAHAFDQF AKQKGFY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1077>: 

m2 67.seq 

1 GTGCAAGTCG CCTTTTTTCT CGCCGTGGTA TTCAAAAATA TGGGTTTCCA 

51 CAATCGCATC AGTCGGGCAT GCCTCTTCGC AGAAACCGCA GAAGATGCAC 

101 TTGGTCAGGT CGATGTCGTA ACGCTTGGTG CGGCGCGTAC CGTCTTCACG 

151 TTCTTCCGAT TCGATGTTAA TCGCCATTGC CGGACACACT GCCTCACACA 

201 ACTTACACGC GATACACCGC TCTTCGCCGT TCGGATACCG CcGCTGCGCG 

251 TGCAGACCGC GGAAACGCAC GGATTGCGGC GTTTTCTCTT CGGGGAAATA 

301 AATTGTGTCT TTGCGGGCGA AAAAGTTTTT GAGCGTTACG CCCATACCTT 

3 51 TTACCAATTC GCCAAGCAGA AAGGTTTTTA CTAA 

This corresponds to the amino acid sequence <SEQ ID 1078; ORF 267>: 
m267.pep 

1 VQVAFFLAW FKNMGFHNRI 5RACLFAETA EDALGOVDW TLGAARTVFT 
51 FFRFDVNRHC RTHCLTQLTR DTPLFAVRIP PLRVQTAETH GLRRFLFGEI 
101 NCVFAGEKVF ERYAHTFYQF AKQKGFY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 267 shows 82.7% identity over a 127 aa overlap with a predicted ORF (ORF 267.ng) 
from N. gonorrhoeae: 

m267/g267 

10 20 30 40 50 60 

VQVAFFIAWFKIWGFT3NRISRA<XFAETAEDAIXjQTO 

:||IIIIIIIIIIIIMII|:|I llll IMIli MINIMI II MhMIUhlll 

MQVAFFIAWFKNMGFHNRIGRAGLFAETAEDALGQVDVVTLGAAGAVFAFFRFDVDRHC 
10 20 30 40 50 60 

70 80 90 100 110 120 

RTHCLTQLTRDTPLFAVRIPPLRVQTAETHGLRRFLFGEINCVFAGEKVFERYAHTFYQF 

III MMMM -Ml M 1 1 1 1 1 1 M I M 1 1 1 1 M 1 1 1 1 II M 1 1 1 1 1 II M II 

RTHRLAQFTRDAAFLSVRKTALRVQTAETHGLRRFLFGKINCVFAGKKVFERYAHAFDOF 
70 80 90 100 110 120 



m267.pep 
g267 

m267 .pep 
g267 



m2 67 . pep AKQKGFYX 

Illlllll 
g267 AKQKGFYX 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 1079>: 
a267.seq 

1 GTGCAAGTCG CCTTTTTTCT CGCCGTGGTA TTCAAAAATA TGGGTTTCCA 

51 CAATCGCATC GGTCGGGCAG GCTTCTTCGC AGAAACCGCA GAAGATGCAC 

101 TTGGTCAGGT CGATGTCGTA ACGCTTGGTG CGGCGCGTGC CGTCTTCGCG 

151 TTCTTCCGAT TCGATGTTGA TCGCCATTGC GGGGCAAACG GCTTCACACA 

201 ATTTACACGC GATGCAGCGT TCCTCGCCGT TTGGATAACG GCGTTGCGCG 

251 TGCAGACCGC GGAAACGCAC GGATTGCGGC GTTTTCTCTT CGGGAAAATA 

301 AATCGTGTCT TTGCGGGCAA AAAAGTTTTT GAGCGTTACG CCCATACCTT 

351 TTACCAATTC GCCAAGCAGA AAGGTTTTTA CTAA 
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This corresponds to the amino acid sequence <SEQ ID 1080; ORF 267. a>: 

a267 .pep 

1 VQVAFFLAW FKNfMGFHNRI GRAGFFAETA EDALGQVDVV T LG AARA V FA 

51 FFRFDVDRHC GANG FT Q FT R DAAFLAVWIT ALRVQTAETH GLRRFLFGKI 

101 NRVFAGKKVF ERYAHTFYQF AKQKGFY* 

m267/a267 82.7% identity in 127 aa overlap 

10 20 30 40 50 60 

m2 67 . pep VQVAFFLAWFKNMGFHNRISRACLFAETAEDALGQVDWTLGAARTVFTFFRFDVNRHC 
I I I I II I I I I I I I I I I I I I I: I I : I i 1 1 I I I I I I I I I I I I I I I I I : I I: I I [ I I !: I i | 
a267 VQVAFFLAVVFKNMGFHNRIGRAGFFAETAEDALGQVDVVTLGAARAVFAFFRFDVDRHC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m267 . pep RTHCLTQLTRDTPLFAVRIPPLRVQTAETHGLRRFLFGEINCVFAGEKVFERYAHTFYQF 
:||:IM: ::M I I I I I I I I I I I I I I I I I I : I I I I ! 1 : I I I I I I I I I I II I 
a267 GANGFTQFTRDAAFLAVWITALRVQTAETHGLRRFLFGKINRVFAGKKVFERYAHTFYQF 

70 80 90 100 110 120 



m2 67 . pep AKQKGFYX 
I I I I I I I I 
a267 AKQKGFYX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1081>: 

G268.seq 

1 atgaaaaaaa atttacccgc actggcattg gcaagtatgc tgattttgtc 

51 gggctgcgac cgtttgggaa taggcaaccc gttttccgga aaggaaattt 

101 cctgcggaag cgaagagact aaagagattt tggtcaaact ggtccgcgac 

151 aatgtcgaag gtgaaaccgt caaaactttt gacgacgacg cattcaaaga 

201 ccaagcattt gccgatatcg gcatatcgca tatccgcaga atggtcgaac 

251 gtttgggcat aaccgtcgat gaagtccgaa ctaccgagaa aaccgacacg 

301 tccagcaaac tcaaatgtga agccgcgtta aaactggacg tgcccgacga 

3 51 tgttgtcgat tatgccgtcg ccgccaacca atctataggc aacagccata 

4 01 agaaaacgcc cgactttttt gaaccctact accgcaaaga aggcgcgtat 
451 tatgtcaaaa ctatttctta cagcgtccag ccgacagacg acaaaagcaa 
501 aatctttgcc gaactcagtc aggcacacga tatcatccat ccgctcagcg 
551 agctggtgtc tatggcactg attaaagagc cgttggacaa agcgaaacaa 
601 aggaacgaaa aacttgaagc ggcagaagcc accgcgcagg aagcgaggga 
651 ggcagaagaa gcggcggcgc aggaggcatt gggtcgggag caggaagccg 
701 cccgcgtatc cgaatgggaa gaacgctaca agctgtcgcg cagcgagttc 
751 gagcagtttc ggaaaggatt gcctcaaact gtacagaata agctgcaagc 
801 ctcgcagaaa acatggaaaa gcggtatgga caagatctgt gccaacaatg 
851 cgaaagccga aggtgaaacg ccaaacggca taaaagtcag tgagttggcg 
901 tgtaaaacgg cagaaaccga agcacgcttg gaagagctgc acaaccgtaa 
951 aaaagccctt atcgacgaaa tggtcaggga agaggacaag aaagaactgc 

1001 caaagcggct ctga 

This corresponds to the amino acid sequence <SEQ ID 1082; ORF 268.ng>: 
m268 .pep 

1 MKKNLPALAL ASMLILSG CD RLGIGNPFSG KEISCGSEET KEILVKLVRD 

51 NVEGETVKTF DDDAFKDQAF ADIGI SHIRR MVERLGITVD EVRTTEKTDT 

101 SSKLKCEAAL KLDVPDDWD YAVAANQSIG NSHKKTPDFF EPYYRKEGAY 

151 YVKTISYSVQ PTDDKSKI FA ELSQAHDIIH PLSELVSMAL IKEPLDKAKQ 

201 RNEKLEAAEA TAQEAREAEE AAAQEALGRE QEAARVSEWE ERYKLSRSEF 

251 EOFWKGLPQT VQNKLQA3QK TWKSGMDKIC ANNAKAE3ET FNGIKVSELA 

301 CKTAETEARL EELHNRKKAL IDEMVREEDK KELPKRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1083>: 

m268.aeq (partial) 

1 . . ATGGCACTGA TTAAAGAGCC GTTGGACAAA GTGAAACAAA GGAACGAAGA 
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51 ACTTGAAGCG GCAGAAGAAG CGGCGGCGCA GGAGGCATTG GGTCGGGAGC 

101 AGGAAGCCGC CCGCGTATCC GAATGGGAAG AACGCTACAA GCTGTCGCGC 

151 AG . CAGTTCG AGCAGTTCTG GAAAGGATTG CCTCAAACCG TACAGAATAA 

201 GCTGCAACCn TCACAGAAAA CATGGAAAAG CGGGATGGAT AAAATCTGTG 

251 CCAACAATGC GAAAGCTGAA GGTAAAACGC CAAACGGCAT AAAATTCAGC 

301 GAACTGGCAT GCAAAACGGC GAAAACCGAA GCACG CTTGG AAGAGCTGCA 

351 CAACCGTAAA AAAGCCCTTA TCGACGAAAT GGyCAGGGAA GCGGACAmGA 

401 AAGAACTGTC AAAGCGGCTs TGA 

This corresponds to the amino acid sequence <SEQ ID 1084; ORF 268>: 

m268.pep (partial) 

1 . .MALIKEPLDK VKQRNEELEA AEEAAAQEAL GREQEAARVS EWEERYKLSR 

51 XQFEQFWKGL PQTVQNKLQP SQKTWKSGMD KICANNAKAE GKTPNGIKFS 

101 ELACKTAKTE ARLEELHNRK KALIDEMXRE ADXKELSKRL * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 268 shows 86.0% identity over a 150 aa overlap with a predicted ORF (ORF 268.ng) 
from N. gonorrhoeae: 

m268/g268 

10 20 

MALI KEPLDKVKQRNEELEAAE 

I I! Ill I II hi I II h I I I II 
SVQPTDDKSKIFAELSQAHDI IHPLSELVSMALI KEPLDKAKQRNEKLEAAEATAQEARE 
160 170 180 190 200 210 

30 40 50 60 70 80 

- - EAAAQEAIX3REQEAARVSEWEERYKLSR5QFEQFT'JKGLPQTVQNKLQPS0KTWKSGMD 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 III III MM 

AEEAAAQEAIA3REQEAARVSEWEERYKLSRSEFEQFWKGLPQTVQNKLQASQKTWKSGMD 
220 230 240 250 260 270 

90 100 110 120 130 140 

KI CANNAKAEGKTPNG I KFSELACKTAKTEARLEELHNRKKALIDEMXREADXKELS KRLX 

MMMIIIIhllllll I I I 1 I I I 1 ^ I i 1 1 I I I I I I I I I I I I I t J II I III I I II 
KIGANNAKAEGETPNGIKVSELACKTAETEARLEELJI^ 

280 290 300 310 320 330 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1085>: 

a2 68 . seq 

1 ATGGCACTGA TTAAAGAGCC GTTGGACAAA GCGAAACAAA GGAACGAAGA 

51 ACTTGAAGCG GCAGAAGAAG CGGCGGCGCA GGAGGCATTG GGTCGGGAGC 

101 AGGAAGTCGA CCGCGTATCC GAATGGGAAG AACGCTACAA GCTGTCGCGC 

151 AGCGAGTTCG AGCAGTTCTG GAAAGGATTG CCTCAAACCG TACAGAATAA 

201 GCTGCAAGCC TCACAGAAAA CATGGAAAAG CGGGATGGAT AAAATCTGTG 

251 CCAACAATGC GAAAGCTGAA GGTGAAACGC CAAACGGCAT AAAATTCAGC 

301 GAACTGGCAT GCAAAACGGC GGAAACCGAA GCACGCTTGG AAGAGCTGCA 

351 CAACCGTAAA AAAGCCCTTC TCGACGAAAT GGCCAGGGAA GCGGACAAGA 

401 AAGAACTGCC AAAGCGGCTC TGA 

This corresponds to the amino acid sequence <SEQ ID 1086; ORF 268.a>: 
a268.pep 

1 MALIKEPLDK AKQRNEELEA AEEAAAQEAL GREQEVDRVS EWEERYKLSR 
51 SEFEQFWKGL PQTVQNKLQA SQKTWKSGMD KICANNAKAE GETPNGIKFS 
101 ELACKTAETE ARLEELHNRK KALLDEMARE ADKKELPKRL * 

m268/a268 9 1 .4% identity in 1 40 aa overlap 

10 20 30 40 50 60 

m268 - pep MALIKEPLDKVKQRNEELEAAEEAAAQEALGREQEAARVSEWEERYKLSRXQFEQFWKGL 
I i I I I I 1 I I I : I I I I I I ( I I II 1 1 1 1 II 1 1 I I I I I : I I I I I I I I I I I I I : 1 1 I I I I I I 
a2 68 MALI KEPLDKAKQRNEELEAAEEAAAQEALGREOEVDRVSEWEERYKLSRSEFEQFWKGL 



m268 .pep 
g268 

m268 .pep 
g268 

m268 .pep 
g268 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m268.pep PQTVQNKLQPSQKTHKSGMDKICANNAKAEGKTPKGIKFSELACKTAKTEARLEELHNRK 
1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I i I I II : I I I I I I I I I I I I 
a268 PQTVQNKLQASQKTWKSGMDKICANNAKAEGETPNGIKFSELACKTAETEARLEELHNRK 

70 80 90 100 110 120 

130 140 
m268.pep KALIDEMXREADXKELSKRLX 
111*111 Mil III till 
a268 KALLDEMAREADKKELPKRLX 

130 140 



The following partial DNA sequence was identified in//, meningitidis <SEQ ID 1087>: 

m268-l.seq 

1 GTGCAATCCC GATATGATGG TTTGCATAAA TTTAAACATA TATGTTCCGC 

51 AGCTATGGCA CTGATTAAAG AGCCGTTGGA CAAAGTGAAA CAAAGGAACG 

101 AAGAACTTGA AGCGGCAGAA GAAGCGGCGG CGCAGGAGGC ATTGGGTCGG 

151 GAGCAGGAAG CCGCCCGCGT ATCCGAATGG GAAGAACGCT ACAAGCTGTC 

201 GCGCAGCGAG TTCGAGCAGT TCTGGAAAGG ATTGCCTCAA ACCGTACAGA 

251 ATAAGCTGCA AGCCTCACAG AAAACATGGA AAAGCGGGAT GGATAAAATC 

301 TGTGCCAACA ATGCGAAAGC TGAAGGTAAA ACGCCAAACG GCATAAAATT 

351 CAGCGAACTG GCATGCAAAA CGGCGAAAAC CGAAGCACGC TTGGAAGAGC 

401 TGCACAACCG TAAAAAAGCC CTTATCGACG AAATGGCCAG GGAAGCGGAC 

451 AAGAAAGAAC TGTCAAAGCG GCTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1088; ORF 268-l>: 

m268-l.pep 

1 VQSRYDGLHK FKHICSAAMA LIKEPLDKVK QRNEELEAAE EAAAQEALGR 
51 EQEAARVSEW EERYKLSRSE FEQFWKGLPQ TVQNKLQASQ KTWKSGMDKI 
101 CANNAKAEGK TPNGIKFSEL ACKTAKTEAR LEELHNRKKA LIDEMAREAD 
151 KKELSKRL* 



m268-l/g268 82.3% identity in 164 aa overlap 

10 20 30 

m268-l .pep VQSRYDGLHKFKHICSAAMALIKEPLDKVKQRNE 

: I : I : : : : I I I I t I I I I I I : I I i I I 
g268 KEGAYYVKTISYSVQPTDDKSKIFAELSQAHDIIHPLSELVS — MALIKEPLDKAKQRNE 

150 160 170 180 190 200 

40 50 60 70 80 

ra268- 1 . pep ELEAAE EAAAQEALGREQEAARVSEWEERYKLSRSEFEQFWKGLPQTVQM 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 

g268 KLEAAEATAQEAREAEEAAAQEALGREQEAARVSEWEERYKLSRSEFEQFWKGLPGTVQN 
210 220 230 240 250 260 

90 100 110 120 130 140 

m268- 1 . pep KLQASQKTWKSGMDKICANNAKAEGKTPNGIKFSELACKTAKTEARLEELHNRKKALIDE 
I I I I I I I I I I I I I I II I I I I I I I I I : I M I II I I I I I I I I : I I I I I I I I I I I I I I I I I I 
g268 KLQASQKTWKSGMDKICANNAKABGETPNGIKVSELACKTAETEAR1EELHNRKKALIDE 
270 280 290 300 310 320 

150 159 
m268-l.Dep MAREADKKELSKRUt 
I : I I llllt I I I I 
g268 MVREEDKKELPKRLX 
330 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 1089>: 

&268-l.seq 

1 GTGCAATCCC GATATGATGG TTTGCATAAA TTTAAACATA TATGTTCCGC 

51 AGCTATGGCA CTGATTAAAG AGCCGTTGGA CAAAGCGAAA CAAAGGAACG 

101 AAGAACTTGA AGCGGCAGAA GAAGCGGCGG CGCAGGAGGC ATTGGGTCGG 

151 GAGCAGGAAG TCGACCGCGT ATCCGAATGG GAAGAACGCT ACAAGCTGTC 

201 GCGCAGCGAG TTCGAGCAGT TCTGGAAAGG ATTGCCTCAA ACCGTACAGA. 

251 ATAAGCTGCA AGCCTCACAG AAAACATGGA AAAGCGGGAT GGATAAAATC 

301 TGTGCCAACA ATGCGAAAGC TGAAGGTGAA ACGCCAAACG GCATAAAATT 

351 CAGCGAACTG GCATGCAAAA CGGCGGAAAC CGAAGCACGC TTGGAAGAGC 
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401 TGCACAACCG TAAAAAAGCC CTTCTCGACG AAATGGCCAG GGAAGCGGAC 
4 51 AAGAAAGAAC TGCCAAAGCG GCTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1090; ORF 268-l.a>: 

a268-l -pep 

1 VQSRYDGLHK FKHICSAAMA LIKEPLDKAK QRNEELEAAE EAAAQEALGR 
51 EQEVDRVSEW EERYKLSRSE FEQFWKGLPQ TVQNKLQASQ KTWKSGMDKI 
101 CANNAKAEGE TPNGIKFSEL ACKTAETEAR LEELHNRKKA LLDEMAREAD 
151 KKELPKRL* 

a268-l/»268-i 95.6% identity in 158 aa overlap 

10 20 30 40 50 60 

a268-l .pep VQSRYDGLHKFKHICSAAMALIKEPLDKAKQRNEELEAAEEAAAQEALGREQEVDRVSEW 
I I I I I I I I I I I I I ! I I I 1 II I I I I I I I I : I II 1 1 I I I I I I I I I I I I I t I I I I I : I I I I I 
m268-l VQSRYDGLHKFKHICSAAMALIKEPLDKVKQRNEELEAAEEAAAQEALGREQEAARVSEW 

10 20 30 40 50 60 

70 80 90 100 110 120 

a2 68-1 .pep EE RY KLS RS EF EQ FW KGL P QT VQNKLQAS QKTM KS GM DK I C AN N AKAEG ET PNGI KFS E L 

I I II I I III I I II I tl I I II II I III II II II II II I I I I II II II II l:| I M I HI II 

m268-l EERYKLSRSEFEQFWKGLPQTVQNKbQASQKTWKSGMDKICANNAKAEGKTPNGTKFSEL 
70 60 90 100 110 120 

130 140 150 159 

a268-l .pep ACKTAETEARLEELHNRKKALLDEMAREADKKELPKRLX 
I I I I I : I I I I I I I i I I I I I I I : I I I I I I I I I I I I I I I I 
m268-l ACKTAKTEARLEELHNRKKALIDEMAREADKKELSKRLX 

130 140 150 



The following partial DNA sequence was identified in AT. gonorrhoeae <SEQ ID 1091>: 

g269.seq 

1 atggtttggc gtgtgaattg cgcggcaacg gcggcgctga ttttttcgtc 

51 cagcccttgg atttgggcgg tggtgtgggt gtggtcgcgg tcggcttttt 

101 cctgcaaacc ttgcgccagc cttgacgcgt ccagtgcgcc ggcgttggcg 

151 gtttcgccgt gggactttat ccggaacacg gcttcgccca aggtgtcggc 

201 ggctttgatg cacagtttta aaaccagggc tttggggcgg ttttctgcgc 

251 cgcccgttgc cattttgctg tccaatcgcg gggttaaaaa accgttgtcg 

301 tttaagtcgc cgtccgtcca agtcgatacg agcgcgcttc tttgcctttc 

351 attgcggtct tcgtaa 

This corresponds to the amino acid sequence <SEQ ID 1092; ORF 269.ng>: 

g269.pep 

1 NVWRVNCAAT AALIFSSSPW IWAWWVWSR SAFS CKPCAS IjDASSAPALA 

51 VSPWDFIRNT ASPKVSAALM HSFKTRALGR FSAPPVAILL SNRGVKKPLS 

101 FKSPSVQVDT SALLCLSLRS S* 

The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 1093>: 

m269.seq 

1 ATGGTTTGGC GTGTGAATTG CGCGGCAACG GCGGTGCTGA TTTTTTCGTC 

51 CAGCCCTTGG ATTTGGGCGG CGGTGTGGGT GTGGTCTCGG TCGGCTTTGT 

101 CTTGCAAACC TTGCGCCaCG TGCCCGCGTC CAGCGCCTGC GTTGATGGTT 

151 TCGCCGTGGG ACTTTATCCA AAACACGGCT TCGCCCAAGG TGTCGGCGGC 

201 TTTGATGCAC AGTTTTAAAA CCAGGGCTTT GGGGCGGTTT TCGTCGCCGC 

251 CTGTCGCCAT TTTGCTGTCC GAGCGCGGGG TTAAAAAGCC GTTGTCGTTT 

301 AAATTTTCGT CCGTCCAAGT CGATACGAGC GCGCTTCTCT GCCTTTCGTT 

351 GCGGTCTTCG TAA 

This corresponds to the amino acid sequence <SEQ ID 1094; ORF 269>: 

m269.pep 

1 MVWRVNCAAT AVLIFSSSPW IWAAVWVWSR SALS CKP CAT CPRPAPALMV 

51 SPWDFIQNTA SPKVSAALMH SFKTRALGRF SSPPVAILLS ERGVKKPLSF 

101 KFSSVQVDTS ALLCLSLRSS * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 269 shows 87.6% identity over a 121 aa overlap with a predicted ORF (ORF 269.ng) 
from N. gonorrhoeae: 

m269.pep rW^VNC^TAVLIFSSSPWIWAAVWVWSRSALSCKPCATCP-RPAPALMVSPWDFIQNT 59 

MIlllllllhliMlitllihillllllhlllllh 1 1 1 1 iillllhll 

g269 ^WWRV^ICAATAALIFSSSPWIWAVVWVWSRSAFSCKPCASLDASSAPAIlAVS PWDFIRNT 60 

ra269.pep AS PKVS AALMH5 FKTRALG R FS £ P PVA I LLS ERG VKKPLS FKFSSVQVDTSA LLCLS LR S 119 

I 1 1 I i I M 1 1 1 1 1 1 1 1 M i E 1 1 : 1 1 E 1 1 1 1 h I M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 i I i 1 1 1 M 

g269 AS PKVS AALMHS FKTRAIjG R FS AP P VA I LLSNRG VKKPLS FKS PSVQVDTS ALLCLS LRS 120 

ra269.pep SX 121 
1 I 

g269 SX 122 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1095>: 



a269.seq 






1 


ATGGTTTGGC 


GTGTGAATTG 


51 


CAGCCCTTGG 


ATTTGGGCGG 


101 


CTTGGAGGTT 


TTGCGCCAGC 


151 


GTTTCGCCGT 


GGGACTTTAT 


201 


GGCTTTGATG 


CACAGTTTTA 


251 


CGCCTGTCGC 


CATTTTGCTG 


301 


TTTAAATTTT 


CGTCCGTCCA 


351 


GTTGTGGTCT 


TCGTAA 



TTTTCGTCGC 



This corresponds to the amino acid sequence <SEQ ID 1 096; ORF 269.a>: 

a269.pep 

1 MVWRVNCAAT AVLIFSSSPW IWAAVWVWAR SALS WRFCAS VPASSAPALT 

51 VSPWDFIQNT ASPKVSAALM HSFKTRALGR FSSPPVAILL SGRGVKKPLS 

101 FKFSSVQVDT SALLCLSLWS S* 

m269/a269 90. 1 % identity in 1 2 1 aa overlap 

10 20 30 40 50 59 

ra269.pep MVWRVNCAATAVLIFSSSPWIWAAVWVWSRSALSCKPCATCP-RPAPALMVSPWDFICNT 
I I I I I I I I I I I I I I I I I I I I I I I I M I I • I I [ I I : It: i till I I I I I I I I I I 
a269 MVWRVNCAATAVLIFSSSPWIWAAVWVXflARSALSWRFCASVPASSAPALTVSPWDFICNT 
10 20 30 40 50 60 

60 70 80 90 100 110 119 

1*269 . pep ASPKVSAALMHSFKTRALGRFSSPPVAILLSERGVKKPLSFKFSSVQVDTSALLCLSLRS 
I I I I I I I I I I I I I I I II I I I I I 1 I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I I I I I I I 
62 69 ASPKVSAALMHSFKTRALGRFSSPPVAILLSGRGVKKPLSFKFSSVQVDTSALLCLSLWS 
70 80 90 100 110 120 

120 

ni2 69.pep SX 
I I 

a269 SX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1097>: 
g270.seq 

1 atgaataaaa accgcaaatt actgcttgcc gcactgctgc tgactgcctt 

51 tgccgccttc aagctcgttt tgttgcaatg gtggcaggcg cagcagccgc 

101 aagccgtggc ggcgcaatgc gatttgaccg agggttgcac gctgccggac 

151 ggaagccgtg tccgcgccgc cgccgtttca accaaaaaac cgtttgacat 

201 ttatatcgaa cacgcgcccg ccggcacgga acaggtcagc atcagcttca 

251 gtatgaaaaa tatggatatg ggtttcaacc gctatatgtt cgagcggcaa 

301 ccgtcgggga cttggcaggc agcacgcatc cgcctgcccg tctgtgtcga 

351 aggcaggcgc gattttacgg cggacattac aatcggcagc cggacatttc 

401 agacggcacc taccgccgaa taa 

This corresponds to the amino acid sequence <SEQ ID 1098; ORF 270 .ng>: 
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g270 . pep 

1 MNKNRKLLLA ALLLTAFAAF KLVLLQWWQA QQPQAVAAQC DLTEGCTLPD 
51 GSRVRAAAVS TKKPFDIYIE HAPAGTEQVS ISFSMKNMDM GFNRYMFERQ 
101 PSGTWQAARI RLPVCVEGRR DFTADITIGS RTFQTAFTAE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1099>: 



m270. seq 












1 


ATGAATAAAA 


ACCGTAAATT 


ACTGCTTGCC 


GCACTGCTGC 


TGATTGCCTT 


51 


TGCCGCCGTC 


AAGCTCGTTT 


TGTTGCAATG 


GTGGCAGGCG 


Ca.CAGCCGC 


101 


AAGCTGTGGC 


GGCGCAATGC 


GATTTGACCG 


AGGGTTGCAC 


GCTGCCGGAC 


151 


GGAAGCCGCG 


TCCGCGCCGC 


CGCcGTTTCA 


ACCAAAAAAC 


CGTTTGATAT 


201 


TTATATCGAA 


CACGCGCCCG 


CCGGCACGGA 


ACAGGTCAGC 


ATCAGCTTCA 


251 


GTATGAAAAA 


TATGGATATG 


GGTTTCaACC 


GCTATATGTT 


CGAGCGGCAA 


301 


CCGTCGGGGA 


CTTGGCAGGC 


AGTACGCATC 


CGCCTGCCCA 


TCTGTGTCGA 


351 


AGGCAGGCGC 


GATTTTACGG 


CGGACATTAC 


AATCGGCAGT 


CGGACATTTC 


401 


AGACGGCATT 


TACCGCCGAA 


TAA 







This corresponds to the amino acid sequence <SEQ ID 1 100; ORF 270>: 

m270.pep 

1 MNKNRKLLLA ALLLIAFAA V KLVLLQWWQA XQPQAVAAQC DLTEGCTLPD 

51 GSRVRAAAVS TKKPFDIYIE HAPAGTEQVS ISFSMKNMDM GFNRYMFERQ 

101 PSGTWQAVRI RLPICVEGRR DFTADITIGS RTFQTAFTAE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 270 shows 96.4% identity over a 140 aa overlap with a predicted ORF (ORF 270.ng) 
from N. gonorrhoeae: 

m270/c270 

1C 20 30 40 50 60 

MNKNRKLLLAALLLIAPAAVKLVLLQWWOAXQPQAVAAQCDLTEGCTLPDGSRVRAAAVS 

iiiiiiiiiiiiii mi iimmii iiiiiiiiiiiimiiiiimiiini 

MNKNRKLLLAALLLTAFAAFKLVLLQWWQAQQPQAVAAQCDLTEGCTLPDGSRVRAAAVS 
1C 20 30 40 50 60 

70 80 90 100 110 120 

TKKPFD I Y I EHAPAGTEQV5 1 S FSMKNMDMGFNRYMFERQPSGTWQAVR I RLP I CVEGRR 

I IMIIIIIII! II IIIMM IIIMI llll MINI 1 1 1 1 1 1 1 1 1 h 1 1 1 I h 1 1 1 1 1 1 

T KKP FD I Y I EHAPAGTEQV3 1 S FSMKNMDMGFNRYMFERQPSGTWQAARI RLP VCVEGRR 
70 80 90 100 110 120 

130 140 
DFTADITIGSRTFQTAFTAEX 

! II I lllll IIMM IMIM 
DFTADITIGSRTFQTAFTAEX 
130 140 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 11 01>: 

a270 . seq 

1 ATGAATAAAA ACCGTAAATT ACTGCTTGCC GCACTGCTGC TGATTGCCTT 

51 TGCCGCCGTC AAGCTCGTTT TGTTGCAATG GTGGCAGGCG CAGCAGCCGC 

101 AAGCTGTGGC GGCGCAATGC GATTTGACCG AGGGTTGCAC GCTGCCGGAC 

151 GGAAGCCGCG TCCGCGCCGC CGCCGTTTCA ACCAAAAAAC CGTTTGATAT 

201 TTATATCGAA CACGCGCCCG CCGGCACGGA ACAGGTCAGC ATCAGCTTCA 

251 GTATGAAAAA TATGGATATG GGTTTCAACC GCTATATGTT CGAGCGGCAA 

301 CCGTCGGGGA CTTGGCAGGC AGTACGCATC CGCCTGCCCA TCTGTGTCGA 

351 AGGCAGGCGC GATTTTACGG CGGACATTAC AATCGGCAGC CGGACATTTC 

401 AGACGGCATT TACCGCCGAA TAA 

This corresponds to the amino acid sequence <SEQ ID 1 102; ORF 270.a>: 
a270.pep 

1 MNKNRKLLLA ALLLIAFAAV KLVLLQWWQA QQPQAVA AQC DLTEGCTLPD 
51 GSRVRAAAVS TKKPFDIYIE HAPAGTEQVS ISFSMKNMDM GFNRYMFERQ 



m2 70.pep 
g270 

m270 .pep 
g270 

m270.pep 
9270 



WO 99/57280 



PCT/US99/09346 



619 

101 PSGTWQAVRI RLPICVEGRR DFTADITIGS RTFQTAFTAE * 

m270/a270 99.3% identity in 1 40 aa overlap 

10 20 30 40 50 60 

m270 . pep MNKNRKLLLAALLLI AFAAVKLVLLQWWQAXQPQAVAAQCDLTEGCTLPDGSRVRAAAVS 

I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I fl I I I I I I I I I I I I I I I I I I I | 
a270 MNKNRKLLLAALLLIAFAAVKLVLLQWWQAQQPQAVAAGCDLTEGCTLPDGSRVRAAAVS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m270.pep TKKPFDIYIEHAPAGTECVSISFSMKNMDMGFNRYMFERQPSGTWQAVRI RLPICVEGRR 

I I I I II I I I I I I I I I I I I I > I ! I I I I I I I I I I I t I I I I I I I [ I I I I I I I I I I I I I I ! I I I 
a270 TKKPFDIY I EHAPAGTEQVS I SFSMKNMDMGFNRYMFERQPSGTWQAVRI RLPICVEGRR 

70 80 90 100 110 120 

130 140 
m270.pep DFTADITIGSRTFQTAFTAEX 
lllllllll! II II lllllll 
a270 DFTADITIGSRTFQTAFTAEX 

130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 103>: 

g27l . seq 

1 atgttcagtt cgcggatggc gaggatttgg gcgacggggg taacgttgtg 

51 tatggtcagt ccgtgtccgg cgttgacgac caagcccaaa tcgccggcga 

101 aatgcgcgcc gttttggatg cgctcgaact gcctgatttg ttcggcgtgg 

151 ctttgtgcgt cggcatatgc gccggtgtgc agctcgacaa cgggcgcgcc 

201 gacatcacgg gcggcttgga tttgcctgtc gtcggcatcg ataaacaagg 

251 acacgcgtat gcccgcgtcg gtcaggattt tggcgaattc ggcgattttt 

301 tcctgttgcg ccaatacgtc caaaccgcct tcggtcgtga tttcctgccg 

351 tttttcaggc acgatgcaca cgtcttccgg catcacttta agcgcgtttt 

401 cgagcatttc ttccgtcaac gccatttcaa ggttcaggcg cgtgcggatg 

451 gcgtttttga cggcaaatac atccgcgtct ttgatgtggc ggcggtcttc 

501 gcgcaggtgc atggtaatca ggtctgcacc gtgcgtttcg gcaaccagtg 

551 ccgcctccac ggggctggga taa 

This corresponds to the amino acid sequence <SEQ ID 1 104; ORF 271.ng>: 

g271.pep 

1 MFSSRMARIW ATGVTLCMVS PCPALTTKPK SPAKCAPFWM RSNCLICSAW 

51 LCASAYAPVC SSTTGAPTSR AAWICLSSAS INKDTRMPAS VRILANSAIF 

101 SCCANTSKPP SWISCRFSG TMHTSSGITL SAFSSISSVN AISRFRRVRM 

151 AFLTANTSAS LMWRRSSRRC MVIRSAPCVS ATSAASTGLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 105>: 



m271 . seq 












1 


AwGTTCAGTT 


CGCGGATGGC 


GAGGATTTGG 


GCGATGGGGG 


TAACGTTGTG 


51 


TATGGTCAGT 


CCGTGTCCGG 


CGTTGACGAC 


CAAGCCCAAA 


TCGCCGGCGA 


101 


AATGCGCGCC 


GTTTTGGATG 


CGCTCGAACT 


GCCTGATTTG 


TTCGGCGTGG 


151 


CTGCGCGCGT 


CGGCATACGC 


GCCTGTGTGC 


AGCTCGACAA 


CGGGCGCGCC 


201 


GACATCACGG 


GCGGCTTGGA 


TTTGCCTGTC 


GTCGGCATCG 


ATAAACAAAG 


251 


ACACGCGTAT 


GCCTGCGTCG 


GTCAGGATTT 


TGGTGAACCC 


GGCGATTTTT 


301 


TCCTGTTGCG 


CCAATACGTC 


CAAACCGCCT 


TCGGTCGTGA 


TTTCCTGACG 


351 


TTTTTCAGGC 


ACGATGCACA 


CGTCTTCCGG 


CATCACTTTC 


AAAGCGTTTT 


401 


CCAACATTTC 


TTCCGTCAAC 


GCCATTTCAA 


GGTTCAGGCG 


CGTGCGGATG 


451 


GCGTTTTTGA 


CGGCAAACAC 


GTCCGCGTCT 


TTGATGTGGC 


GGCGGTCTTC 


501 


GCGCAGGTGC 


ATGGTAATCA 


AATCCGCACC 


GTGCGTTTCG 


GCAACCAGTG 


551 


CCGCCTCCAC 


GGGGCTGGGA 


TAA 







This corresponds to the amino acid sequence <SEQ ID 1 106; ORF 271>: 
m27l .pep 

1 XFSSRMARIW AMGVTLCMVS PCPALTTKPK SPAKCAPFWM RSNCLICSAW 
51 LRASAYA PVC SSTTGAPTSR AAWICLSSAS INKDTRMPAS VRILVNPAIF 
101 SCCANTSKPP SWISXRFSG TMHTSSGITF KAFSNISSVN AISRFRRVRM 
151 AFLTANTSAS LMWRRSSRRC MVIKSAPCVS ATSAASTGLG * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 271 shows 95.2% identity over a 189 aa overlap with a predicted ORF (ORF 27l.ng) 
from TV. gonorrhoeae: 

m271/g271 

10 20 30 40 50 60 

m2 7 1 . pep XFS SRMARI WAMGVTLCMVS PCPALTTKPKS PAKCAPFWMRSNCLI CSAWLRASAYAPVC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
g271 MFSSRMARIWATGVTLCPIVSPCPAliTTKPKSPAKCAPFWMRSNCLICSAWLCASAYAPVC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m271.pep SSTTGAPTSRAAWIO.SSASINKDTRMPASVRILVNPAIFSCCANTSKPPSWISXRFSG 

I M ! I 1 1 1 1 ! 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 I ! h I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1| Mil 

g271 SSTTGAPTSRAAWICLSSASINKDTRMPASVRILANSAIFSCCANTSKPPSWISCRFSG 
70 80 90 100 110 120 

130 140 150 160 170 180 

m271 . pep TMHTSSGITFKAFSNISSVNAISRFRRVRMAFLTANTSASLMWRRSSRRCMVIKSAPCVS 
M I II I I I I : : I I I : I I I I I I I I I I I I I I I I I I I I I II I I t I I I I I I I I I I 1 1 : I I I I I I 
g271 TMHTSSGITLSAFSSISSVNAISRFRRVRMAFLTANTSASLMWRRSSRRCMVIRSAPCVS 

13G 140 150 160 170 180 



m271 .pep 
g271 



19G 

ATSAASTGLGX 
I II I I I I I I I I 
ATSAASTGLGX 
19C 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 107>: 

a271 . seq 

ATGTTCAGTT CGCGGATGGC GAGGATTTGG 

TATGGTCAGT CCGTGTCCGG CGTTGACGAC 

AATGCGCGCC GTTTTGGATG CGCTCGAACT 

CTGCGCGCGT CGGCATACGC GCCTGTGTGC 

GACATCACGG GCGGCTTGGA TTTGCCTGTC 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



GCGATGGGGG 
CAAGCCCAAA 
GCCTGATTTG 



TAACGTTGTG 
TCGCTGGCAA 
TTCGGCGTGG 



AGCTCGACAA CGGGCGCGCC 
GTCGGCATCG ATAAACAAGG 



ACACGCGTAT 
TCTTGTTGCG 
TTTTTCCGGC 
CGAGCATTTC 

GCGTTTTTGA CAGCAAACAC 



GCCCGCGTCG GTCAGGATTT 
CCAATACGTC CAAGCCGCCT 
ACGATGCACA CGTCTTCCGG 
TTCCGTCAAC GCCATTTCAA 



TGGTGAATTC GGCAATTTTG 
TCGGTCGTGA TTTCCTGACG 



GTCCGCGTCT 
GCGCAGGTGC ATGGTAATCA GGTCGGCACC 
CCGCCTCCAC GGGGCTGGGA TAA 



CATCACTTTA 
GGTTCAGGCG 
T7GATGTGGC 



AGCGCGTTTT 
CGTGCGGATG 
GGCGGTCTTC 



G7GCGTTTCG GCAACCAGTG 



This corresponds to the amino acid sequence <SEQ ID 1 108; ORF 271. a>: 

a271.pep a 

MFSSRMARIW AMGVTLCMVS PCPALTTKPK SLAKCAPFWM RSNCLICSAW 



1 
51 
101 
151 



LRASAYAPVC SSTTGAPTSR AAWICLSSAS 
SCCANTSKPP SVVI5+RFSG TMHTS5GITL 
AFLTANTSAS LMWRRSSRRC MVIRSAPCVS 



INKDTRMPAS VRILVNSAIL 
SAFSSISSVN A1SRFRRVRM 
ATSAASTGLG * 



m271/a271 96.3% identity in 1 89 aa overlap 



10 20 30 40 50 60 

m271 .pep XFSSRMARIWAMGVTLCMVSPCPALTTKPKSPAKCAPFWMRSNCLICSAVfLRASAYAPVC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I 
a271 MFSSRMARIWAMGVTLCMVSPCPALTTKPKSLAKCAPFWMRSNCLICSAWLRA3AYAPVC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m271.pep SSTTGAPTSRAAWICLSSASINKDTRMPASVRILVNPAIFSCCANTSKPPSVVISXRFSG 
I I I I I I I 1 I I I I I M I I 1 I I I i I 1 I I I I I I I I I 1 I i I I : I I I I I I I I I I I 1 I I I I I I I t 
a271 SSTTGAPTSRAAWICLSSASINK3TRMPASVRILVNSAILSCCANTSKPPSWISXRFSG 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m27i.pep tmhtssgitfkafsnissvnaisrfrrvrmafltantsaslmwrrssrrcmviksapcvs 
I I I I I I I I I :: I I |: I I i I 1 1 I t I I t I I I I I I I I I t I I I I t I I I I I I I I i I I I : I I I I I I 
a 27 1 TMHTSSGITLSAFSSISSVWAI SRFRRVRMAFLTANTSASLMWRRSSRRCMVIRSAPCVS 

130 140 150 160 170 180 

190 

m2 7 1 . psp ATS AASTGLGX 

i ii mil n i 

a 271 ATS AASTGLGX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 109>: 

g272 . seq 

1 atgactgcaa aggaagaact gttcgcatgg ctgcgccata tgaacaaaaa 

51 caaaggttcc gacctgtttg tgacgaccca tttcccgccc gctatgaagc 

101 tggacggcaa aatcacccgc ateaeggacg aaccgctgac ggcggaaaaa 

151 tgtatggaaa tcgccttttc gattatgagt gcgaagcagg eggaagaatt 

201 ttcatcgacc aacgagtgea atttcgecat cagcctgccg gacaccagcc 

251 gcttccgcgt caatgegatg atacagegeg gtgegaegge gttggtattc 

301 cgegegatta ccagcaagat tcccaagttt gaaagcctga acctgccgcc 

351 ggccttgaag gatgtbgege tgaaaaaacg cgggctggtt atttttgtcg 

401 gcggcaccgg ctegggcaaa tcgacttcgc tcgcctcgct tatcgactac 

451 cgcaatgaaa attegttegg acacatcatc accatcgaag atccgatcga 

501 gtttgcccac gaacacaaaa actgeatcat tacccagcgc gaggteggeg 

551 tggacacgga aaactggatg gcggcgttga aaaatacget gcgtcaggcg 

601. ccggatgtga tecttategg cgaaatccgc gaccgtgaaa caatggacta 

651 cgccaccgcc tttgccgaaa eggggcattt gtgtatggcg acgctgcacg 

701 ccaacagcac caatcaggcg ctcgaccgca tcatcaactt cttccccgag 

751 gageggegeg aacaattget gaeggatttg tcgctcaacc ttcaggcgtt 

801 tatttegcaa cgcctcgttc cgegagaegg eggcaaggge agggtggcgg 

851 cagtcgaggt getgetcaat tcgcccctga tttcggagtt gattcacaac 

901 ggcaacatcc atgaaatcaa agaagtgatg aaaaaatcca ctaccctggg 

951 tatgeagace ttcgaccaac acctttacca attgtatgaa aaaggegaga 

1001 tttccttgea ggatgccttg aaaaatgccg attccgcaca tgatttgcgt 

1051 ttggcggtac agttgcgcag ccgcagggca caaagttccg accccgattt 

1101 ggaactgetc tga 

This corresponds to the amino acid sequence <SEQ ID 1 1 10; ORF 272.ng>: 

g272.pep 

1 MTAKEELFAW LRHMNKNKGS DLFVTTHFPP AMKLDGKITR ITDEFLTAEK 

51 CMEIAFSIMS AKQAEEFSST NECNFAISLP DT5RFRVNAM IQRGATALVF 

101 RAITSKIPKF ESLNLPPALK DVALKKRGLV IFVGGTGSGK STSLASLIDY 

151 RNBNSFGHII TIEDPIEFVH EHKNCIITQR EVGVDTENWM AALKNTLRQA 

201 PDVILIGEIR DRETMDYAIA FAETGHLCMA TLHANSTNQA LDRIINFFPE 

251 ERREQL.LTDL SLNLQAFISQ RLVPRDGGKG RVAAVEVLLN SPLISELIHN 

301 GNIHEIKEVM KKSTTLGMQT FDQHLYQLYE KGEISLQDAL KNADSAHDLR 

351 LAVQIiRSRRA QSSDPDLELL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1 1>: 

m272 .seq 

1 ATGACCGCAA AGGAAGAACT GTTCGCATGG CTGCGCCATA TGAwCCAAAA 

51 CAAAGGTTCC GACCTGTTCG TGACAACCCA TTTCCCGCCC GCAATGAAGC 

101 TGGACGGCAA AATCACCCGC ATCACGGACG AACCGCTGAC GGCGGAAAAA 

151 TGTATGGAAA TCGCCTTTTC GATTATGAGT GCGAAGCAGG CGGAAGAATT 

2 01 TTCATCGACC AACGAGTGCA ACTTCGCCAT CAGCCTGCCG GACACCAGCC 

251 GCTTCCGCGT CAATGCGATG ATACAGCgCG GCGCGACGGC GTTGGTATTC 

301 CGTACGATTA CCAGCAAGAT TCCCAAGTTT GAAAGCCTGA ACCTGCCGCC 

351 AGTCTTGAAG GATGTCG CGC TGAAAAAACG CGGGCTGGTT ATTTTTGTCG 

4 01 GCGGCACCGG CTCGGGTAAA TCGACTTCGC TTGCCTCGCT TATCGACTAC 

451 CGCAATGAAA ATTCGTTCGG ACACATCATC ACCATCGAAG ACCCGATCGA 

501 GTTTGTCCAC GAACACAAAA ACTGCATCAT CACCCAGCGC GAGGTCGGCG 
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551 TGGATACGGA AAACTGGATG GcGGCGTTGA AAAACACGCT GCGTCAGGCG 

601 CCTGATGTCA TCCTTATCGG CGAAATCCGT GACCGCGAAA CAATGGACTA 

651 CGCCATTGCC TTTGCCGAAA CGGGGCATTT GTGTATGGCG ACGCTGCACG 

701 CCAACAGCAC CAATCAGGCA CTCGACCGCA TCATCAACTT TTTCCCCGAG 

751 GAGCGGCGCG AACAATTGCT GACGGATTTG TCGCTCAACC TTCAGGCGTT 

801 T ATT TCGCAA CGCCTCGTTC CGCGAGACGG CGGCAAGGGC AGGGTGGCGG 

851 CAGTCGAGGT GCTGCTCAAT TCGCCCCtGA TTTCGGAGTT GATTCACAAC 

901 GGCAACATCC ATGAAATCAA AGAAGTGATG AAAAAATCCA CTACCCTGGG 

951 TATGCAGACC TTCGATCAAC ACCTTTACCA ATTGTATGAA AAAGGCGATA 

1001 TTTCCCTGCA AGAAGCATTG AAAAATGCCG ATTCCGCACA CGATTTGCGT 

1051 TTGGCGGTAC AGTTGCGCAG CCGCCGCGCG CAaAGTTyCA GCCCCGATTT 

1101 GGnACTGCTC TGA 

This corresponds to the amino acid sequence <SEQ ID 1 1 12; ORF 272>: 

m272 .pep 

1 MTAKEELFAW LRHMXQNKGS DLFVTTHFPP AMKLDGKI TR ITDEPLTAEK 

51 CMEIAFSIMS AKQAEEFSST NECNFAISLP DTSRFRVNAM IQRGATALVF 

101 RTITSKIPKF ESLNLPPVLK DVALKKRGLV IFVGGTGSGK STSLASLIDY 

151 RNENSFGHII TIEDPIEFVH EHKNCIITQR EVGVDTENWM AALKNTLRQA 

201 PDVILIGEIR DRETMDYAIA FAETGHLCMA TLHANSTNQA LDRI INFFPE 

251 ERREQLLTDL SLNLQAFISQ RLVPRDGGKG RVAAVEVLLN 3PLISELIHN 

301 GNIHEIKEVM KKSTTLGMQT FDQHLYQLYE KGDISLQEAL KNADSAHDLR 

351 LAVQLRSRRA QSXSPDLXLL * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 272 shows 97.6% identity over a 370 aa overlap with a predicted ORF (ORF 272.ng) 
from N. gonorrhoeae: 

m272/g272 

10 20 30 40 50 60 

m272 . pep MTAKEELFAWLRHMXQNKGSDLFVTTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
I I I I ! I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g272 MTAKEELFAWLRHMNECNKGSDLFVTTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
10 20 30 40 50 60 

70 80 90 100 110 120 

m272.pep AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRTITSKIPKFESLNLPPVLK 
li | I I I I I I I I I I i I I I I I I I I I II I I I I I I II I I I i I I I I : I I I I I I I I I ! I I I I I : I I 
g 272 AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRAITSKIPKFESLNLPPALK 

70 80 90 100 110 120 

130 140 150 160 170 180 

m272 . uep DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHIITIEDPIEFVHEHKNCIITQR 
I | | I M I I I I I I I I I i I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g 272 DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHIITIEDPIEFVHEHKNCIITQR 

130 140 150 160 170 180 

190 200 210 220 230 240 

n272 . pep EVGVDTENW^4AALKNTLRQAPDVrLIGEIRDRETMDYAIAFAETGHLCMATLHANSTNQA 
I | | | I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I ! I I I I II I I I I I I I I I I I I I 
g 2 7 2 EVGV DTENWMAALKNTLRQAPD V I LI GEI RDRETMDYAI AFAETGHLCMATLHAN STNQA 

190 200 210 220 230 240 

250 260 270 280 290 300 

m272 . pep LDRI INFFPEERREQL.LTDLSLNLQAFISQRLVPRDGGKGRVAAVEVLLKSPLISELIHN 

I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I II I I 1 1 I I I I I I I I I I I II I I I II I I ! I I 
g272 LDRIINFFPEERREQLLTDLSLNLQAFISQRLVPRDGGKGRVAAVEVLLNSPLISELIHN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m272.pep GNIHEIKEVMKKSTTLGMQTFDQHLYQLYEKGDISLQEALKNADSAHDLRIAVQLRSRRA 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I : I I I I : I I I I I I I I I I I I II I I I I I I I I 
g272 GNIHEIKEVMKKSTTLGMQTFDQHLYQLYEKGEISLQDALKNADSAHDLRLAVQLRSRRA 

310 320 330 340 350 360 
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m272.pep 
g272 



370 
QSXSPDLXLLX 
II : I I I III 
QSSDPDLELLX 

370 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 1 13>: 

a272 . seq 

1 ATGACCGCAA AGGAAGAACT GTTCGCATGG 

51 CAAAGGTTCC GACCTGTTCG TGACGACCCA 

101 TGGACGGCAA AATCACCCGC ATCACGGACG 

151 TGTATGGAAA TCGCCTTTTC GAT TAT GAG T 

201 TTCATCGACC AACGAGTGCA ACTTCGCCAT 

251 GCTTCCGCGT CAATGCGATG ATACAGCGCG 

301 CGTGCGATTA CCAC-CAAGAT TCCCAAGTTT 

351 GGTCTTGAAG GATGTCGCGC TGAAAAAACG 

4 01 GCGGCACCGG CTCGGGCAAA TCGACTTCGC 

451 CGCAATGAAA ATTCGTTCGG ACACATCATC 

501 GTTTGTCCAC GAACACAAAA ACTGCATCAT 

551 TGGATACGGA AAACTGGATG GCGGCGTTGA 

601 CCGGATGTGA TTCTGATCGG CGAAATCCGC 

651 CGCCATTGCT TTTGCCGAAA CGGGGCATTT 

701 CCAACAGCAC CAATCAGGCA CTCGACCGCA TCATCAACTT 

751 GAGCGGCGCG AACAATTGCT GACGGATTTG TCGCTCAACC 

301 TATTTCGCAA CGCCTCGTTC CGCGAGACGG 

851 CAGTCGAGGT GCTGCTCAAT TCGCCCCTGA 

901 GGCAATATCC ATGAAATCAA AGAAGTGATG 

951 TATGCAGACT TTCGACCAAC ACCTTTACCA A7TGTATGAA AAAGGCGAGA 

1001 TTTCCTTGCA GGATGCCTTG AAAAATGCCG ATTCCGCACA CGATTTGCGT 

1051 TTGGCGGTAC AGTTGCGCAG CCGCCAGGCG CAAAGTTCCG GTCCCGATTT 

1101 GGAACTGCTC TGA 



CTGCGCCATA 
TTTCCCGCCC 
AACCGCTGAC 
GCGAAGCAGG 
CAGTCTGCCG 
GTGCGACGGC 
GAAAGCCTGA 
CGGGCTGGTT 
TTGCCTCGCT 
ACCATCGAAG 
CACCCAGCGC 
AAAACACGCT 
GACCGCGAAA 
GTGTATGGCG 



CGGCAAGGGC 
TTTCGGAGTT 
AAAAAATCCA 



TGAACAAAAA 
GCAATGAAGC 
GGCGGAAAAA 
CGGAAGAATT 
GACACCAGCC 
GTTGGTATTC 
ACCTGCCGCC 
ATTTTTGTCG 
TATCGACTAC 
ACCCGATCGA 
GAGGTCGGCG 
GCGTCAGGCA 
CAATGGACTA 
ACGCTGCACG 
TTTCCCCGAG 
TTCAGGCATT 
AGGGTGGCGG 
GATTCACAAC 
CTACCCTGGG 



This corresponds to the amino acid sequence <SEQ ID 1 1 14; ORF 272.a>: 

a272.pep 

1 MTAKEELFAW LRHMNKNKGS DLFVTTHFPP AMKLDGKITR 

51 CMEIAFSIMS AKQAEEFSST NECNFAISLP D7SRFRVNAM 

101 RAITSKIPKF ESLNLPPVLK DVALKKRGLV IFVGGTGSGK 

151 RNENSFGHII TIEDPIEFVIl EHKNCIITQR EVGVDTENWM 

201 PDVILIGEIR DRETMDYAIA FAETGHLCMA TLHANSTNQA 

251 ERREQLLTDL SLNLQAFISQ RLVPRDGGKG RVAAVEVLLN 

301 GNIHEIKEVM KKSTTLGMQT FDQHLYQLYE KGEISLQDAL 

351 LAVQLRSRQA QSSGPDLELL * 



m272/a272 97.6% identity in 370 aa overlap 

10 20 30 



40 



50 



ITDEPLTAEK 
IQRGATALVF 
STSLASLIDY 
AALKNTLRQA 
LDRIINFFPE 
SPLISELIHN 
KNADSAHDLR 



60 



m272 .pep MTAKEELFAWLRIiMXQNKGSDLF/TTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
I I I I I I I I I I I I i I : I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I j I II I I I I ! I i I 
a272 MTAKEELFAWLRHMNKNKGSDLFVTTHFPPAMKLDGKITRITDEPLTAEKCMEIAFSIMS 
10 20 30 40 50 60 

70 80 90 100 110 120 

m272.pep AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRTITSKIFKFESLNLPPVLK 
I I I I I I I I I I I I I I i I I I I i I I I I I I I I I II I I I I I I I I I I : I I I I I I I I I I I I I I ! I I I 
a272 AKQAEEFSSTNECNFAISLPDTSRFRVNAMIQRGATALVFRAITSKIPKFESLNLPPVLK 

70 80 90 100 110 120 

130 140 150 160 170 180 

m272.pep DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHIITIEDPIEFVHEHKNCIITQR 
I I I I I t I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 
a272 DVALKKRGLVIFVGGTGSGKSTSLASLIDYRNENSFGHIITIEDPIEFVHEHKNCIITOR 

130 140 150 160 170 180 
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190 200 210 220 230 240 

m272 . pep EVGVDTENWMAALKNTLRQAPDVILIGEIRDRETMDYAIAFAETGHLCMATLHANSTNQA 
I I I I I 1 I 1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I 
a272 EVGVDTENWMAALKNTLRQAPDVILIGEIRDRETMDYAIAFAETGHLCMATLHANSTNQA 

190 200 210 220 230 240 

250 260 . 270 280 290 300 

m272.pep L DRI I N FF PEERREQLLT DL S LNLQA FI SQRL VPR DGG KGRV AAVEVLLN SPLISELIHN 

I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I M I I I I 
a272 LDRI IN FFPEERREQLLTDLS LNLQA FI SQRLVPRDGGKGRVAAVEVLLNSPLI SELIHN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m272.psp GNIKEIKEVMKKSTTLGNQTFDQHLYOLYEKGDISLOEALKNADSAHDLRLAVQLRSRRA 

I I I I I I I I I I I I I I I II I It I I I I I I t I t I I I : I I I I : I I t I I I i I I I I I I I I I I i I I : I 
a272 GNIHEIKEVMKKSTTLGMQTFDQHLYQLYEKGEISLQDALKNADSAHDLRLAVQLRSRQA 

310 320 330 340 350 360 

370 

m2 72 . pep QSXS PDLXLLX 

II : I I I III 
a272 QSSGPDLELLX 

370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 1 5>: 

g273 .seq 

1 atgagtcttc aggcggtatt tatatacccc 

51 caacgaaaat caggaaaacg gcggtaaagc 

101 gcaaacacac cgaccgccgt caggatatag 

151 ccattcaccg ttttcctgcc gtttcttgtc 

201 tgccggcaag cagcgcggca gccgcgcccg 

251 gttgttcctt aacggttaaa aacccgcccg 

301 cgggaaattg caaaatttgt ttgcgggcgc 

351 gtttgagaag tgtttccnac gcgcccgccc 

401 gtcgctcacc tgcaaaatcg ccaagaacgc 

451 tgcccacttg tttcatacgg cgtttgcctg 

501 tttttcttac gcgtaa 
This corresponds to the amino acid sequence <SEQ ID 1 1 16; ORF 273.ng>: 

g273 .pep 

1 MSLQAVFIYP PSRTAQYNEN QENGGKAHKQ GQSGKHTDRR QDIGVFEAGT 

51 PFTVFLPFLV AFEIKDDAGK QRGSRARHWH CVHCCSLTVK NPPGRATVLR 

101 REIAKFVCGR VPLKSRRFEK CFXRARPMCR NIICRSPAKS PRTRFAEFPR 

151 CPLVSYGVCL LFVFQAVF SY A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1 7>: 

m273 . seq 

1 ATGAGTCTTC AGGCGGTATT TATATACCCra CCAAGCCGTA CCGCACAATA 

51 CAACGAAAAT CAGGAAAACG GCGGTAAAGC TCAyAAACAG GGACAAAGCG 

101 GCAAACACGC CGACCGCTGT CAGGATATAG GCGTATTCAA GGCCGGAACT 

151 CCATTCCCCG TTTTCCTGCC GCTTCTTGTC GCTTTTGAAA TAAAGGATGA 

201 TGCCGGCAAG CAGCGCGGCA GCCGCGCCCG ACATTAGCAT TGTGTTCATT 

251 GTTGTTCCTT AATGCTTAAA AACCCGCCTG TCCGTGCAAC CGTTTTAAGG 

301 CGGCAAATTG CAAAATTTGT TTGCGGGCGC GTGCCCCTGA AATCAGGGCG 

351 GTTTGAGGGG TGTTCCCGAC GCGCCGCCCT GTGTGCCGGA GTTATTTGTC 

401 GCTCACCTGC AAAATCGCCA AGAACGCGCT TTGCGGAATT TCCACATTGC 

451 CCACTTGTTT CATACGGCGT TTACCTG CCT TTTGTkTwTC AAGCAGTTTT 

501 7TCTTACGCG TAA 

This corresponds to the amino acid sequence <SEQ ID 1 1 1 8; ORF 273>: 
m273 .pep 

1 MSLQAVFIYP PSRTAQYNEN QENGGKAHKQ GQSGKHADRC QDIGVFKAGT 

51 PFPVFLPLLV AFEIKDDAGK QRGSRARH*H CVHCCSLMLK SPPVRATVLR 

101 RQIAKFVCGR VPLKSGRFEG CSRRAALCAG VICRSPAKSP RTRFAEFPHC 

151 PLVSYGVYLP FVXQAVF SYA * 

Computer analysis of this amino acid sequence gave the following results: 



ccaagccgta 
tcataaacag 
gcgtattcga 
gcttttgaaa 
acattggcat 
gccgtgcaac 
gtgccgctga 
tatgtgccga 
gctttgcgga 
ctttttgttt 



ccgcacaata 
ggacaaagcg 
ggccggaact 
taaaggatga 
tgtgttcatt 
cgctttaagg 
aatcaaggcg 
aatattattt 
atttccacgt 
ttcaagcagt 
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Homology with a predicted OKF from N. gonorrhoeae 

ORF 273 shows 86.0% identity over a 171 aa overlap with a predicted ORF (ORF 273.ng) 
from N. gonorrhoeae: 

m273/g273 

10 20 30 40 50 60 

m27 3 . pep M SLQ A V F I Y P P S RT AQ YN ENQEN GGKAHKQGQS GKH A DRC QD I G V FKAG TPFPVFLPLLV 

I | | I I I I I I I I I I J I I I I I I t I I i I I I I I I I II I I I : I i 111111:11111 1111:11 
g 273 MSLQAVFIYPPSRTAQYNENQENGGKAHKQGQSGKHTDRRQDIGVFEAGTPFTVFLPFLV 

10 20 30 4 0 50 60 

70 80 90 100 110 120 

m273.pep AFEIKDDAGKQRGSRARHXHCVHCCSLMLKNPPVRATVLRRQIAKFVCGRVPLKSGRFEG 

I || H I I I II I I I I I I I I I II I I I I I : I I I I I I I I I II : I I I I I I I I I I I I I ill 
g 273 AFEIKDDAGKQRGSRARHWHCVHCCSLTVKNPPGRATVLRREIAKFVCGRVPLKSRRFEK 

70 80 90 100 - 110 120 



130 140 150 160 170 

m27 3 . pep CSRRA-ALCAGVICRSPAKSPRTRFAEFPHCPLVSYGVYLPFVXQAVFSYAX 
I II : I : : I I II I I I I II I I I I I I I : I I I I I I II I II I I I I! II I 
g273 C FX RARPMCRN 1 1 C R S P AKS PRTR FAE F PRC PLVSYGVCLLFV FQ A V FS Y AX 

130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 19>: 

AGGCGGTATT TGTATACCCC CCAAGCCGTA CCGCACAATA 
CAGGAAAACG GCGGTAAAGC TCATAAACAG GGACAAAGCG 
CGACCGCCGT CAGGATATAG GCGTATTCCA GACCGGAACT 



a273 . seq 




1 


ATGAGTCTTC 


51 


CAACGAAAAT 


101 


GCAAACACGC 


151 


CCATTCACCG 


201 


TGCCGGCAAG 


251 


GTTGTTCCTT 


■ 301 


AGGCGGTAAA 


351 


GCGGTTTAAG 


401 


GCTCACCTGC 


451 


CCACTTGTTT 


501 


TTCTTACGCG 



GGGCATGATG CACTGCCCCG 



CATACGGCGT TTGCCTGCTT 



TGTGCCGGAT 
TTGCGGGATT 
TTTGTTTTTC 



ATTATTTGTC 
TCCACATTGC 
AAGCAGTTTT 



This corresponds to the amino acid sequence <SEQ ID 1 120; ORF 273.a>: 

a273 .pep 

1 MSLQAVFVYP PSRTAQYNEN QENGGKAHKQ GQSGKHADRR QDIGVFQTGT 

51 PFTVFLPLFV AFEIKDDAGK QRGSRARH*H NVHCCSLTVK NPPVRATVFK 

101 RR*ITKFVGG RALLQSGRFK GHDALPRVPD IICRSPAKLP RTRFAGFPHC 

151 PLVSYGVCLL FVFQAV FSYA * 

m273/a273 80.1% identity in 171 aa overlap 

10 20 30 40 50 60 

m273 . pep MSLQAVFI YPPSRTAQYNENQBNGGKAHKQGQSGKHADRCQDrGVFKAGTPFPVFLPLLV 

! I I I I I I : I I I I I 1 1 I I 1 1 I I I I I II I 1 1 1 1 1 1 1 1 I I I I I I I I I I :: I I I I I I 1 1 I : I 
a273 MSLQAVFVYPPSRTAQYMENQENGGKAHKQGQSGKHADRRQDIGVFQTGTPFTVFLPLFV 

10 20 30 40 50 60 



70 80 90 100 110 119 

m273 . pep AFEIKDDAGKQRGSRARHXHCVHCCSLMLKNPPVRATVL-RRQIAKFVCGRVPLKSGRFE 
I I I I I I I I I I I I I I I I I 1 1 I HUM : I I 1 1 I I I II : II I : I I I 1 1 : I : I I I I : 
a273 AFEIKDDAGKQRGSRARHXHNVHCCSLTVKNPPVRATVFKRRXITKFVGG RALLQSGRFK 

70 80 90 100 110 120 



120 130 140 150 160 170 

m273.pep GCSRRAALCAGVICRSPAKSPRTRFAEFPHCPLVSYGVYLPFVXQAVFSYAX 
I : : MINIM MINI MUM Mill I It IM MM! 
a273 GH DALPRV -P DI I C RS PAKL PRTRFAG FPHC PLVSYGVCLL FVFQA VFS Y AX 

130 140 150 160 170 
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The following partial DNA sequence was identified in M. gonorrhoeae <SEQ ID 1 121>: 

g274 .seq 

1 ATGGCGGGGC CGATTTTTGT CGTCatCGCC AgcgTCGCTA TGTTTTTTGT 

51 CGCGCAGCAG CACGCGACAG ATTTGGTTAC GGACGATTAT TATAAGGATG 

101 GCAAGCATAT CGACATCCAG CTTCATCGGG ATGAAGAAGC CGTCAGACGG 

151 CATATCGGGG TGCAGGTCCT CATTTCTCCC GATATGAATG CGGCAAAAGT 

201 GTTTGTCGGc ggCgagtTTG ACGGCAAACA GCCTTTGAAC CTGCTGCTGA 

251 TGCACCCGAC CCGCAAGGCG GACGATCAAA CCGTCGCCCT CAAGCCCGTC 

301 GGCAGCGCGC AGAACGGCAG GGCGGAATAT GAGGCGGTgt tcaaAACCCT 

351 TCCGCCGGCC AACCACTGGT ATGTGCGCGT GGAggacgCG GCAGGCGTGT 

401 GGCGCGTCGA GAACAAATGG ATTACCAGCC AGGGCAATGC GGTCGATTTG 

451 ACCCCGATGG ACAAACTTTT CAATAATGCA GGAAGCAAAT AA 

This corresponds to the amino acid sequence <SEQ ED 1 122; ORF 274 .ng>: 

g274 .pep 

1 MAGPIFWIA SVAMFFVAQQ HA TDLVTDDY YKDGKHIDIQ LHRDEEAVRR 

51 HIGVQVLISP DMNAAKVFVG GEFDGKQPLN LLLMHPTRKA DDQTVALKPV 

101 GSAQNGRASY EAVFKTLPPA NHWYVRVEDA AGVWRVENKW ITSQGNAVDL 

151 TPMDKLFNNA GSK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 123>: 

m274 .seq 

1 ATGGCGGGGC CGATTTTTGT CGTCATCGCC AGCGTCGCTA TGTTTTTTGT 

51 CGCGCAGCAG CACGCGACAG ATTTGGTTAC GGACGATTAT TATAAAGACG 

101 GCAAACATAT CGACATCCAG CTTCATCGGG ATGAAGAAGC CGTCAGACGG 

151 CATATCGGGG TGCAGGTTCT CATTTCCCCC GATATGAATG CGGCAAAAGT 

201 GTTTGTCGGC GGCGAGTTTG ACGGCAAACA GCCTTTGAAC CTGCTGCTGA 

251 TGCACCCGAC CCGCAAGGCG GACGATCAAA CCGTCGCCCT CAAGCCCGTC 

301 GGCAGCGCGC AGAACGGCAG GGCGGAATAT GAGGCGGTGT TCAAAACCCT 

351 TTCGCCGACC AACCACTGGT ATGTGCGCGT GGAGGACGCG GCAGGCGTGT 

401 GGCGCGTCGA GAACAAATGG ATTACCAGCC AAGGCAATGC GGTCGATTTG 

451 ACCCCGATGG ACAAGCTTTT CAATAATACT GAAAGCAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 1 124; ORF 274>: 

m274 .pep 

1 MAGPIFWIA SVAMFFVAQQ HA TDLVTDDY YKDGKHIDIQ LHRDEEAVRR 

51 HIGVQVLISP DMNAAKVFVG GEFDGKQPLN LLLMHPTRKA DDQTVALKPV 

101 GSAQNGRAEY EAVFKTLSPT NHWYVRVEDA AGVWRVENKW ITSQGNAVDL 

151 TPMDKLFNNT ESK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from AT. gonorrhoeae 

ORF 274 shows 97.5% identity over a 163 aa overlap with a predicted ORF (ORF 274.ng) 
from N. gonorrhoeae: 

g274/m274 

10 20 30 40 50 60 

g274 . pep MAGPIFWIASVAMFFVAQQHATDLVTDDYYKDGKHIDIQLHRDEEAVRRHIGVQVLISP 
[ I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I M I I I I I I I I I I 
m274 MAGPIFWIASVAMFFVAQQHATDLVTDDYYKDGKHIDIQLHRDEEAVRRHIGVQVLISP 

10 20 30 40 50 60 

70 B0 90 100 110 120 

g274.pep DMNAAKVFVGGEFDGKQPLNLLLMHPTRKADDQTVALKPVGSAQNGRAEYEAVFKTLPPA 
I I I I I I 1 I I I I I I I I I I I I 1 I I I i I I I I I I I II I I I II I I I I I I I I I I I I II I I I I I I : 
m274 DMN AAKV FVGGE F DG KQ P LN LLLMH PT RKAD D QT V ALK P VG S AQN GRAE YE AVFKT L S PT 

70 60 90 100 110 120 

130 140 150 160 

g274 . pep NHWYVRVEDAAGVWRVENKWITSQGNAVDLTPMDKLFNNAGSKX 
I I I I I I I I I I I I II II I I I I I I II I I I I II II I I I I I I I : III 
m274 NHWYVRVEDAAGVWRVENKWITSQGNAVDLTPMDKLFNNTESKX 

130 140 150 160 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 125>: 

a274 . seq 

1 ATGGCGGGGC CGATTTTTGT CGTCATCGCC AGCGTCGCTA TGTTTTTTGT 

51 CGCGCAGCAG CACGCGACAG ATTTGGTTAC GGACGATTAT TATAAAGACG 

101 GCAAGCATAT CGACATCCAG CTTCATCGGG ATGAAGAAGC CGTCAGACGG 

151 CATATCGGGG TGCAGGTTCT CATTTCCCCC GATATGAATG CGGCAAAAGT 

201 GTTTGTCGGC GGCGAGTTTG ACGGCAAACA GCCTTTGAAC CTGCTGCTGA 

251 TGCACCCGAC CCGCAAGGCG GACGATCAAA CCGTCGCCCT CAAGCCCGTC 

301 GGCAGCGCGC AGAACGGCAG GGCGGAATAT GAGGCGGTGT TCAAAACCCT 

351 TTCGCCGACC AACCACTGGT ATGTGCGCGT GGAGGACGCG GCAGGCGTGT 

4 01 GGCGCGTCGA GAACAAATGG ATTACCAGCC AAGGCAATGC GGTCGATTTG 

4 51 ACCCCGATGG ACAAACTTTT CAATAATACT GAAAGCAAAT AA 

This corresponds to the amino acid sequence <SEQ ID 1 126; ORF 274.a>: 

a274 .pep 

1 MAGPIFWTA SVAMFFVAQQ HA TDLVTDDY YKDGKHIDIQ LHRDEEAVRR 

51 HIGVQVLISP DMNAAKVFVG GEFDGKQPLN LLLMHPTRKA DDQTVALKPV 

101 GSAQNGRAEY EAVFKTLSPT NHWYVRVEDA AGVWRVENKW ITSQGNAVDL 

151 TPMDKLFNNT E5K* 

m274/a274 100.0% identity in 163 aa overlap 

10 20 30 40 50 60 

m274 .pep MAGPIFWIASVAMFFVAQQHATDLVTDDYYKDGKHIDIQLHRDEEAVRRHIGVQVLISP 
I 1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a2 7 4 MAGP I FVV IAS VAMFFVAQQHATDLVTDDY YKDGKH I D IQLHRDEEAVRRHI GVQVL ISP 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 74 . pep dmnaakvfvggefdgkqplnlllmhptrkaddqtvalkpvgsaqngraeyeavfktlspt 
I I I 1 1 1 1 1 1 1 II I 1 1 I M II 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 I 1 1 1 1 I 

a274 DMNAAKVFVGGEFDGKQPLNLLIUHPTRKADDQTVALKPVGSAQNGRAEYEAVFKTLSPT 
70 80 90 100 110 120 

130 140 150 160 

m274 .pep NHWYVRVEDAAGVWRVENKWITSQGNAVDLTPMDKLFNNTESKX 
I I I I I II I I I I M I I I I I II I I I I II I I I I I I I I I I I I I I II I I 
a2 7 4 NH WY VRVE DAAGVWRVEM KW I T SQGN AV DLTPMDKL FN NT E S XX 

130 140 150 160 



The following partial DNA sequence was identified in ^.gonorrhoeae <SEQ ID 1 127>: 

g276 .seq 

l atgattttgc cgccatccat gacgatgatg cggtcggcgg attcgacggt 

51 ggtcaggcgg tgggcgacga tgatgccggt gcggttttcc atcaggcgtt 

101 cgagcgcttg ttggacgagg cgttcggatt cgttgtccaa tgcgctggtg 

151 gcttcgtcca ataataatat cggcgcgtct ttcaaaatgg cgcgggcgat 

201 ggcgacgcgt tgccgctgtc cgccggataa gttgctgccg ttcgatccga 

251 tgggctggtg cagtccgagc ggggatgcgt cgatcaggct ttgcaggttg 

301 gcggcttgga gggcggacag gacttcggct tcgcccgcgt cgggacggct 

351 gcatcggacg ttttcaaaca gggtgtcgtc aaacaggaat acgtcttggg 

401 agacgagggc gaattgggcg cgcaggcagt cgagtttgat gtcggcgatg 

451 tcgataccgt ctatgcagat gttgccggca gacggttcga caaagcgggg 

501 cagaaggttg acgacggtgg atttgccgct gccggaacgt ccgaccaggc 

551 cgacgcgttc gccttgtctg atgtcgaggt tgaagttgtc gagggctttc 

601 atgccgtctg aacggtattc gacatcgacg ttgcggaagc tgatgcgccc 

651 ttcgacacgc tgcggcgcga gcgtgccttt gtcctgttcg ggcggggtgt 

701 cgagaaatgc acatacgccg tcggcggcga ggaacatcgt ctgcataggc 

751 atgctgatgt cggcaaggct tttgatgggg gcgtacattt gcagcatcgc 

801 gacgatgaat gccataaatt cgccgatggt ggtgtag 

This corresponds to the amino acid sequence <SEQ ID 1 128; ORF 276.ng>: 

g276 .pep 
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1 MILPPSMTMM RSADSTWRR WATMMPVRFS IRRSSACWTR RSDSLSNALV 

51 ASSNNNIGAS FKMARAMATR CRCPPDKLLP FDPMGWCSPS GDASIRLCRL 

101 AAWRADRTSA SPASGRLYRT FSNRVSSNRN TSWETRANWA RRQSSLMSAM 

151 SIPSMQMLPA DGSTKRGRRL TTVDLPLPER PTRATRSPCL MSRLKLSRAL 

2 01 MPSERYSTST LRKLMRPSTR CGASVPLSCS GGVSRNAHTP SAARNIVCIG 

251 MLMLARLLMG AYI CSIATMN AINSPMW* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 129>: 

m2 76.seq 

1 ATGATTTTGC CGTCGTCCAT CACGATGATG CGGTCGGCCC CTTCGATGGT 

51 GGTCAGGCGG TGGGCGACGA TGATGCCGGT GCGGTTTTCC ATCAGGCGTT 

101 CGAGCGCCTG TTGGACGAGG CGTTCGGATT CGTTGTCTAA TGCGCTGGTG 

151 GCTTCGTCCA ATAATAATAT CGGCGCGTCT TTCAAAATGG CGCGGGCAAT 

201 GGCGACGCGT TGCCGCTGTC CGCCGGATAA GTTGCTGCCG TTCGATCCGA 

251 TGGGCTGGTG CAGTCCGAGC GGGGAGCTGT CAATCAGGCT TTGCAGGTTG 

301 GCGGTTTGGA GGGCGAACAG GACTTCGGCT TCGCCCGCGT CGGGACGGCT 

351 GTATCGGACG TTTTCAAACA GGGTGTCGTC AAACAGGAAT ACGTCTTGGG 

401 AGACGAGGGC GAATTGGGCG CGCAGGCAGT CGAGTTTGAT GTCGGCGATG 

4 51 TCGATACCGT CTATGCAGAT GTTGCCGGCA GACGGTTCGA CAAAGCGGGG 

501 CAGCAGGTTG ACGACGGTGG ATTTGCCGCT GCCGGAACGT CCGACCAGGG 

551 CGACGCGTTC GCCTTGTCTG ATGTCGAGGT TGAAGTTGTC GAGGGCTTTG 

601 ATGCCGTCTG AACGGTATTC GACATCGACG TTGCGGAAGC TGATGCGCCC 

651 TTCGACACGC TGCGGTGCGA GCGTGCCCTT GTCCTGTTCG GGCGGGGTGT 

701 CGAGAAATGC ACATACACCG TCGGCGGCGA GGAACATCGT CTGCATAGGG 

751 ATGCTGATGT TGGCAAGGCT TTTGATGGGG GCGTACATTT GCAGCATCGC 

801 GACGATGAAT GCCATAAATT CGCCGATGGT GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 1 130; ORF 276>: 

m276.pep 

1 MILPSSITMM RSAPSMWRR WATMMPVRFS IRRSSACWTR RSDSLSNALV 

51 ASSNNNIGAS FKMARAMATR CRCPPDKLLP FDPMGWCSPS GBLSIRLCRL 

101 AVWRANRTSA SPASGRLYRT FSNRVSSNRN TSWETRANWA RRQSSLMSAM 

151 SIPSMQMLPA DGSTKRGSRL TTVDLPLPER PTRATRSPCL MSRLKLSRAL 

201 MPSERYSTST LRKLMRPSTR CGASVPLSCS GGVSRNAHTP SAARNI VCIG 

251 MLMLARLLMG AYI CSIATMN AINSPMW* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from gonorrhoeae 

ORF 276 shows 96.8% identity over a 278 aa overlap with a predicted ORF (ORF 276.ng) 
from TV. gonorrhoeae: 
m276/o276 

10 20 30 40 50 60 

m276.pep M I L P S S I TMMR S A P SMWRRWAT MMP VR FS I RR S S ACWT RR S D S L SN ALV AS SNNN I GAS 

I I I I 1:111111 I I I 1 I I I I I I I I I II I I I I I II 1 I I I I I I i I I I I I I I I I I I I I I I 
g276 MILPPSMTMMRSADSTVVRRWATMMPVRFSIRRSSACWTRRSDSLSNALVASSNNNIGAS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m27 6.pep FKMARAMATRCRCPPDKLLPFDPMGWCSPSGELSIRLCRLAVWRANRTSASPASGRLYRT 
i I II I II I I I I M I : I I I I I I I I I I I I I I I I : I I I I I I M : I I I : I I II I I I I t I I ! I I 
g276 FKMARAMATRCRCPPDKLLPFDPMGWCSPSGDASIRLCRLAAWRADRTSASPASGRLYRT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m27 6 . pep FSNRVSSNRNTSWETRANWARRQSSLMSAMSI PSMQMLPAD3STKRGSRLTTVDLPLPER 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I 11 I I II I I I I I 
g276 FSNRVSSNRNTSWETRANWARRQSSLMSAMSIPSMQMLPAD3STKRGRRL7TVDLPLPER 

130 140 150 160 170 180 

190 200 210 220 230 240 

m276.pep PTRATRSPCLMSRLKLSRALMPSERYSTSTLRKLMRPSTRCGASVFLSCSGGVSRNAHTP 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 " 1 1 1 1 1 II 1 1 1 1 1 1 11 1 1 1 1 1 1 1 i 1 1 1 1 1 1 I 1 1 1 1 1 ! 1 1 1 1 1 i 1 1 | 

g27 6 PTRATRSPCLMSRLKLSRALMPSERYSTSTLRKLMRPSTRCGASVPLSCSGGVSRNAHTP 
190 200 210 220 230 240 
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250 260 270 279 

m27 6. pep S AARNI VC I GM LM LARL LMG AY I C S I ATMN A INS PMWX 

I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
g27 6 SAARNIVCIGMLMLARLLMGAYICSIATMNAINSPMWX 

250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13 1>: 

a276 . seq 

1 ATGATTTTGC CGTCGTCCAT TACGATGATG CGGTCGGCCC CTTCGATGGT 

51 GGTCAGGCGG TGGGCGACGA TGATGCCGGT GCGGTTTTCC ATCAGGCGTT 

101 CGAGCGCCTG TTGGACGAGG CGTTCGGATT CGTTGTCCAA TGCGCTGGTG 

151 GCTTCGTCCA ATAATAATAT CGGCGCGTCT TTCAAAATGG CGCGGGCAAT 

201 GGCAACGCGT TGCCGCTGTC CGCCGGATAA GTTGCTGCCG TTCGATCCGA 

251 TGGGCTGGTG CAGTCCGAGC GGTGATGCGT CGATCAGGCT TTGCAGGTTA 

301 GCGGCTTGGA GGGCGGATAG GACTTCGGCT TCGCCCGCGT CGGGACGGCT 

351 ATATCGGACG TTTTCAAACA GGGTGTCGTC AAACAGGAAT ACGTCTTGGG 

4 01 AGACGAGGGC AAATTGGGCG CGCAGGCAGT CGAGTTTGAT GTCGGCGATG 

4 51 TCGATACCGT CTATGCAGAT GTTGCCGGCA GACGGTTCGA CAAAGCGGGG 

501 CAGCAGGTTG ACGACGGTGG ATTTGCCGCT GCCGGAACGT CCGACCAGGG 

551 CGACGCGTTC GCCTTGTCTG ATGTCGAGGT TGAAGCCGTC GAGGGCTTTG 

601 ATGCCGTCCG AACGGTATTC GACATCGACG TTGCGGAAGC TGATGCGCCC 

651 TTCGACACGC TGCGGTGCGA GCGTGCCTTT GTCCTGTTCG GGCGGGGTGT 

701 CGAGAAATGC ACATACGCCG TCGGCGGCGA GGAACATCGT CTGCATAGGG 

751 ATGCTAATGT TGGCAAGGCT TTTGATGGGG GCGTACATTT GCAGCATCGC 

801 GACGATGAAT GCCATAAATT CGCCGATGGT GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 1 132; ORF 276.a>: 

a27 6 .pep 

1 MILPSSITMM RSAPSMVVRR WATMMPVRFS IRRSSACWTR RSDSLSNALV 

51 ASSNNNIGAS FKMARAMATR CRCPPDKLLP FDPMGWCSPS GDASIRLCRL 

101 AAWRADRTSA SPASGRLYRT FSNRVSSNRN TSWETRANWA RRQSSLMSAM 

151 SIPSMQMLPA DGSTKRGSRL TTVDLPLPER FTRATRSPCL M5RLKPSRAL 

201 MPSERYSTST LRKLMRPSTR CGASVPLSCS GGVSRNAHTP SAARNIVCIG 

251 MLMLARLLMG AYI C5IATMN AINSPMVV* 

m276/a276 98.2% identity in 278 aa overlap 

10 20 30 40 50 50 

m276.pep M I L PS S I TMMRSA P S MWRRWATMMP V RFS I RRS S ACWT RRS D S L SN ALV AS S NNN I GAS 

1 1 1 1 1 1 1 1 1 1 1 1 I M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 i 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 I 
a276 MILPSSITMMRSAPSMVVRRWATMMPVRFSIRRSSACWTRRSDSLSNALVASSNNNIGAS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m276 . pep FKMARAMATRCRCPPDKLLPFDPMGWCSPSGELSIRLCRLAVWRANRTSASPASGRLYRT 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I i : t I I : I I I I I I I t I I I I I I 
a276 FKMARAMATRCRCPPDKLLPFDPMGWCSPSGDASIRLCRLAAWRADRTSASPASGRLYRT 
70 80 90 100 110 120 

130 140 150 160 170 180 

m276 . pep FSNRVSSNRNTSWETRANWARRQS SLMSAMSI PSMQMLPADGSTKRGSRLTTVDLPLPSR 

I I I I I I I I I I I I I I I I i I I I I ! I I I 1 I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I i I t 
a276 FSNRVSSNRNTSWETRANWARRQSSLMSAMSIPSMQMLPADGSTKRGSRLTTVDLPLP2R 

130 140 150 160 170 130 

190 200 210 220 230 240 

m276 . pep PTRAT RS PCLMS RLK.LS RALMPSERY ST STLRKLMRPSTRCGAS VPLSCSGG VS RN AHT P 

1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 M 1 1 1 1 1 1 ; I I I I I I I II I I 1 I I I I I 

a276 PTRATRSPCLMSRLKPSRALMPSERYSTSTLRKLMRPSTRCGASVPLSCSGGVSRNAHTP 
190 200 210 220 230 240 

250 260 270 279 

m276 . pep SAARNIVCIGMLNLARLLMGAYICSIATMNAINS PMVVX 

I I I I I I I I I I I I I I ! I I I I I I I I I I I I i I I I I I I I i I I I 
a276 S AA RN I VC I GMLMLA RL LMG A Y I C S I ATMN AI NS PMVVX 
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250 260 270 



The following partial DNA sequence was identified in N gonorrhoeae <SEQ ID 1 133>: 



g277 . seq 


(partial) 










1 


. . atggtacacg 


tcgccgtagc 


ttacggtatt 


gccgtccggc 


gttcttgccc 


51 


aaacgaggtc 


atagacgttt 


tccacgcctt 


gcaggtacat 


cgccaagcgt 


101 


tcgatgccgt 


aggtaatttc 


gccgagtacg 


ggcgtgcaat 


cgataccgcc 


151 


gacttgttgg 


aaataggtaa 


actgggttac 


t tccatgccg 


ttgagccaga 


201 


cttcccagcc 


caaaccccac 


gcaccgaggg 


tggggttctc 


ccagtcgtct 


251 


tcgacaaagc 


ggatgtcgtg 


gactttggga 


tcgatgccca 


attcgcgcag 


301 


ggagtcgaga 


tagaggtctt 


ggatattggc 


gggggcgggt 


ttgagggcga 


351 


cttggaattg 


gtaatagtgt 


tgcaggcggt 


tggggttgtc 


gccgtagcgg 


401 


ccgtctttgg 


ggcggcggct 


gggttggacg 


taggcggcaa 


accaaggctc 


451 


ggggccgagc 


gcgcgcaggc 


aggtggcggg 


atgggatgtg 


ccggcaccga 


501 


cttccatgtc 


gaagggttgg 


atgacggtgc 


agcctttgtc 


tgcccagaag 


551 


gtttgcagtt 


tgaagatgat 


ttgttggaag 


gtaagcatgg 


cttattgttc 


601 


gataaaataa 


aggttttatt 


ttactgtttc 


catagccgct 


tgaatagatt 


651 


tatctcgaag 


acagcctga 









This corresponds to the amino acid sequence <SEQ ID 1 134; ORF 277.ng>: 

g277.pep (partial) 

1 . . MVHVAVAYGI AVRRFCPNEV IDVFHALQVH RQAFDAVGNF AEYGRAIDTA 

51 DLLEIGKLGY FHAVEPDFPA QTPRTEGGVF PWFDKADV\ f DFGIDAQFAQ 

101 GVEIEVLDIG GGGFEGDLE L VIVLQAVGW AVAAVF GAAA GLDVGGKPRL 

151 GAERAQAGGG MGCAGTDFHV EGLDDGAAFV CPEGLQFEDD LLEGKHGLLF 

201 DKIKVLFYCF HSRLNRFISK TA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 135>: 



m277 . seq 




1 


ATGCCCCGCT 


51 


TTTCTTCGGC 


101 


CGCAGCAGCC 


151 


GACTTCGTTT 


201 


GCGTTTTTGC 


251 


ATCGCCAAGC 


301 


GTCGATGCCG 


351 


CGTTGAGCCA 


401 


TCCCAGTCGT 


451 


CAATTCGCGC 


501 


GCTTGAGGGC 


551 


TCGCCGTAGC 


601 


AAACCAAGGC 


651 


TGCCGGCACC 


701 


TCTGCCCAGA 


751 


GGCTTATGA 



This corresponds to the amino acid sequence <SEQ ID 1 136; ORF 277>: 

m277.pep 

1 MPRFEDKLVG RQGEGGVFFG KQAFGLRFW VELAQQPVGI AVFEWGGLL 
51 DFVLWHVAV GDGVAVERFC PNEWDVFYT LQVHRQAFDA VGDFAEYGRA 
101 VDAADLLEIG KLGYFHAVEP DFPAQTPRAE GGVFPWFDK AD WD FG I DA 
151 QFAQRVEIEV LDIGGSGLEG DLE LVIVLQA VGWAVAAVF GAAAGLDVGG 
201 KPRLGABCAQ AGGC^IGCAGT DFHVEGLDDG AAFVCPECLQ FEDDLLEGKH 
251 GL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 277 shows 90.0% identity over a 221 aa overlap with a predicted ORF (ORF 277.ng) 
from N. gonorrhoeae: 

g277/m277 



g277 .pep 



10 20 30 

MVHVAVAYG IAVRRFCPNEV IDVFHALQVH 
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: I I I I I : I : f I: I I I I I I I : I I I :: II I I 
m277 GLRrWVELAQQPVGIAVFEVVGGLLDFVLWHVAVGDGVAVERrCPNEWDVFYTLQVH 
30 40 50 60 70 80 



40 50 60. 70 80 90 

g277 . pep RQAFDAVGNFAEYGRAIDTADLLEIGKLGYFHAVEPDFPAQT PRTEGGVFPWFDKADVV 

I I I I I I I I : I t I I f I I : I: I I I i I I I I I I I I I I If I I | I I | I I | : | | | | | 1 I M I I I I I I 
m277 RQAFDAVGDFAEYGRAVDAADLLEIGKLGYFHAVEPDFPAQTPRAEGGVFPWFDKADW 
90 100 110 120 130 140 



100 110 120 130 140 150 

g277 . pep DFGTDAQFAQGVEIEVLDIGGGGFEGDLELVIVLQAVGWAVAAVFGAAAGLDVGGKPRL 
I I I I 1 I II I I I i I I I I I I I I : I : I I I I I I I I I I I II I I I I I I I I I I I I | I I I | | | | I | | 
m277 D FG I DAQFAQRVE I E VLD I GG SGLEG DLE LVI VLQAVGVVAVAAVFGAAAGLDVGGKPRL 

150 160 170 180 190 200 

160 170 180 190 200 

g277 . pep G AE RAQ AGGGMGC AGT DFH VEGL DDG AA FVC P EG LQ FE DDL LEGKHG L L 

Mi I I I I I II I I I I I I I I I I I i I I I I I I II I I II I I I I I I I I I I I 
m277 GAECAQAGGGMGCAGTDFHVEGLDDGAAFVCPECLQFEDDLLEGKHGLX 
210 220 230 240 250 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 137>: 
3.211 . seq 

1 ATGCCCCGCT TTGAGGACAA GCTCGTAGGC AGGCAGGGCG AGGGCGGCGT 

51 TTTCTTCGGC AAGCAGGCGT TTGGCTTGCG CTTCGTAGTC GTTGAACTGG 

101 CGCAGCAGCC AATCGGCATC GCTGTATTCG AAGTTGTAGG TGGATTGTTC 

151 GACTTCGTTT TGGTGGTACA CGTCGCCGTA AGTTACTGTA TTACCGTCCA 

201 GCGTTTTTGC CCAAACGAGG TCATAGACGT TTTCCACGCC TTGCAGGTAC 

251 ATCGCCAAGC GTTCGATGCC GTAGGTGATT TCGCCGAGTA CGGGGGTGCA 

301 GTCGATGCCG CCGACTTGTT GGAAATAGGT GAACTGGGTT ACTTCCATAC 

351 CGTTGAGCCA GACTTCCCAG CCCAAACCCC ACGCGCCGAG GGTGGGGTTT 

4 01 TCCCAGTCGT CTTCGACAAA GCGGATGTCG TGCACTTTGG GGTCGATGCC 

451 CAATTCGCGC AGGGAGTCGA GATAGAGGTC TTGGATATTG GCGGGAGCGG. 

501 GCTTGAGGGC GACTTGGAAT TGGTAATAGT GTTGCAGGCG GTTGGGGTTG 

551 TCGCCGTAGC GACCGTCTTT GGGGCGGCGG CTGGGTTGGA CGTAGGCGGC 

601 AAACCAAGGC TCGGGGCCGA GTGCGCGCAG ACAGGTGGCG GGATGGGATG 

651 TGCCGGCACC GACTTCCATG TCGAAGGGTT GGATGACGGT GCAGCCTTTG 

701 TCTGCCCAGA ATGTTTGCAG TTTGAAGATG ATTTGTTGGA AGGTAAGCAT 

751 GGCTTATGA 

This corresponds to the amino acid sequence <SEQ ED 1 138; ORF 277.a>: 

a277 .pep 

1 MPRFEDKLVG RQGEGGVFFG KQAFGLRFVV VELAQQPIG I AVFEWGGLF 

51 DFVLVV HVAV SYCITVQRFC PNEVIDVFHA LQVHRQAFDA VGDFAEYGGA 

101 VDAADLLEIG ELGYFHTVEP DFPAQTPRAE GGVFPWFDK ADWHFGVDA 

151 QFAQGVEIEV LDIGGSGLEG DLE LVIVLQA VGVVAVATVF GAAAGLDVGG 

201 KPRLGAECAQ TGGGMGCAGT DFHVEGLDDG AAFVCPECLQ FEDDLLEGKH 

251 GL* 

m277/a277 92.5% identity in 252 aa overlap 

10 20 30 40 50 60 

m277.pep MPRFEDKI.VGRQGEGGVFFGKQAFGLRFVWELAGXJPVGIAVFEVVGGLLDFVLVVHVAV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : t I I I I I I I I I I : I I M I I I I II 
a277 MPRFEDKLVGRQGEGGVFFGKQAFGLRFWVELAQQPIGIAVFEVVGGLFDFVLVVHVAV 

10 20 30 40 50 60 

70 30 90 100 110 120 

m277 .pep GDGVAVERFCPNEVVDVFrTLQVHROAFDAVGDFAEYGRAVDAADLLEIGKLGYFHAVEP 
: l:IHI!ll:M l::MI MINIMUM III I I I I i i I I I I I : I I I I I : I 1 1 
a277 SYCITVQRFC PNEVIDVFHALQVHRQAFDA VGDFAEYGGA VDAADLLETGELGYFHTVEP 

70 80 90 100 110 120 



130 140 150 160 170 180 
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m277 . pep DFPAQTPRAEGGVFPVVFDKADWDFGIDAQFAQRVEJEVLDIGGSGLEGDLELVIVLQA 
I I I I I I j I I I I I I I I I I I I I I I I I t I : I I 1 I 1 I I I I I I I I I I I I I I I I I I I I I It I I I 
a277 DFPAQTPRAEGGVFPVVFDKADV\'HFGVDAQFAQGVEIEVLDIGGSGLEGDLELVIVLQA 
130 140 150 160 170 180 

190 200 210 220 230 240 

m277 .pep VG V VA VAA V FGAAAGL DVGG K P RLGAEC AQAGGG WGCAGT DFHV EG LD DG AA FVC P EC LQ 

I I I I I I I : I I I I I I I I I I I I I I I 1 I ! I I I I : I I I I I I ! I I I I I I I I I II I I I I I I I I I I I 
a277 VG WAV AT V FGAAAGL DVGG K P RLGA EC AQTGGGMGC AGT DFHV EGLD DG AA F VC P ECLQ 

190 200 210 220 230 210 

250 

m277.pep FEDDLLEGKHGLX 
I I I 1 1 Ml I I I 1 I 
a277 FEDDLLEGKHGLX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 139>: 

tcacgcccgg tgcgattttt tcgacagggg cggtcaaagt 



g278 . seq 


(partial) 


1 


ttgcgtgcaa 


51 


tgtattaatc 


101 


cgcgtccgac 


151 


caggtaaccg 


201 


ggcaccgacg 


251 


ccgggaattc 


301 


cgaacgatac 


351 


ggcatttaca 


401 


cagcaggatt 


451 


caaagcactc 


501 


cttcaacttt 



ccttgcattg catctgacaa accatggatg 



ta. . . 

This corresponds to the amino acid sequence <SEQ ID 1 140; ORF 278. ng>: 

g278.pep (partial) 

1 LRAITPGAIF STGAV KWLI GPLPSIGRPN ASTTRPTNSR PTGTSKIRPV 

51 QVTVSPSLIC SYSPNTTAPT ESRSRFIAKP KVLPGNSSIS PCIASDKPWM 

101 RTIPSVTEIT VPRVLTSAFT DR FSILALIR SLISAGLSC M KTLLIRHSRV 

151 QSTOFALYRQ IQNLITHFNF 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 141 >: 



m278 . seq. 




1 


TTGCGCGCAA 


51 


TGTATTAATC 


101 


CGCGTCCGAC 


151 


CAGGTAACCG 


201 


GGCG CCGACG 


251 


CCGGGAATTC 


301 


CGAACGATAC 


351 


GGCATTTACA 


401 


CAGCAGGATT 


451 


CAAGGCACTC 


501 


CTTCAACTTT 


551 


TTCAGCTCGC 


601 


CTGTTTGTCG 


651 


GGTTGAATGA 



This corresponds to the amino acid sequence <SEQ ID 1 142; ORF 278>: 

m278.pep 

1 LRAITPGAIF SIGA VKWLI GPLPSIGRPN ASTTRPTSSR PTGTSKIRPV 

51 QVTVSPSLMC SYSPNTTAPT ESRSRFIAKP KVLPGNSSIS PCIASDKPWM 

101 RTIPSVTEIT VPQVRTSAFT DR FSILALIK SLISAGLSCM KTLLIRHSRV 

151 QGTQFALYRQ IQNLITHFNF YAANQLRFDF DRDFQLAVET LIQHLHQLAD 

201 LFVGQRIGTV NDGRFDMVE* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from ^.gonorrhoeae 

ORF 278 shows 95.9% identity over a 170 aa overlap with a predicted ORF (ORF 278.ng) 
from N. gonorrhoeae: 

g278/m278 

10 20 30 40 50 60 

LRAITPGAIFSTGAVKVVLIGPLPSIGRPNASTTRPTNSRPTGTSKIRPVQVTVSPSLIC 
I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I : I I I I I I I 11 I I I I I I I I I I I : I 
LRAITPGAIFSIGAVKVVLIGPLPSIGRPNASTTRPTSSRPTGTSKIRPVQVTVSPSLMC 
10 20 30 40 50 60 

70 80 90 100 110 120 

SYSPNTTAPTESRSRFIAKPKVLPGNSSISPCIASDKPWMRTIPSVTEITVPRVLTSAFr 
I | I | | | I ) I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I II I I I I I I : I I I I If 
SYSPNTTAPTESRSRFIAKPKVLPGNSSISPCIASDKPWMRTIPSVTEITVPQVRTSAFT 
70 80 90 100 110 120 



g27 8.pep 
m27 8 

g278.pep 
m27 8 



130 140 150 160 170 

g278.pep DRFSILALIRSLISAGLSCMKTLLIRHSRVQSTQFALYRQIQNLITHFNF 
I II I I I I I I : I I I I II I I I I I I I I I ! I I I I I : 1 I I I I I I I I I I I M I M 1 
m27 8 DRFSILALIKSLISAGLSCMKTLLIRHSRVQGTQFALYRQIQNLITHFNFYAANQLRFDF 

130 140 150 160 170 180 



m 21 8 DRDFQLAVETLIQHLHQLADLFVGQRIGTVNDGRFDMVE* 
190 200 210 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 1 143>: 

a278 .seq 

1 TTGCGCGCAA TCACGCCCGG TGCGATTTTT TCGATAGGGG CGGTCAAAGT 

51 TGTATTAATC GGGCCTTTGC CGTCGATAGG CCGACCCAAT GCATCAACGA 

101 CGCGTCCGAC CAG7TCGCGT CCGACCGGCA CTTCCAAGAT ACGACCGGTA 

151 CAGGTAACCG TGTCGCCTTC TTTAATATGT TCGTGCTCGC CCAACACTAC 

201 GGCGCCGACG GAG?CGCGCT CCAGGTTCAT CGCCAAGCCG AAAGTGTTAC 

251 CCGGGAATTC GAGCATCTCA CCTTGCATTG CATCTGACAA ACCATGGATG 

301 CGAACGATAC CGTCAGTTAC CGAAATCACC GTACCACGGG TACGCACTTC 

351 GGCATTTACA GACAGATTTT CGATCTTGGC TTTAATCAAA TCGCTAATTT 

4 01 CAGCAGGATT AAGCTGCATG AAAACTCTCC TAATTCGTCA TAGTCGTGTA 

451 CAAGGCACTC AATTTGCCTT GTACAGACAA ATCCAAAACC TGATCACCCA 

501 CTTCAACTTT TATGCCGCCA ATCAGCTCCG GTTCGATTTC GACAGAGATT 

551 TTCAGCTCGC TGTCGAAACG CTTATTCAGC ATTTGCGCCA ACTCGCCGAC 

601 CTGTTTGTCG GTCAACGGAT AGGCACTGTA AATGACGGCA GATTTGATAT 

651 GGTTGAATGA 

This corresponds to the amino acid sequence <SEQ ED 1 144; ORF 278.a>: 
a27 8.pep 

1 LR AITPGAIF SIGAVKWLI GPLPSIGRPN ASTTRPTSSR PTGTSKIRPV 

51 QVTVSPSLIC SCSPNTTAPT ESRSRFIAKP KVLPGNSSIS PCIASDKPWM 

101 RTIPSVTEIT VPRVRTSAFT D RFSILAL1K SLISAGLSC M KTLLIRHSRV 

151 QGTQFALYRQ IQNLITHFNF YAANQLRFDF DRDFQLAVET LIQHLRQLAD 

201 LFVGQRIGTV NDGRFDMVE* 



m278/a278 98.2% identity in 21 9 aa overlap 

10 20 30 40 50 60 

ra278.pep LRAITPGAIFSIGAVKVVLIGPLPSIGRPNASTTRPTSSRPTGTSKIRPVQVTVSPSLMC 
I i I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I 1 I I I t I I I I I I I M t I I I I I ! t I I : I 
a278 LRAITPGAIFSIGAVKVVLIGPLPSIGRPNASTTRPTSSRPTGTSKIRPVQVTVSPSLIC 

10 20 30 40 50 60 



70 80 90 100 110 120 

ni278 .pep SYSPNTTAPTESRSRFIAKPKVLPGNSSISPCIASDKPWMRTIPSVTEI TVPQVRT3AFT 

I I i I I I i I I I I I t I I I II I I I I I I I I I I I I I t I I I I I f I I I I I I I I I I I I I : I I I I I I I 
a278 SCSPNTTAPTESRSRF1AKPKVLPGNSSISPCIASDKPWMRTI PS VT EI T VPRVRTSAFT 

70 80 90 100 110 120 
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130 140 150 1 60 170 130 

m278.pep DRFSILALIKSLISAGLSCMKTLLIRHSRVQGTQFALYRQIQHLITHFNFYAANQLRFDF 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I \ I I I I ! I I I I I I I i I I I I M I I I 
a278 DRFSILALIKSLISAGLSCMKTLLIRHSRVQGTQFALYRQIQNLITHFNFYAANQLRFDF 

130 140 150 160 170 130 

190 200 210 220 

m278 . pep DRD FQLA VETL I QHLH QL A DL F VGQR I GT VN DG RF DM V EX 

I I 1 ti I 1 I I I I I I ! I : I I i I II II I I II I II I t I I I II I I 
a278 D RD FQ LAV ETL I QHLRQL A DL FVGQRI GT VN DG RF DMV EX 

190 200 210 220 



The following partial DNA sequence was identified in N .gonorrhoeae <SEQ ID 1 145>: 

g279 .seq 

1 atgacgcgga tttgcggctg cttgatttca acggttttga gtgtttcggc 

51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata 

101 ccggcagcgg cagggcgcgt ttggctccgg cttctttggc ggcagccatg 

151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 

2 01 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 
251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct 

3 01 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag 
351 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt 
401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 

4 51 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 1 146; ORF 279.ng>: 

g279.pep 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 VRPTAAA LPA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT 
151 SK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 147>: 

m279 .seq 

1 ATAACGCGGA TTTGCGGCTG CTTGATTTCA ACGGTTTTCA GGGCTTCGGC 

51 AAGTTTGTCG GCGGCGGGTT TCATCAGGCT GCAATGGGAA GGTACGGACA 

101 CGGGCAGCGG CAGGGCGCGT TTGGCACCGG CTTCTTTGGC GGCAGCCATG 

151 GCGCGTCCGA CGGCGGCGGC GTTGCCTGCA ATCACGATTT GTCCGGGTGA 

2 01 GTTGAAGTTG ACGGCTTCGA CCACTTCGCT TTGGGCGGCT TCGGCACAAA 
251 TGGCTTTAAC CTGCTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

3 01 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA GGCGGACGAG 
3 51 TTTGACCGCG TCGGCAAAAT TCAATGCGCC GGCGGCAACG AGTGCGGTGT 
401 ATTCGCCGAG GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 
451 TCTAAATAG 

This corresponds to the amino acid sequence <SEQ ID 1 148; ORF 279>: 

m279 .pep 

1 ITRICGCLIS TVFRASASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 ARPTAAA LPA ITICPGELKL TASTTSLWAA SAQMALTCSS SKPRIAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKFNAPAAT SAVYSPRLCP ATAAGVLPPA 
151 SK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N.gonorrhoeae 

ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

m279 .pep ITRICXSCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLJU^AMARPTAAALPA 

niiiiiiiiih on i ii mi iiiiiii nun i ii iii iii iihii mi i n 

g279 KntlCGCLISTVLSVSASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMVRPTAAALPA 
10 20 30 40 50 60 



70 



80 



90 100 110 120 
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m279 . pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 

I I I I II I I I I I I I I t I I I I : I I I III I h : I I I I I I I I I ! I I I I I I I t I I I I I I I 
g279 ITTCPGELKLTASTTSPCADSAQICLTCSSSKPKMAAIAPTPCGTADCISSARRRTSLTA 
70 80 90 100 110 120 

130 140 150 

m279 - pep S AKFN APAATS AVYS PRLCPATAAGVLP PAS KX 

III II M II III Mil Hill ill II hill 

g279 S AXSNASAATS AVYS PRLCPATAAGVLP PTSKX 

130 140 150 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 149>: 

a279. seq 

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTNNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCG3GTT TCATGAGGCT GCAATGGGAA GGTACNGACA 

101 CNGGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

201 GTTGAAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 

251 TTTGTTTTAC CTGTTCATCP TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

353 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

401 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

451 TCCGAATAG 

This corresponds to the amino acid sequence <SEQ ID 1 150; ORF 279.a>: 

a27 9.pep 

1 MTXICGCLIS TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAASI 
51 ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICETCSS SKPRIAAIAP 
101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 
151 SE* 

m279/a279 88.2% identity in 1 52 aa overlap 

10 20 30 40 50 60 

m2 7 9 . pep ITRICGCLI STVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

:| MINIMI I t M M I I 1 I t : ) t M I M E I II I I f I M I M I I 1 : : I I MIIMI 
a279 MTXICGCLISTVXRASASLSAAGFMRLQWEGTDTGSGRARLAPASLAASIARSTAAALPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m27 9 . pep I T I C PGELKLT ASTT SLWAAS AQMALT CS S S K PR I AAI APT PCGTADC I S S ARRRT S LTA 

I I i I I II I I I I It II I III: : I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a279 ITTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADCISSARXRTSLTA 

70 80 90 100 110 120 

130 140 150 

iu27 9 . pep S AK FN APAAT S AVY S PR LC P AT AAG VL P PAS KX 

III I I 1 I II I I I I I I I I I I I I I I I I I I I I : I 
a279 SAKSNAPAATSAVYS PXLCPATAAGVLPPASEX 

130 140 150 

Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
ORF 279 was cloned in pET and pGex vectors and expressed in E.coli as above-described. 
The products of protein expression and purification were analyzed by SDS-PAGE. Figure 2 A 
shows the results of affinity purification and Figure 2B shows the expression in Exoli. 
Purified GST-fusion protein was used to immunize mice whose sera were used for ELISA 
(positive result), FACS analysis (Figure 2C), western blot (Figure 2D). These experiments 
confirm that 279 is a surface-exposed protein and that it is a useful immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 279 are provided in 
Figure 6. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1 996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 279 and the amino 
acid sequence encoded thereby is provided herein. 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 151>: 

g280 . seq 

1 atgaaacacc tcaaacttac ccttattgcc gcattgctgg ccaccgccgc 

51 aactgccgca ccccttccgg ttgtaaccag tttcagcatt ttaggcgacg 

101 tagccaaaca aatcggcggt gagcgcgtag ccgtacaaag cctcgzcgga 

151 gccaaccaag atactcatgc ctatcacatg accagtggcg acattaaaaa 

201 aatccgcagt gcaaaactcg tcctgctcaa cggcttggga cttgaagccg 

251 ccgacatcca acgcgccgtc aaacagagca aagtatccta tgccgaagcg 

3 01 accaaaggca tccaacccct caaagccgaa gaagaaggcg gacaccatca 

3 51 cgaccaccat cacgaccacg atcatgaeca cgaaggacac caccacgacc 

4 01 acggcgaata tgacccccac gtctggaacg accctgttct tatgtccgac 
4 51 tatgcccaaa acgtcgctga aaccctgata aaggccgatc ccgaaggcaa 
501 agtttattat caacaacgct tgggcaacta ccaaatgcag cttaaaaaac 
551 tgcacagcga cgcacaagcc ccatttaatg ccgtccctgc cgccaaacgc 
601 aaagtcctga ccgggcacga cgcattttcc tacatgggca accgctacaa 
651 catcagctcc atcgccccgc aaggcgtgag cagcgaagcc gagccgtccg 
701 ccaaacaagt cgccgccatc atccggcaaa tcaaacgcga aggcatcaaa 
751 gccgtattta ccgaaaatat caaagacacc cgcatggttg accgcatcgc 
801 caaagaaacc ggcgtcaacg tcagcggcaa actgtattcc gacgcactcg 
8 51 gcaacgcgcc cgcagacacc tacatcggca tgtaccgcca caacgtcgaa 
901 gccttgacca acgcgatgaa gcaataa 

This corresponds to the amino acid sequence <SEQ ID 1 152; ORF 280.ng>: 

g280 .pep 

1 MKHLKLTLIA ALLATAATAA PLPWTSFSI LGDVAKQIGG ERVAVQSLVG 

51 ANQDTHAYHM TSGDI KKIRS AKLVLLNGLG LEAADIQRAV KQSKVSYAEA 

101 TKGIQPLKAE EEGGHHHDHH HDHDHDHEGH HHDHGEYDPH VWNDPVLMSD 

151 YAQNVAETLI KADPEGKVYY QQRLGNYQMQ LKKLHSDAQA AFNAVPAAKR 

201 KVLTGHDAFS YMGNRYNISF IAPQGVSSEA EPSAKQVAAI IRQ1KREGIK 

251 AVFTENIKDT RMVDRIAKET GVNVSGKLYS DALGNAPADT YIGMYRHNVE 

301 ALTNAMKQ* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 153>: 

m280.seq 

1 ATGAAACACC TCAAACTCAC CCTTATTGCC GCATTGCTGA CCGCCTCCGC 

51 AACTGCCGCC CCCCTGCCGG TTGTAACCAG CTTCAGCATT TTAGGCGACG 

101 TAGCCAAACA AATCGGCGGA GAGCGCGTAT CCATACAAAG TTTGGTCGGA 

151 GCCAACCAAG ATACGCACGC CTATCATATG ACCAGTGGCG ACATTAAAAA 

201 AATCCGCAGT GCAAAACTCG TCCTGCTCAA CGGCTTAGGA CTTGAAGCTG 

251 CCGATGTGCA ACGCGCCGTC AAACAAAGCA AAGTATCCTA TACCGAAGCG 

301 ACCAAAGGCA TCCAACCCCT CAAAGCCGAA GAAGAAGGCG GACACCATCA 

351 CGACCACGAT CATGACCACG AAGGACACCA CCATGACCAC GGCGAATATG 

401 ACCCGCACGT CTGGAACGAC CCCGTCCTTA TGTCCGCCTA TGCCCAAAAC 

451 GTTGCCAAAG CCCTGATAAA GGCCGATCCC GAAGGCAAAG TTTATTATCA 

501 ACAACGCTTG GGCAACTACC AAATGCAGCT CAAAAAACTG CACAGCGACG 

551 CACAAGCCGC ATTTAATGCC GTCCCTGCTG CCAAACGCAA AGTCCTGACC 

601 GGGCACGATG CCTTTTCCTA TATGGGCAAA CGTTACCATA TCGAATTCAT 

651 CGCCCCGCAA GGCGTGAGCA GCGAAGCCGA GCCTTCGGCC AAACAAGTCG 

701 CCGCCATCAT CCGACAAATC AAACGCGAAG GCATCAAAGC CGTCTTTACC 

751 GAAAACATCA AGGACACCCG TATGGTTGAC CGTATCGCCA AAGAAACCGG 

801 TGTCAACGTC AGCGGCAAAC TGTATTCCGA CGCACTCGGC AACGCGCCCG 

851 CAGACACCTA CATCGGAATG TACCGCCACA ACATCAAAGC CTTGACCAAC 

901 GCGATGAAGC AATAA 

This corresponds to the amino acid sequence <SEQ ID 1 154; ORF 280>: 

m280.pep 

1 MKHLKLTLIA ALLTASATA A PLPWTSFSI LGDVAKQIGG ERVSIQSLVG 

51 ANQDTHAYHM TSGDI KKIRS AKLVLLNGLG LEAADVQRAV KQSKVSYTEA 

101 TKGIQPLKAE EEGGHHHDHD HDHEGHHHDH GEYDPHVWND PVLMSAYAQN 

151 VAKALIKADP EGKVYYOQRL GNYQMQLKKL HSDAQAAFNA VPAAKRKVLT 
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201 GHDAFSYMGK RYHIEFIAPQ GVSSEAEPSA KQVAAIIRQI KREGIKAVFT 
251 EN I KDTRMVD RIAKETGVNV SGKLYSDALG NAPADTYIGM YRHNIKALTN 
301 AMKQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A {gonorrhoeae 

ORF 280 shows 93.8% identity over a 308 aa overlap with a predicted ORF (ORF 280.ng) 
from N. gonorrhoeae: 

m280/g280 

10 20 30 40 50 60 

m280.pep MKHLKLTL I AALLTAS ATAAPLP WTS FSI LGDVAKQ I GGERVS IQ3 LVGANQDTHAYHM 

II I III I I I III h:HII llllll IINI II Mil III II I MM I III! M I I 

g280 MKHLKLTL I AALLATAATAAP LP WTS FSI LGDVAKQIGGERVAVQ3 LVGANQDTHAYHM 

10 20 30 40 50 60 

70 80 90 100 110 119 

m280.pep TSGDIKKIRSAKLVLLNGLGLEAADVQRAVKQSKVSYTEATKGIQPLKAEEEGGHHHDH- 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 h I M I M 1 1 1 1 1 ; M 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 

g2 80 TSGDIKKIRSAKLVLLNGLGLEAADIQRAVKQSKVS YAEATKGIQPLKAEEEGGHHHDHH 

70 80 90 100 110 120 

120 130 140 150 160 170 

m2 80 - pep - - - DHDHEGHHHDHGEYDPHVWMDPVLMSAYAONVAKALIKADPEGKVYYQQRLGNYQMQ 

I II I M 1 1 1 1 1 1 1 1 1 IE II 1 1 1 1 1 1 1 IMMhMMIIIMMIIIMMIMM 

g2 80 HDHDHDHEGHHHDHGEYDPHVWNDPVLMSDYAQNVAETLIKADPEGKVYYQQRLGNYQMO 
130 140 150 160 170 180 

180 190 200 210 220 230 

m2 80 . pep LKKLHSDAQ AA FN AVP AAKR KVLTGHDAFS YMG KR YH I E F I A PQGV S S EAEP S AKQ VAAI 

IMIIIMIIIMIIIIMIIIIMIIIIIIIhlhl IMIIMI MIMMIIMII 

g280 LKKLHSDAQAAFNAVPAAKRKVLTGHDAFS YMGNRYNI S FI APQGVS SEAEP S AKQVAAI 

190 200 210 220 230 240 

240 250 260 270 280 290 

m280.pep IRQIKREGIKAVFTENIKDTRMVDRIAKETGVNVSGKLYSDALGNAPADTYI GMYRHNIK 

IIMI Ml Ml II I II IIMM III III IMMI IMIMMIMM Mill III Mh: 

g2 8 0 IRQ I KREGI KAVFTENI KDTRMVDRI AKETGVNVSGKLYSDALGNAPADTYI GMYRHNVE 

250 26G 270 280 290 300 



m2 80.pep 



g280 



300 
ALTNAMKQX 

Illllllll 
ALTNAMKQX 



The following partial DNA sequence was 

a280. seq 

1 ATGAAACACC CCAAACTCAC CCTTATCGCC 

51 AACTGCCGCC CCCCTGCCGG TTGTAACCAG 

101 TAGCCAAACA AATCGGCGGA GAGCGCGTAT 

. 151 GCCAACCAAG ATACGCACGC CTATCATATG 

201 AATCCGCAGT GCAAAACTCG TCCTGATTAA 

251 CCGACATCCA ACGTGCCGTC AAACAGAGCA 

301 ACCAAAGGCA TCCAACCCCT CAAAGCCGAA 

351 CGACCACGAT CATGACCACG ACCATGACCA 

401 ACGGCGAATA TGACCCCCAC GTCTGGAACG 

451 TATGCCCAAA ACGTCGCCGA AGCCCTGATA 

501 AGTTTATTAT CAACAACGCT TGGGCAACTA 

551 TGCACAGTGA CGCACAAGCC GCATTTAATG 

601 AAAGTCCTGA CCGGGCACGA TGCCTTTTCC 

651 TATCGAATTC ATCGCCCCAC AAGGTGTGAG 

701 CCAAACAAGT CGCCGCCATC ATCCGACAAA 

751 GCCGTATTTA CCGAAAATAT CAAAGACACC 



identified in N. meningitidis <SEQ ID 1 155>: 



GCATTGCTGA 
CTTCAGCATT 
CCATACAAAG 
ACCAGCGGCG 
CGGCTTAGGA 
AAGTATCCTA 
GAAGAAGGCG 
CGAAGGACAC 
ACCCCGTCCT 
AAGGCCGACC 
CCAAATGCAG 
CCGTCCCTGC 
TATATGGGCA 
CAGCGAAGCC 
TCAAACGCGA 
CGCATGGTTG 



CCACTGCCGC 
TTAGGCGACG 
TTTGGTCGGA 
ACATTAAAAA 
CTTGAAGCTG 
TGCCGAAGCG 
GACACCATCA 
CACCACGACC 
TATGTCCGCC 
CCGAAGGCAA 
CTCAAAAAAC 
CGCCAAACGC 
AACGTTACCA 
GAGCCTTCAG 
AGG CATC AAA 
ACCGCATCGC 
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801 CAAAGAAACC GGTGTCAACG TCAGCGGCAA ACTGTATTCC GACGCACTCG 

851 GCAACGCACC CGCAGACACC TACATCGGCA TGTACCGCCA CAACATCAAA 

901 GCCTTAACCA ACGCGATGAA GCAATAA 

This corresponds to the amino acid sequence <SEQ ID 1 156; ORF 280.a>: 

a280.pep 

1 MKHPKLTLIA ALLTTAATA A PLPWTSFSI LGDVAKQIGG ERVSIQSLVG 

51 ANQDTHAYHM TSGDIKKIRS AKLVLINGLG LEAADIQRAV KQSKVSYAEA 

101 TKGIQPLKAE JJEGGHHHDHD HDHDHDHEGH HHDHGEYDPH VWNDPVLMSA 

151 YAQNVAEALI KADPEGKVYY QQRLGNYQMQ LKKLHSDAQA AFNAVPAAKR 

201 KVLTGHDAFS YMGKRYHIEF IAPQGVSSEA EPSAKQVAAI IRQIKREGIK 

251 AVFTENIKDT RMVDRIAKET GVNVSGKLYS DALGNAPADT YIGMYRHNIK 

301 ALTNAMKQ* 



m280/a280 96.4% identity in 308 aa overlap 

10 20 30 40 50 60 

m280.pep MKHLKLTLI AALLTASATAAPLPWTS FS I LGDVAKQ I GGERVS IQSLVGANQDTHAYHM 

III I I I I I I I I I I : : I I I I I I I I I I I I I i I I I I I I I I I 1 I I ! I I I I II I I I I j I I ! I I I 
a280 MKHPKLTLIAALLTTAATAAPLPVVTSFSILGDVAKQIGGERVSIQSLVGANQDTHAYHM 

10 20 30 40 50 60 



70 80 90 100 110 120 

m280 . pep TSGDIKKIRSAKLVLLNGLGLEAADVQRAVKQSKVSYTEATKGIQPLKAEEEGGHHHDHD 
I I I I I I I I I I I I I I I : I I I II I II I : I I II I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 
a280 T SGDTKK I R S AKLVLINGLGLEAADI QRAVKQSKVS Y AE ATKG X.QPI.KAEF.F.GGHHH DH D 

70 80 90 100 110 120 



130 140 150 160 170 

m280 . pep HDH EGHHHDHGEYDPHVWNDPVLMSAYAQNVAKALIKADPEGKVYYQQRLGNYQMQ 

III I I I I II I M I I I I I I I I I I I I I I I I II I I : I I I I I I I 1 I I I I I II I I I I I II I 

a280 HDHDHDHEGHHHDHGEYDPHVWNDPVLMSAYAQNVAEALIKADPEGKVYYQQRLGNYQMQ 
130 140 150 160 170 180 



180 190 200 210 220 230 

m2 80 . pep LKKLHSDAQAAFNAV PAAKRKVLTGHDAFSYMGKRYHIEFIAPQGVSSEAEPSAKQVAAI 

I I I I I II I I I I I I I I I I II I I I I I I II I I I II I I I I II I I I I I I I I I I 1 I I II I I I I I I I 
a280 LKKLHSDAQAAFNAVPAAKRKVLTGHDAFSYMGKRYHIEFIAPQGVSSEAEPSAKQVAAI 
190 200 210 220 230 240 



240 250 260 270 280 290 

m280.pep I RQIKREGIKAVFTENIKDTRMVDRIAKETGVNVSGKLYS DALGNAPADT YIGMYRHNIK 

I I II I M II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I II I I II 
a280 I RQIKREGIKAVFTENIKDTRMVDRIAKETGVNVSGKLYS DALGNAPADT YIGMYRHNIK 

250 260 270 280 290 300 



m280.pep 
a280 



300 
ALTNAMKQX 

I I I I I I I I I 

ALTNAMKQX 



The following partial DMA sequence was identified in N. gonorrhoeae <SEQ ID 1 157>: 

g281 . seq 

1 atgcactacg ccctcgcatc cgtctrctgc ctgtccctca gcgccgcacc 

51 cgtcggcgta ttcctcgtca tgcgccgtat gagcctgata ggcgacgcat 

101 tgagccacgc cgtcctgccc ggtgccgccg tcggctacat gtttgccggc 

151 ttgagcctgc ccgctatggg tgtgggcggg tttgccgccg gtatgctgat 

201 ggcgctgctt gccggactcg tcagccgctt taccaccctg aaagaagatg 

251 ccaactttgc cgccttttac ctgagcagcc tcgccatcgg cgtaatcctc 

301 atcagcaaaa acggcagcag cgtcgattta ctccacctcc ttttcggatc 

351 tgtgcttgcc gtcgatattc ccgcactgca actcatcgcc gccgtctccg 

401 gcctcacgct cattaccctt gccgtcatct accgccccct ggtgctagaa 

4 51 agcatagacc cccttttcct caagtccgtc aacggcaaag gcgggctttg 
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501 gcacgtcatt ttcctcatcc tcgtcgttat gaacctcgta tccggcttcc 

551 aagccctcgg catcctgatg tcggtcggaa ttatgatgct gcccgccatt 

601 accgcccgtt tatgggcaag aaatatgggg acgctcattc tgttgtccgt 

651 cctcatcgcc cttttttgcg gtttgatcgg gctgctcatt ccctaccaca 

701 tcgaaatccc ttccggcccc gccatcatcc tctgttgcag cgtcctttat 

751 cttttttccg tcatactcgg caaagaaggc ggcatcttgc ccaaatggtt 

801 caaaaaccac cgccaccaca ccacctga 

This corresponds to the amino acid sequence <SEQ ID 1 158; ORF 28Lng>: 

g281 -pep 

1 MHYALASVFC LSLSA APVGV FLVMRRMSLI GDALSHAVLP GAAVGYMFAG 

51 LSLPAM GVGG FAAGMLMALL AGLV SRFTTL KEDAN FAAFY LSSLAIGVIL 

101 ISKNGSSV DL LHLLFGSVLA VDIPAL Q LIA AV5GLTLITL AVIY RPLVLE 

151 SIDPLFLKSV NGKGGLW HVI FLILWMNLV SGFQALGILM SVGIMMLPAI 

201 TARLWARNMG TLI LLSVLIA LFCGLIGLLI SYHIEIPSGP AIILCCSVLY 

251 LFSVILG KEG GILPKWFKNH RHHTT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 159>: 

m281.seq (partial) 

1 ATGCGCTACG CCCTCGCATC CGTCTTCTGC CTGTCCCTCA GTGCCGCACC 

51 CGTCGGCGTA TTCCTCGTCA TGCGCCGTAT GAGCCTGATA GGCGACGCAT 

101 TGAGCCACGC CGTCCTGCCC GGTGCCGCCG TCGGCTACAT GTTTGCCGGC 

151 TTGAGCCTGC CCGCCATGGG TTTGGGCGGC GTAGCCGCAG GCATGCTGAT 

201 GGCACTGCTT GCCGGACTCG TCAGCCGCTT CACCACCCTG AAAGAAGATG 

251 CCAACTTTGC CGCCTTTTAT CTCAGCAGCC TCGCCATCGG CGTAGTCCTC 

301 GTCAG CAAAA ACGGGAGCAG CGTCGATTTG CTCCACCTCC TTTTCGGCTC 

351 TGTACTTGCC GTCGATATTC CTGCCCTGCA GCTCATCGCC GCCGTCTCCA 

401 GCCTCACGCT CATTACCCTT GCCGTCATCT ACCGCCCGCT CGTACTCGAA 

451 AGCATCGACC CCCTGTTTCT CAAATCCGTC GGCGGCAAAG GCGGGCTTTG 

501 GCACGTCCTC TTTCTCGTCC TGGTCGTCAT GAACCTCGTA TCCGGCTTTC 

551 AAGCCCTCGG CACACTCATG TCCGTCGGAC TCATGATGCT GCCAGCCATT 

601 ACCGCCCGCC TGTGGGCGAA GC AT ATGGGC GCACTCATCC TCCTATCCGT 

651 TCTGACAGCC CTGCTGTGCG GCTTGAGCGG ACTGCTCATT TCCTACCACA 

701 TCGAAATTCC TTCCGGTCCC GCCATCATCC TCTGTTGCAG CGTCCTTTAT 

751 CTCTTTTCCG TCATACTCGG CAAAGAAGGC GGCATTCTGA CC. . 

This corresponds to the amino acid sequence <SEQ ID 1 160; ORF 281>: 

tn281.pep (partial) 

1 MRYALASVFC LSLSAA PVGV FLVMRRMSLI GDALSHAVLP GAAVGYMFAG 

51 LSLPAMG LGG VAAGMLKALL AGLV SRFTTL KEDAN FAAFY LSSLAIGWL 

101 VSKNGSSVD L LHLLFGSVLA VDIPALQLIA AVSSLTLITL AVIY RPLVLE 

151 SIDPLFLKSV GGKGGLW HVL FLVLWMNLV SGFQALGTLM SVGLMMLPAI 

201 TARLWAKHMG ALI LLSVLTA LLCGLSGLLI SYHIEIPSGP AIILCCSVLY 

251 LFSVILG KEG GILT.. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from ^.gonorrhoeae 

ORF 281 shows 93.5% identity over a 263 aa overlap with a predicted ORF (ORF 281.ng) 
from N. gonorrhoeae: 

m281/g281 

10 20 30 40 50 60 

m281 . pep MRYALASVFCLSLSAAPVGVFLVMRRMSLIGDALSHAVLPGAAVGYMFAGLSLPAMGLGG 

hi II MMIIIIIII iillll lllllll MINI illl III I HUM llllll Ihl I 

g281 MHYALASV FCLSLSAAPVGV FLVMRRMS LI GDALSHAVLPGAAVG YMFAGLSL PAMGVGG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 81 . pep VAAGMliMALLAGLVSRFTTLKEDANFAAFYLSSLAIGVVLVSKNGSSVDLLHLLFGSVLA 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I h h I I I I I I I I I I I I I II I I I I 
g281 FAAGMli^LLAGLVSRFTTLKEDANFAAFYLSSLAIGVILISKNGSSVDLL^LFGSVLA 
70 80 90 100 110 120 

130 140 150 160 170 180 

m2 8 1 . pep VDI PALQL I AAVSS LTLITLAVI YRPLVLES IDPLFLKSVGGKGGLWHVLFLVLWMNLV 
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| | | I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I « I I I I I I I 
VD I PALQLI AAVSGLTLI TLAV I YRPLVLES I D PLFLKS VNGKGGLWHV I FL I LWMNLV 
130 140 150 160 170 180 

190 200 210 220 230 240 

SGFQALGTLMSVGLMMLPAITARLWAKHMGALILLSVLTALLCGLSGLLISYHIBIPSGP 

II II II I III I Ml Mill III 1 1- I hi I Mill I hi II III Ml I II II Ml 

SGFQALGI LMSVGIMMLPAITARLWARNMGTLILLSVLIALFCGLIGLLI S YHI EI PSGP 
190 200 210 220 230 240 

250 260 
AI ILCCSVLYLFSVILGKEGGILT 

I I II III Ml 1 IN M I M II I 1 
AI I LCC S VLYLFS V I LG KE GG I LPKW FKNHRHHTTX 
250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 161>: 

a281.seq 

1 ATGCGCTACG CCCTCGCATC CGTCTTCTGC CTGTCCCTCA GTGCCGCACC 

51 CGTCGGCGTA TTCCTCGTCA TGCGCCGTAT GAGCCTGATA GGCGACGCAT 

101 TGAGCCACGC CGTCCTGCCC GGTGCCGCCG TCGGCTACAT GTTTGCCGGC 

151 TTAAGCCTGC CCGCCATGGG TTT GGGCGGC GTAGCCGCAG GTATGCTGAT 

201 GGCACTGCTT GCCGGACTCG TCAGCCGCTT CACCACCCTG AAAGAAGATG 

251 CCAACTTTGC CGCCTTTTAT CTCAGCAGCC TCGCCATCGG TGTAGTCCTC 

301 GTCAGCAAAA ACGGCAGCAG CGTCGATTTG CTCCACCTCC TTTTCGGCTC 

351 CGTACTTGCC GTCGATATTC CTGCCCTGCA ACTCATCGCC GCCGTATCCA 

401 CCCTCACACT GCTTACCCTT GCCGTCATCT ACCGCCCGCT CGTACTCGAA 

451 AGCATCGACC CCCTGTTTCT CAAATCTGTC GGCGGCAAAG GCGGGCTTTG 

501 GCACGTCCTC TTTCTCGTCC TGGTCGTCAT GAACCTCGTA TCCGGCTTTC 

551 AAGCCCTCGG CACACTCATG TCCGTCGGAC TTATGATGCT GCCAGCCATT 

601 ACCGCCCGCC TATGGGCGAA GCACATGGGC GCACTCATCC TCCTATCCGT 

651 TCTGACAGCC CTGCTGTGCG GCTTGAGCGG ACTGCTCATT TCCTACCACA 

701 TCGAAATTCC TTCCGGTCCC GCCATCATCC TCTGTTGCAG CGTCCTTTAT 

751 CTCTTTTCCG TCATACTCGG CAAAGAAGGC GGCATTCTGA CCAAATGGCT 

801 CAAAAACCAC CGCCACCACA CCACCTGA 

This corresponds to the amino acid sequence <SEQ ID 1 162; ORF 281. a>: 

a281.pep 

1 MRYALASVFC LSLSA APVGV FLVMRRMSLI GDALSHAVLP GAAVGYMFAG 

51 LSLPAMG LGG VAAGMLMALL AGLV SRFTTL KEDAN FAAFY 1SSLAIGWL 

101 VSKNGSSVD L LHLLFGSVLA V PI PAL Q LI A AVSTLTLLTL AVIY RPLVLE 

151 SIDPLFLKSV GGKGGLWH VL FLV LWMNLV SGFQALGTLM SVGLMMLPAI 

201 TARLWAKH MG ALILLSVLTA LLCGL SGLLI SYHIEIPSGP AIILCCSVLY 

251 LFSVILG KEG GILTKWLKNH RHHTT* 

m281/a281 99.2% identity in 264 aa overlap 

10 20 30 40 50 50 

m281.pep MR Y ALA S V FCL S L S AA P VG V F LVMRRM S L I G DAL S HAV L PG AAVG YM FAGLS L P AMG L G G 

1 I M I I I I I I I II I II I I I I I I ! I I I I I I II II I I I I I I It I I I I I I I I I I I I I I I I I I I 
a281 MRYALASVFCLSLSAAPVGVFLVMRRMSLIGDALSHAVLPGAAVGYMFAGLSLPAMGLGG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m281 . pep VAAGMLMALLAGLVSRFTT1KEDANFAAFYLSSLAIGWLVSKNGSSVDLLHLLFGSVLA 
I | | I I I I I I I I I I I I I I I I I I I I 1 I I II I I I I I I I II I I I I I I I II I I I II I I I I I I I I I 
a281 VAAGMLMALLAGLVSRFTTLKEDANFAAFYLSSLAIGWLVSKNGSSVDLLHLLFGSVLA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m281.pep VDI PALQLI AAVSSLTLITLAVI YRPLVLES I DPLFLKSVGGKGGLWHVLFLVLWMNLV 

I I I I I I I I I I I I I : I I I : I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
a281 VDIPALQLIAAVSTLTLLTLAV I YRPLVLES I DPLFLKSVGGKGGLWHVLFLVLWMN LV 

130 140 150 160 170 180 



9281 

m281 .pep 
g281 

m281 -pep 
g281 
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. 190 200 210 220 230 240 

m281 . pep SGFQALGTLMSVGLMMLPAITARLWAKHMGALILLSVLTALLCGLSGLLISYHIEIPSGP 

I M I I I I II I I I II II I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I i I I I 
a 281 SGFQALGTLMSVGLMT^LPAITARLWAKHMGALILLSVLTALLCGLSGLLISYHIEIPSGP 

190 200 210 220 230 240 

250 260 
m281 .pep AIILCCSVLYLFSVILGKEGGILT 

II M I M ll 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 

a 281 AIILCCSVLYLFSVILGKEGGILTKWLKNHRHHTTX 
250 260 270 



The following partial DNA sequence was identified in ^.gonorrhoeae <SEQ ID 1 1 63>: 

g282.seq 

1 atgggattgg gtatggaaat cggcaagctg attgtggctc ttttggtgct 

51 gatcaatccg tttagcgcgt tgtcgcttta ccttgacctg accaacggac 

101 acagcacgaa ggagcgcagg aaggtcgcgc ggacggccgc cgtcgccgtg 

151 tttgccgtga ttgcggtatt tgcgctgatc ggcggtgcgc tattgaaggt 

201 tttgggcatc agcgtcggtt cgtttcaggt cggcggcggg attttcgtgc 

251 cgctgatcgc catttcgatg atgaacggca acgacaatcc cgccaagcag 

301 aatctcggcg cgcagccgga aacggggcaa gcgcgccccg cccgcaatgc 

351 aggggcgatt gccgtcgtgc ccatcgccat accgatcacc atcggtccgg 

401 gcggtatttc gactgtgatt atttatgctt cggcagccaa aacgtacagc 

451 gatatcgcgc tgattatcgc ggccggtttg gtggtcagtg cgatttgtta 

501 tgccatttta atcgttgccg ggaaggtcag ccgcctgctg ggcgcgacgg 

551 ggctgacgat tttaaaccgc attatgggta tgatgctggc ggcggtatcg 

601 gtggagatta ttgtgtcggg actgaaaacg atattcccgc aactggcagg 

651 ttga 

This corresponds to the amino acid sequence <SEQ ID 1164; ORF 282.ng>; 

g282 .pep 

1 MGLGMEIGKL IVALLVLINP FSALS LYLDL TNGHST KERR KVARTAAVAV 

51 FAVIAVFALI GGAL LKVLGI SVGS FQVGGG ILVLLIAISM MNGNDNPAKQ 

101 NLGAQPETGQ ARPARNAG AI AWPIAIPIT IGPGGI STVI IYASAAKTYS 

151 DIAL IIAAGL WSAICYAIL IV AGKVSRLL GATGLTILNR IMGMMLAAVS 

201 VEIIVSG LKT IFPQLAG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 165>: 

m282 . seq 

1 ATGGGATTGG GCATGGAAAT CGGCAAGCTG ATTGTGGCTT TTTTGGTGCT 

51 GATTAATCCG TTTAGCGCGT TGTCGCTTTA CCTTGACCTG ACCAACGGGC 

101 ACAGCACGAA GGAGCGCAGG AAGGTCGCGC GGACGGCCGC CGTTGCCGTG 

151 TTTGCCGTGA TTGCGGTATT TGCGCTGATC GGCGGTACGC TGCTGAAGGT 

201 TTTGGGCATC AGCGTCGGTT CGTTTCAGGT CGGCGGCGGG ATTTTGGTGC 

251 TGCTGATCGC CATTTCGATG ATGAACGGCA ACGACAATCC CGCCAAGCAG 

301 AATCTCGGCG CGCAGCCGGA AACGGGGCAG GCGCGCCCCG CCCGCAATGC 

351 CGGAGCGATT GCCGTCGTGC CCATCGCCAT ACCGATCACC ATCGGCCCGG 

4 01 GCGGTATTTC GACCGTGATT ATTTACGCTT CGGCGGCTAA AACATACGGC 

451 GACATCGCGT TGATTATCGC GGCCGGTTTG GTGGTCAGTG CGATTTGTTA 

501 TGCCATTTTA ATCGTTGCCG GGAAGGTCAG CCGCCTGCTG GGCGCGACGG 

551 GGCTGACGAT TTTAAACCGC ATTATGGGTA TGATGCTGGC GGCGGTATCG 

601 GTGGAGATTA TTGTGTCGGG ACTGAAAACG ATATTCCCGC AACTGGCAGG 

651 TTGA 

This corresponds to the amino acid sequence <SEQ ID 1 166; ORF 282.ng>: 

m282 .pep 

1 MGLGMEIGKL IVAFLVLINP FSALS LYLDL TNGHST KERR KVARTAAVAV 

51 FAVIAVFALI GGTL LKVLGI SVGS FQVGGG ILVLLIAISM MN GNDNPAKQ 

101 NLGAQPETGQ ARPARNAG AI AWPIAIPIT IGPGGI STVI IYASAAKTYG 

151 DIA LIIAAGL WSAICYAIL IV AGKVSRLL GATGLTILNR IMGMMLAAVS 

201 VEIIVSG LKT IFPQLAG* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N.gonorrhoeae 

ORF 282 shows 98.6 % identity over a 217 aa overlap with a predicted ORF (ORF 282.ng) 
from AT gonorrhoeae: 

m282/g282 

10 20 30 40 50 60 

m282.pep MG LGME I G KL I VAFLVL I N P FS ALS L YLDLTNGHS T KE RRKV ARTAAVAVFAV I AV FAL I 

f 1 1 1 1 1 1 1 1 f 1 1 1 = i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 i f I lllllllllllll 

g 282 MGLGME I GKL I VALLVL I N P FS ALS L YLD LTNGHS T KE RRKV ARTAAVAVFAV I AV FAL I 

10 20 30 40 50 60 

70 80 90 100 110 120 

m282.pep GGTLLKVLGI SVGSFQVGGGI LVLLI AI SMMNGNDNPAKQNLGAQPETGOARPARNAGAI 

I h I 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 I I ! 1 1 1 1 I 1 1 1 I I I 1 1 1 1 1 1 1 I 1 1 1 1 I 

g282 GGALL KVLG I SVG S FQVGGG I LVLL I A I SMMNGNDN P AKQNLG AQP ETGQAR PARKAGAI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m282 .pep AWPIAIPITIGPGGISTVIIYASAAKTYGDIALIIAAGLWSAICYAILIVAGKVSRLL 

I | | I I I I I I I II I II I I I I I I I I I I I I I h I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I 
g282 AWPIAIPITIGPGGI STVIIYASAAKTYSDIALI IAAGLWSAICYAILIVAGKVSRLL 

130 140 150 160 170 180 

190 200 210 

m282 .pep ' GATGLTILNRIMGMMLAAVSVEIIVSGLKTI FPQLAGX 

III MM III II IMMI I II I III II M II M ill 

g 282 GATGLTILNRIMGMMLAAVSVEIIVSGLKTIFPQLAGX 
190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 167>: 
a282.seq 

1 ATGGGATTGG GCATGGAAAT CGGCAAGCTG ATTGTGGCTT TTTTGGTGCT 

51 GATTAATCCG TTTAGCGCGT TGTCGCTTTA CCTTGACCTG ACCAACGGGC 

101 ACAGCACGAA GGAGCGCAGG AAGGTCGCGC GGACGGCCGC CGTTGCCGTG 

151 TTTGCCGTGA TTGCGGTATT TGCGCTGATC GGCGGTACGC TGCTGAAGGT 

201 TTTGGGCATC AGCGTCGGTT CGTTTCAGGT CGGCGGCGGA ATTTTGGTGT 

251 TGCTGATTGC CATTTCGATG ATGAACGGCA ACGACAATCC CGCCAAGCAG 

301 AATCTCGGCG CGCAGCCGGA AACGGGGCAG GTGCGCCCCG CCCGCAATGC 

351 CGGAGCGATT GCCGTCGTGC CCATCGCCAT ACCGATCACC ATCGGCCCGG 

401 GCGGTATTTC GACCGTGATT ATTTACGCTT CGGCGGCTAA AACATACGGC 

4 51 GACATCGCGT TGATTATCGC GGCCGGTTTG GTGGTCAGTG CGATTTGTTA 

501 TGCCATTTTA ATCGTTGCCG GGAAGGTCAG CCGCCTGCTG GGTGCGACGG 

551 GGCTGACGAT TTTAAACCGT ATCATGGGTA TGATGCTGGC GGCGGTATCG 

601 GTGGAGATTA TTGTGTCGGG ACTGAAAATG A7ATTCCCGC AACTGGCAGG 

651 TTGA 

This corresponds to the amino acid sequence <SEQ ID 1 168; ORF 282.a>: 
a282.?ep 

1 MGLGME I GKL IVAFLVLINP FSALS LYLDL TNGHSTKERR KVARTAAVAV 

51 FAVIAVFALI GG TLLKVLGI SVGS FQVGGG ILVLLIAISM M NGNPNPAKQ 

101 NLGAQPETGQ VRPARNAGAI AVVPIAIPIT^IGPGGISTVI IYASAAKTYG 

151 DIAL IIAAGL V V S A I C YAI L I VAGKVS RLL GATGLTILNR IMGMMLAAVS 

201 VEIIVSG LKM IFPQLAG* 

m282/a282 99. 1 % identity in 2 1 7 aa overlap 

10 20 30 40 50 60 

m282 . pep MGLGMEIGKLI VAFLVL IN PFSALSLYLDLTNGHSTKERRKV ART AAVAV FAVIAVFALI 

! I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I t 1 I t ! I I I I i I I I I I I I I I I > I I I I I I I 
a282 MGLGMEI GKLIVAFLVL IN PFSALSLYLDLTNGHSTKERRKVARTAAVAV FAVIAVFALI 

10 20 30 40 50 60 
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70 80 90 100 110 120 

in2 82 . pep GGTLLKVLGI SVGSFQVGGGILVLLI AISMMNGNDNPAKQNLGAQPETGQARPARNAGAI 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I III I I I ! I I II I I I I I I : I I I 1 I I I I I 
a 282 GGTLLKVLGI SVGSFQVGGGILVLLIAISMMNGNDNPAKQNLGAQPETGQVRPARNAGAI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m282.pep AVVPIAIPITIGPGGISTVIIYASAAKTYGDIALIIAAGLVVSAICYAILIVAGKVSRLL 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I 
a2 82 AVVPI AI PITIGPGG1STV 1 1 YASAAKT YGD1 ALI IAAGL VVSAICYAILIVAGKVSRLL 

130 140 150 160 170 180 

190 200 210 

jd2 8 2 . pep G ATGLTI LNRIMGMMLAAVS VEI I VS GLKT I FPQLAGX 

I I I 1 1 I ! I I I I I J 1 1 1 1 1 1 I I S I 1 1 1 I I 1 II! MM I 
3282 GATGLTILNRIMGMMLAAVS VEI I VSGLKMI FPQLAGX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 169>: 



g283.seq 








1 


atgaactttg 


ctttatccgt 


catcacattt 


51 


cgtcccgcct 


gccggaaccg 


ccgtctttac 


101 


acagctattc 


ggatgtgccg 


aaacagcttc 


151 


ctcaacctgc 


ggacgctcca 


aaccaaaccg 


201 


cgtcgatacg 


aatgcggaca 


gtgcgaagga 


251 


agaaaaacgg 


gcagcttgag 


gaagaaaaga 


301 


cggcagaaca 


aagaagaaaa 


ctgccggatt 


351 


ggtgggaaac 


tcaaatgcga 


aaaacaagga 


401 


ataacgccgt 


aaacaaatac 


tgccgttaa 



accctcgcct ctttcctgcc 

ttggaaagac ggcggcggca 

atcccgacca gagccaaatc 

gcggtcaagc ccaaacctgc 

aaacgaaaag gatatcgccg 

aaaaaattgc cgaaaccgaa 

tcaaaaatga acctgaaggc 



This corresponds to the amino acid sequence <SEQ ID 1 170; ORF 283. ng>: 

g283.pep 

1 KNFALSVITF TLASFLFVPP AGTA VFTWKP GGGNSYSDVP KQLHPDQSQI 

51 LNLRTLQTKP AVKPKPAVDT NADSAKENEK DIAEKNC-QLE EEKKKIAETE 

101 RQNKEENCRI SKMNLKAVGN SNAKNKDDLI RKYNNAVNKY CR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 171>: 

m283.seq 

1 ATGAACTTTG CTTTATCCGT CATTATGTTG ACCCTCGCCT CTTTCCTGCC 

51 CGTCCCGCCT GCCGGAGCCG CCGTCTTTAC TTGGAAGGAC GGCGGCGGCA 

101 ACAGCTATTC GGATGTACCG AAACAGCTTC ATCCCGACCA AAGCCAAATC 

151 TTAAACCTGC GGACGCGCCA AACCAAACCG GCGGTCAAAC CCGCCCAAGC 

201 CGACGCAGGG AAGCGCACAG ACGGCGCGGC ACAGGAAAAC AATCCCGACA 

251 CTGCCGAGAA AAACCGGCAG CTTGAGGAAG AAAAGAAAAG AATTGCCGAA 

301 ACCGAACGGC AGAACAAAGA AGAAAACTGC CGGATTTCAA AAATGAACCT 

351 GAAGGCGGTG GGAAATTCAA ATGCAAAAAA CAAGGATGAT TTGATTCGGA 

401 AATACAATAA CGCCGTAAAC AAATACTGCC GTTAA 

This corresponds to the amino acid sequence <SEQ ID 1 172; ORF 283>: 

m283 .pep 

1 MNFALSVIML TLASFLPVPP AGAA VFTWKD GGGNSYSDVP KQLHPDQSQI 

51 LNLRTRQTKP AVKPAQADAG KRTDGAAQEN NPDTAEKNRQ LEEEKKRIAE 

101 TERQNKEENC RISKMNLKAV GNSNAKNKDD LIRKYNNAVN KYCR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m283/g283 86.1% identity in 14 4 aa overlap 

10 20 30 40 50 60 

m283 .pep MNFALSVIMLTLASFLPVPPAGAAVFTWKDGGGNSYSDVPKQLHPDQSQI LNLRTRQTKP 

I I I I I I I I :l I I I II I I I I I I : I I I I I I I I I I I I I I I I I i I I M I I I I I I I I I I Mil 
g283 MNFALSVITFTLASFLPVPPAGTAVFTWKDGGGNSYSDVPKQLHPDQSQI LNLRTLQTKP 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m283.pep AVKPAQADAGKRTDGAAQENNPDTAEKNRQLEEEKKRIAETERQNKEENCRISKMNLKAV 
I I I I I : : I : I : I I : I ! I I I I I I I I I I : I I I I I I I ! I I I I I i I I 1 I I I 1 I i 
g 283 AVKPKPA-VDTNAD-SAKENEKDIAEKNGQLEEEKKKIAETERQNKEENCRISKMNLKAV 

70 80 90 100 110 



130 140 
ra283 . pep GNSNAKNKDDL I RKYNNAVNKYCRX 

I I I I I II I I I I I I I I I I I I I I 1 I I I 
g 2 8 3 GNSNAKNKDDL I RKYNNAVNKYCRX 

120 130 140 



The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 1 173>: 

a283. seq 

1 ATGAACTTTG CTTTATCCGT CATTATGTTG ACCCTCGCCT CTTTCCTGCC 

51 CGTCCCGCCT GCCGGAGCCG CCGTCTTTAC TTGGAAGGAC GGCGGCGGCA 

101 ACAGCTATTC GGATGTACCG AAACAGCTTC ATCCCGACCA AAGCCAAATC 

151 TTAAACCTGC GGACGCGCCA AACCAAACCG GCGGTCAAAC CCGCCCAAGC 

201 CGACGCAGGG AAGCGCACAG ACGGCGCGGC ACAGGAAAAC AATCCCGACA 

251 CTGCCGAGAA AAACCGGCAG CTTGAGGAAG AAAAGAAAAG AATTGCCGAA 

301 ACCGAACGGC AGAACAAAGA AGAAAACTGC CGGATTTCAA AAATGAACCT 

351 GAAAGCGGTG GGAAATTCAA ATGCAAAAAA CAAGGATGAT TTGATTCGGA 

401 AATACAATAA CGCCGTAAAC AAATACTGCC GTTAA 

This corresponds to the amino acid sequence <SEQ ID 1 174; ORF 283.a>: 

a283 .pep 

1 MNFALSVIML TLASFLPVPP AGA AVFTWKD GGGNSYSDVP KQLHPDQSQI 

51 LNLRTRQTKP AVKPAQADAG KRTDGAAQEN NPDTAEKNRQ LEEEKKRIAE 

101 TERQNKEENC RISKMNLKAV GNSNAKNKDD LIRKYNNAVN KYCR* 



m283/a283 100.0% identity in 144 aa overlap 

10 20 30 40 50 60 

m283 . pep MNFALSVIMLTLASFLPVPPAGAAVFTWKDGGGN SYS DVPKQLHPDQSQI LNLRTRQTKP 

III 111 I II II Ml II III III II III II I Mil! II till Ml Mill lllll I III I 
a 2 8 3 MNFALSVIMLTLAS FLPVPPAGAAVFTWKDGGGNSYS DVPKQLHPDQSQI LNLRTRQTKP 

10 20 30 40 50 60 



70 80 90 100 110 120 

m283.pep AVKPAQADAGKRTDGAAQENNPDTAEKNRQLEEEKKR I AETERQNKEENCRI SKMN LKAV 

I II II M I I M II I I I I I I II I 1 1 I M M I M II I I M II I M I 11 II I I ! I M I I I II 
a283 AVKPAQADAGKRTDGAAQENNPDTAEKNRQLEEEKKRIAETERQNKEENCRISKMNLKAV 
70 80 90 100 110 120 



130 140 
m283 . pep GNSNAKNKDDLIRKYNNAVNKYCRX 
IMIMMMIIMIIIII.IMII 
a 2 8 3 GN SNAKNKDDLIRKYNNAVNKYCRX 

130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 175>: 

g284 . seq. 

1 atgccgtctg aaactcgaaa tcggtttcag acggcattgg tttacgcggc 

51 aggttggggc ttagcggtct ttgtaacggc attcgctttt gcctgcaaaa 

101 gagtcgccgg ctttgcgttt gcctttgaag ccttcgccgg tttttttgaa 

151 actgtctttc ttaaagcctt ctttcttgaa accttcgccg cgcgttttgc 

201 cgccgaagcc ttctttgccc ggtttatgat cgccgcgccg gccgccggat 

251 ttcctatcgc cccagccgcc tttgcctttc ggcttgccgc ctgcggattt 

301 ccgtttgcgg gccggctcca tgccttcgat ggtcagttcg ggcagtttgc 

351 cgttaatgta tttttcgatt ttgtggactt tgacgtcttc gttcacttcg 

401 gcaaacgtaa tcgcaatacc cgtgcggcct gcgcggccgg tgcgcccgat 

451 gcggtggacg tagtcttccg cctgtttcgg caggtcgtag tttatgacgt 



WO 99/57280 



PCT/US99/09346 



645 

501 gggtaatggt cggtacgtca ataccgcgtg cggcaacgtc ggtggcaacc 

551 aaaattittgc agcggccttt acgcaaatcc gtcagcgtgc ggttgcgcca 

601 gccctgcggc atatcgccgt gcaggcagtt ggcggcgaaa cctttttcgt 

651 acaattcatc cgcgatgact tcggtcatcg ctttgg-gga cgtgaaaatc 

701 acacattggt cgatgttggc atcgcgcagg atgtggtcga gcaggcggtt 

751 tttgtggcgc atatcgtcgc agtacaacaa ctgctcttcg attttgcctt 

801 ggccgtccac gcgttcgact tcgataattt cagagtcttt ggtcagtttg 

851 cgcgccagtt tgccgactgc gccgtcccaa gtggcggaga acaataa 

This corresponds to the amino acid sequence <SEQ ID 1 176; ORF 284.ng>: 

g284 .pep 

1 MPSETRNRFQ TALVYAAGWG LAVFVTAFA F ACKRVAGFAF AFEAFAGFFE 

51 TVFLKAFFLE TFAARFAAEA FFARFMIAAP AAGFPIAPAA FAFRLAACGF 

101 AFAGRLHAFD GQFGQFAVNV FFDFVDFDVF VHFGKRNRNT RAACAAGAPD 

151 AVDWFRLFR QWVYDVGNG RYVNTACGNV GGNQNFAAAF TQIRQRAVAP 

201 ALRHIAVQAV GGETFFVQFI RDDFGHRFGG RENHTLVDVG IAQDWEQAV 

251 FVAHIVAVQQ LLFDFALAVH AFDFDNFRVF GQFARQFADC AVPSGGEQ* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 177>; 

m284 . seq. . 

1 ATGCCGTCTG AAACTCGAAA TCGGTTTCAG ACGGCATTGG TTTATGCGGC 

51 AGGTTGGGGC TTAGCGGTCT TTGTAACGGC GTTCGCCTTT 3CCTGCAAAA 

101 GAATCGCCGG CTTTGCGTTT GCCTTTGAAG CCTTCGCCGG TTTTTTTGAA 

151 ACCGTCTCTC TTAAAGCCTT CTTTCTTGAA ACCTTCGCCG CGCGTTTTGC 

201 CGCCGAAGCC TTCTTTGCTC GGTTTATGAT CGCCGCGCCA ACCGCCGGAT 

251 TTACGATCGC CCCAGCCGCC TTTGCCTTTC GGCTTGCCGC CTGCGGATTT 

301 GCGTTTGCGG GTCGGTTCCA TGCCTTCGAT GGTCAGTTCG GGCAGTTTTC 

351 GGTTAATGTA TTTTTCGATT TTGTGGACTT TGACGTATTC GTTCACTTCG 

401 GCAAACGTAA TCGCAATACC CGTGCGGCCT GCGCGGCCGG TGCGCCCGA? 

451 GCGGTGGACG TAGTCTTCCG CCTGTTTCGG CAGGTCGTAG TTGATAACG? 

501 GGGTAATGGT CGGTACGTCG ATACCGCGTG CGGCAACATC GGTGGCAACC 

551 AAAATTTTGC AGCGGCCTTT ACGCAAATCC ATCAGCGTGC GGTTGCGCCA 

601 GCCTTGCGGC ATATCGCCGT GCAGGCAGTT TGCGGCGAAA CCTTTTTCGT 

651 ACAGTTCATC CGCAATGACT TCGGTCATGG CTTTGGTGGA CGTGAAAATC 

701 ACGCATTGAT CGATATTGGC ATCGCGCAAG ATATGATCGA GCAGGCGGTT 

751 TTTGTGGCGC ATATCGTCGC AGTACAGCAG TTGTTCTTCG ATTTTGCCTT 

801 GATCGTCCAC GCGTTCGACT TCGATGATTT CAGGGTCTTT GGTCAGTTTG 

851 CGCGCCAGTT TGCCGACCGC GCCGTCCCAA GTGGCGGAGA ACAACAAAGT 

901 CTGACGGTCG CTCGGCGTTG CTTCCACGAT GGTTTCGATG TCGTCGATAA 

S51 AGCCCATATC CAACATACGG TCGGCTTCGT CCAAAATCAG CACTTCCAAA 

1001 CGTTCAAAAT CAACTTTGCC GCTTTGCATC AGGTCCATCA GACGGCCCGG 

1C51 CGTGGCGACA ATCAGATCGA CCGGTTTGCT CAGGGCACGG GTTTGGTAGC 

1101 CGAAAGACGC GCCGCCGACG ATGCTGACGG TGCGGAACCA ACGCATATTT 

1151 TTGGCATACG CCAGCGCGTT TTTCTCGACT TGAGCCGCCA GTTCGCGGGT 

1201 CGGGGTCAAC ACCAAAGCAC GCGGGCCTTT GCCCGGTTTT TCGCTGCGTT 

1251 TGGTCAGTTT TTGCAAAGTC GGTAA 



This corresponds to the amino acid sequence <SEQ ID 1 178; ORF 284>: 

m284.pep 

1 MPSETRNRFQ TALVYAAGWG LAVFVTAFA F ACKRIAGFAF AFEAFAGFFE 

51 TVSLKAFFLE TFAARFAAEA FFARFMIAAP TAGFTIAPAA FAFRLAACGF 

101 AFAGRFHAFD GQFGQFSVNV FFDFVDFDVF VHFGKRNRNT RAACAAGAPD 

151 AVDWFRLFR QVVVDNVGNG RYVDTACGNI GGNQNFAAAF TQIHQRAVAP 

201 ALRHIAVQAV CGETFFVQFI RNDFGHGFGG RENHALIDIG IAQDMIEQAV 

251 FVAHIVAVQQ LFFDFALIVH AFDFDDFRVF GQFARQFADR AVPSGGEQQS 

301 LTVARRCFHD GFDWDKAHI QHTVGFVQNQ HFQTFKINFA ALHQVHQTAR 

351 RGDNQI DRFA QGTGLVAERR AADDADGAEP THIFGIRQRV FLDLSRQFAG 

401 RGQHQSTRAF ARFFAAFGQF LQSR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m284/g284 92.3% identity in 298 aa overlap 



WO 99/57280 



PCT/US99/09346 



646 



10 20 30 40 50 60 

m284 . pep MPSETRNRFQTALVYAAGWGLAVFVTAFAFACKRIAGFAFAFEAFAGFFETVSLKAFFLE 
I I I I I I I I ! i I I I I II I I I I I I I I I I I I I I I I I I : I I I I I I I I I M I I I I I I I I 1 1 I I I 
g 284 MP SETRNRFQT ALVYAAGWG LAV FVT AFAFACKRVAG FAFAFE AFAG F FET V FLKAFFLE 

1.0 20 30 40 50 60 



70 80 90 100 110 120 

m284 . pep T FAAR FAAE AFFAR FM I AA PT AG FT I A P AA FAFR LAACG FA FAGR FH A FDGQ FGQ FS VN V 

I | I I I I I I I j I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I : I I I I I I I I I I : I I I 
g 284 T FAAR FAAE AF FAR FM I AAP AAG F P I APAAFA FR LAACG FAFAGR LHA FDGQFG Q FA VN V 

70 80 90 100 110 120 



130 140 150 160 170 180 

m28 4 . pep FFDFVDFDVFVHFGKRNRNTRAACAAGAPDAVDWFRLFRQVVVDNVGNGRYVDTACGNI 
I I I I I I I I I ! I I II I i I I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I 1 I : I I I I i : 
g284 FFDFVDFDVFVHFGKRNRNTRAACAAGAPDAVDWFRLFROWVYDVGNGRYVNTACGNV 
130 140 150 160 170 180 



190 200 210 220 230 240 

ir.284 . pep GGNQNFAAAFTQIHQRAVAPALRIIIAVQAVCGETFFVQFIRNDFGRGFGGREN MALI DIG 

I I I I I M I I I I I I : I I I I I I I I ! I I I I I ! I I I I I I I I I I I : I I I I I I I I I I I : 1 : I : t 
g 284 GGNQNFAAAFTQIRQRAVAPALRHIAVQAVGGETFFVQFIRDDFGHRFGGRENHTLVDVG 

190 200 210 220 230 240 



250 260 270 280 290 300 

1*284 . pep IAQDMIEQAVFVAHIVAVQQLFFDFALIVHAFDFDDFRVFGQFARQFADRAVPSGGEQQS 

I I M :: I I I I I I I I I I I I I I I : I I I I I I I I i I I I • I I I I I I I I I I I II I II I I I I I 
g284 IAQDVVEQAVFVAHIVAVQQLLFDFALAVHAFDFDNFRVFGQFARQFADCAVPSGGEQX 
250 260 270 280 290 

310 320 330 340 350 360 

m284 . pep LTVARRCFHDGFDVVDKAHIQHTVGFVQNQHFQTFKINFAALHQVHQTARRGDNQIDRFA 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 179>: 

a284 . seq 

1 ATGCCGTCTG AAACTCGAAA TCGGTTTCAG ACGGCATTGG TTTATGCGGC 

51 AGGTTGGGGC TTAGCGGTCT TTGTAACGGC GTTCGCCTTT GCCTGCAAAA 

101 GAATCGCCGG CTTTGCGTTT GCCTTTGAAG CCTTCGCCGG TTTTTTTGAA 

151 ACCGTCTCTC TTAAAGCCTT CTTTCTTGAA ACCTTCGCCG CGCGTTTTGC 

201 CGCCGI^AGCC TTCTTTGCTC GGTTTATGAT CGCCGCGCCA ACCGCCGGAT 

251 TTACGATCGC CCCAGCSGCC TTTGCCTTTC GGCTTGCCGC CTGCGGATTT 

301 GCGTTTGCGG GTCGGTTCCA TGCCTTCGAT GGTCAGTT CG GGCAGTTTTC 

351 GGTTAATGTA TTTTTC3ATT TTGTGGACTT TGACGTATTC GTTCACTTCG 

401 GCAAACGTAA TCGCAATACC CGTGCGGCCT GCGCGGCCGG TGCGCCCGAT 

451 GCGGTGGACG TAGTCTTCCG CCTGTTTCGG CAGGTCGTAG TTGATAACGT 

501 GGGTAATGGT CGGTAC3TCG ATACCGCGTG CGGCAACGTC GGTGGCAACC 

551 AAAATTTTGC AGCGGCCTTT GCGCAAATCC ATCAGCGTGC GGTTGCGCCA 

601 GCCTTGCGGC ATATCGCCGT GCAGGCAGTT GGCGGCGAAA CCTTTTTCGT 

651 ACAATTCATC CGCGATGACT TCGGTCATGG CTTTGGTGGA CGTGAAAATC 

701 ACGCATTGAT CGATGTCGGC ATCGCGCAAG ATATGATCGA GCAGGCGGTT 

751 TTTGTGGCGC ATATCGTCGC AGTACAGCAG TTGTTCTTCG ATTTTGCCTT 

801 GGTCGTCCAC GCGTTCGACT TCGATGATTT CAGGGTCTTT GGTCAGTTTG 

851 CGCGCCAGTT TGCCGACCGC GCCGTCCCAA GTGGCGGAGA ACAACAAAGT 

901 CTGACGGTCT TCCGGCGTGG CTTCGACGAT GGTTTCGATG TCGTCGATAA 

951 AGCCCATATC CAACATACGG TCGGCTTCGT CCAAAATCAG CACTTCCAAG 

1001 CGGGCGAAAT CGACTTTGCC GCTTTGCATC AAGTCCATCA GACGGCCCGG 

1051 CGTGGCGACA ATCAGATCGA CCGGTTTGCT CAGGGCGCGG GTTTGGTAGC 

1101 CGAACGATGC ACCACCGACG ATGCTGACGG TACGGAACCA ACGCATATTT 

1151 TTGGCATACG CCAGCGCGTT TTTCTCGACT TGAGCCGCCA ATTCGCGGGT 

1201 CGGCGTCAAC ACCAACGCGC GCGGGCCTTT GCCCGGTTTT TCGCTGCGTT 

1251 TGGTCAGTCG CTGCAAAGTC GGTAA 

This corresponds to the amino acid sequence <SEQ ED 1 180; ORF 284.a>: 

a284 .pep 

1 MPSETRNRFQ T ALVYAAGWG LAV FVT A FA F ACKRIAGFAF AFEAFAGFFE 
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51 TVSLKAFFLE TFAARFAAEA FFARFWIAAP TAGFT IAPAA FAFRLAACGF 

101 AFAGRFHAFD GQFGQFSVNV FFDFVDFDVF VHFGKRNRNT RAACAAGAPD 

151 AVDVVFRLFR QWVDNVGNG RYVDTACGNV GGNQNFAAAF AQIHQRAVAP 

201 ALRHIAVQAV GGETFFVQFI RDDFGHGFGG RENHALI DVG IAQDMIEQAV 

251 FV AHIVAVQQ LFFDFALVV H AFDFDDFRVF GQFARQFADR AVPSGGEQOS 

301 LTVFRRGFDD GFDVVDKAHI QHTVGFVQNQ HFQAGE1DFA ALHQVHQTAR 

351 RGDNQIDRFA QGAGLVAERC TTDDADGTEP THIFGIRQRV FLDLSRQFAG 

401 RRQHQRARAF ARFFAAFGQS LQSR* 



m284/a284 94.8% identity in 424 aa overlap 



10 20 30 40 50 60 

m28 4 . pep MPSETRNRFQTALVYAAGWGLAVFVTAFAFACKRIAGFAFAFEAFAGFFE TVSLKAFFLE 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a284 MPSETRNRFQTALVYAAGWGLAVFVTAFAFACKRIAGFAFAFEAFAGFFETVSLKAFFLE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m28 4 . pep TFAARFAAEAFFARFMIAAPTAGFTIAPAAFAFRLAACGFAFAGRFHAFDGQFGQFSVNV 
I I 1 I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 
a284 T FAAR FAAE A F FAR FM I AAPTAG FT I A PAAFAFRLAACG FA FAGR FH A FDGQFGQFS VN V 

70 80 90 100 110 120 



130 140 150 160 170 180 

m284 . pep FFDFVDFDVFv/HFGKRNRNTRAACAAGAPDAVDWFRLFRQVVVDNVGNGRYVDTACGNI 
I I I 1 1 M I I I I I I I I I I I I I I I I II I I I i I I I I I I I I I I I I I I I I I I I I I I I ! I I i I I I : 
a284 FFDFVDFDVFVHFGKRNRNTRAACAAGAPDAVDWFRLFRQVWDNVGNGRYVDTACGNV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m28 4 . pep GGN QN FAAAFT Q I HQRAVAPALRH I AV QAV CGE TFFVQFIRNDFGHG FGGRENHAL I D I G 

I I I I I I I I I I: I I I I I ! I I I I I I I I I I I I I I I I I I I II I I : I I I I I I I I I I I I I I I I : I 
a284 GGN QN FAAAFAQ I HQRAVAPALRH I AV QAVGGE T FFVQ FI R D D FG HG FGG REN HAL I DVG 

190 200 210 220 230 240 



250 260 270 280 290 300 

m284 . pep I AQ DM I E QAV FVAH I V AVQQ L FF D FAL I VH A FD FDD FRV FGQ FARQ FADRAV P SGGE QQ S 

I I I I I I I I I I II I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I ! 
a284 I AQ DM I E QAV FVAH I VAVQQ L FFD FAL WHA FD F D D FRV FGQ FARQ FADRAV P SGGE QQ S 

250 260 270 280 290 300 



310 320 330 340 350 360 

m284 . pep LTVARRCFHDGFDWDKAHIQHTVGFVQNQHFQTFKINFAALHQVHQTARRGDNQIDRFA 
III II I I I I I I I I I I I I I I I I I I I !! I I I 1 : : i : i I I I I I I I I M I I I ! I I I I I I I 
a284 LTVFRRGFDDGFDWDKAHIQHTVGFVQNQHFQAGEI DFAALHQVHQTARRGDNQI DRFA 

310 320 330 340 350 360 



370 380 390 400 410 420 

m284 .pep QGTGLVAERRAADDADGAEPTHIFGIRQRVFLDLSRQFAGRGQHQSTRAFARFFAAFGQF 

I I : I I I I I I I I I I I : I I I I I I I I I I ( I I I I I I I I I I I I III : I I I I I I I I I I I I 
a284 QGAGLVAERCTTDDADGTEPTHIFGIRQRVFLDLSRQFAGRRQHQRARAFARFFAAFGQS 

370 380 390 400 410 420 



m284.pep LQSRX 
I I I I I 

a284 LQSRX 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 1 181>: 

g285. seq 

1 atgaccgata ccacaccgac agataccgat ccgaccgaaa acggcacgcg 

51 caaaatgccg tctgaacacc gccccgcccc gccggcaaaa aaacgccgcc 

101 cgctgctgaa gctgtcggcg gcactgctgt ctgtcctgat tttggcagta 
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151 tgtttcctcg gctggatcgc cggtacggaa gcaggtttgc gcttcgggct 

201 gtaccaaatc ccgtcctggt tcggcgtaaa catttcctcc caaaacctca 

251 aaggcacact gctcgacggc ttcgacggcg acaactggtc gatagaaacc 

301 gagggggcag accttaaaat cagccgcttc cgcttcgcgt ggaaaccgtc 

351 cgaact.gatg cgccgcagcc tgcacatcac cgacatctcc gccggcgaca 

401 tcgccatcgt aaccaaaccg actccgccta aagaagaacg cccgcctcaa 

451 ggcctgcccg acagcataga cctgcccgcc gctgtctatc tcgaccgctt 

501 cgagacgggc aaaatcagca tgggcaaaac ctttgacaaa caaaccgtct 

551 atctcgaacg cctcaacgcg gcataccgtt acgaccgtaa agggcaccgc 

601 ctcgacctga aggccgccga cacgccgtgg agcagttcgt cggggtcagc 

651 ctcggtcggc ttgaaaaaac cgtttgccct cgataccgcc atttacacca 

701 aaggcggatt cgaaggcgaa accatacaca gtacggcgcg gctgagcggc 

751 agcctgaagg atgtgcgcgc cgaactgacg atcgaccgcg gcaatatccg 

801 cctctcggga aaatccgtca tccacccgtt tgccgaatca ttggataaaa 

851 cattggaaga agtactggtc aaaggattca acatcaatcc gtccgccttc 

901 gtgccttccc tgcccgatgc cgggctgaat ttcgacctga ccgccatccc 

951 gtcgttttca gacggcatcg cgctggaagg ctcgctcgat ttggaaaaca 

1001 ccaaagccgg ctttgccgac cgcaacggca tccccgtccg tcaggttttg 

1051 cgcggctttg tcatccggca ggacggcacg gtgcatatcg gcaatacgtc 

1101 cgccgccctg ctcggacggg gcggcatcag gctgtcgggc aaaatcgaca 

1151 ccgaaaaaga catccttgat ttaaatatag gcatcaactc cgtcggcgcg 

1201 gaagacgtgc tgcaaaccgc gttcaaaggc aggttggacg gcagcatcgg 

1251 catcggcggc acgaccgcct cgcccaaaat ctcttggcaa ctcggcaccg 

1301 gcacggcacg cacggacggc agcctcccca tcgcaagcga ccccgcaaac 

1351 gaacagcgga aactggtgtt cgacaccgtc aacatctccg ccggggaagg 

1401 cagcctgacc gcgcaaggcb alctcgagcL gtttaaagac cgcctgctca 

1451 agctggacat ccgttcccgc gcattcgacc cttcgcgcat cgatccgcaa 

1501 tttccggcag gcaatatcaa cggttcgatt catcttgccg gtgaactggc. 

1551 aaaagagaaa tttacgggca aaatgcgttt tttgcccggt acgttcaacg 

1601 gcgtgccgat tgccggcagc gccgacattg tttacgagtc ccgccacctt 

1651 ccgcgcgccg ccgtcgattt gcggttgggg cggaacatcg tcaaaacaga 

1701 cggcggcttc ggcaaaaaag gcgaccggct taacctcaat atcaccgcac 

1751 ccgatttatc ccgtttcggt ttcggactcg cggggtcttt aaatgtacgc 

1801 ggacaccttt ccggcgattt ggacggcggc atccgaacct ttgaaaccga 

1851 cctttccggc acggcgcgca acttacacat cggcaaagcg gcagacatcc 

1901 gttcgctcga ttttaccctc aaaggctcac ccggcacaag ccgcc.cqa tg 

1951 cgcgccgata tcaagggcgg ccgcctttcc ctgtcgggcg gcgcggcggt 

2001 tgtcgatacc gccggcctga cgctggaagg tacgggcgcg cagcaccgca 

2051 tccgcacaca cgccgccatg acgctggacg gcaaaccgtt caaactcgat 

2101 ttggacgctt caggcggcat caacagggaa cttacccgat ggaaaggcag 

2151 catcggcatc ctcgacatcg gcggcgcatt caacctcaag ctgcaaaacc 

2201 gtatgacgct cgaagccggt gcggaacacg tggcggcaag tgcggcaaat 

2251 tggcaggcaa tgggcggcag cctcaacctg caacactttt cttgggacag 

2301 gaaaaccggc atatcggcaa aaggcggcgc acgcggcctg cacatcgccg 

2351 agttgcacaa tttcttcaaa ccgcccttcg aacacaatct ggttttaaac 

2401 ggcgactggg atgtcgccta cgggcacaac gcgcgcggct acctcaatat 

2451 cagccggcaa agcggcgatg ccgtattgcc cggcgggcag gctttgggtt 

2501 tgaacgcatt ttccctgaaa acgcgctttc aaaacgaccg catcggaatc 

2551 ctgcttgacg gcggcgcgcg tttcggacgg attaacgccg atttgggcat 

2601 cggcaacgcc ttcggcggca atatggcaaa tacaccgctc ggcggcagga 

2651 ttacagcctc ccttcccgac ttgggcgcat tgaagccctt tctgcccgcc 

2701 gccgcgcaaa acattaccgg cagcctgaat gcctccgcgc aaatcggcgg 

2751 acgggtaggc tctccgtccg tcaatgccgc cgtcaacggt agcagcaact 

2801 acgggaaaat caacggcaat atcaccgtcg ggcaaagccg ctccttcgat 

2851 accgcacctt tgggcggcag gctcaacctg accgttgccg atgccgaagc 

2901 attccgcaac ttcctaccgg tcggacaaac cgtcaaaggc agcctgaatg 

2951 ccgccgtaac cctcggcggc agcatcgccg acccgcactt gggcggcagt 

3001 atcaacggcg acaagcrcta ttaccgcaac caaacccaag gcatcatctt 

3051 ggacaacggc tcgctgcgtt cgcatattgc aggcaggaaa tgggtaatcg 

3101 acagcctgaa attccggcac gaagggacgg cggaactctc cggcacggtc 

3151 agcatggaaa acagcgtgcc cgatgtcgat atcggcgcgg tgttcgacaa 

3201 ataccgcatc ctgtcccgcc ccaaccgccg cctgacggtt tccggcaaca 

3251 cccgcctgcg ctattcgccg caaaaaggca tatccgttac cggtatgatt 

3301 aaaactgatc aggggctgtt cggttcgcaa aaatcctcga tgccgtccgt 

3351 cggcgacgat gtcgtcgtat tgggcgaagt caagaaagag gcggcggcat 

3401 cgctccccgt caatatgaac ctgactttag acctcaatga cggcacccgc 
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3451 ttctccggct acggcgcgga cgttaccata 

3501 cgcgcaaccg ggcggaaatg tgcgtggggt 

3551 aagggcgtta caaagcatac cggcaggatt 

3601 gtctcctttg tcggcccgct caacgacccc 

3651 acgccgcctt tcccccgtcg ctgcgggcgt 

3701 acagcccgcg catitacgctg acggcaaacg 

3751 aagctctcct ggctcatcct caaccgtgcc 

3801 caatgccgcc ctgtccgcag ccgcaggcgc 

3851 acgaccgcat cgggctggtg catgatttgg 

3901 cgcaacgcgc aaaccggcga actcaacccc 

3951 cggcaaacaa ctgaccggca aactctacat 

4001 ccagcgcgga acagtccgtc aaactgattt 

4051 caggcggttg cccgtatcgg cagccgttcg 

4101 caccatacgt ttcgaccgcc tcttcggttc 

4151 gaaacggcaa agggaaataa 

This corresponds to the amino acid sequence <SEQ ID 1182; ORF 285.ng>: 

g285.pep 

1 MTDTTPTDTD PTENGTRKMP SEHRPAPPAK KRRPLLKLS A ALLSVLILAV 

51 CFLGWI AGTE AGLRFGLYQI PSWFGVNISS QNLKGTLLDG FDGDNWSIET 

101 EGADLKISRF RFAWKPSELM RRSLHITDIS AGDIAIVTKP TPPKEERPPQ 

151 GLPDSIDLPA AVYLDRFETG KISMGKTFDK QTVYLERLNA AYRYDRKGHR 

201 LDLKAADTPW SSSSGSASVG LKKPFALDTA IYTKGGFEGE TIHSTARLSG 

251 SLKDVRAELT IDGGNIRLSG KSVIHPFAES LDKTLEEVLV KGFNINPSAF 

301 VPSLPDAGLN FDLTAIPSFS DGIALEGSLD LENTKAGFAD RNGIPVRQVL 

351 GGFVIRQDGT VHIGNTSAAL LGRGGIRLSG KIDTEKDILD LN I GINS VGA 

401 EDVLQTAFKG RLDGSIGIGG TTASPKISWQ LGTGTARTDG SLPIASDPAN 

451 EQRKLVFDTV NISAGEGSLT AQGYLELFKD RLLKLDIRSR AFDPSRIDPQ 

501 FPAGNINGSI HLAGELAKEK FTGKMRFLPG TFNGVPIAGS ADIVYESRHL 

551 PRAAVDLRLG RNIVKTDGGF GKKGDRLNLN ITAPDLSRFG FGLAGSLNVR 

601 GHLSGDLDGG IRTFETDLSG TARNLHIGKA ADIRSLDFTL KGSPGTSRPM 

651 RADIKGGRLS LSGGAAWDT AGLTLEGTGA QHRIRTHAAM TLDGKPFKLD 

701 LDASGGINRE LTRWKGSIGI LDIGGAFNLK LQNRMTLEAG AEHVAASAAN 

751 WQAMGGSLNL QHFSWDRKTG ISAKGGARGL HIAELHNFFK PPFEHNLVLN 

801 GDWDVAYGHN ARGYLNISRQ SGDAVLPGGQ ALGLNAFSLK TRFQNDRIGI 

851 LLDGGARFGR INADLGIGNA FGGNMANTPL GGRITASLPD LGALKPFLPA 

901 AAQNITGSLN ASAQIGGRVG SPSVNAAVNG SSNYGKINGN ITVGQSRSFD 

951 TAPLGGRLNL TVADAEAFRN FLPVGQTVKG SLNAAVTLGG SIADPHLGGS 

1001 INGDKLYYRN QTQGIILDNG SLRSHIAGRK WVIDSLKFRH EGTAELSGTV 

1051 SMENSVPDVD IGAVFDKYRI LSRPNRRLTV SGNTRLRYSP QKGISVTGMI 

1101 KTDQGLFGSQ KSSMPSVGDD VWLGEVKKE AAASLPVNMN LTLDLNDGIR 

1151 FSGYGADVTI GGKLTLTAQP C-GNVRGVGTV RVIKGRYE<AY C-QDLDITKGT 

1201 VSFVGPLNDP NLNIRAERRL SPVGAGVEIL GSLNSPRITL TANEPMSEKD 

1251 KLSWLILNRA GSGSSGDNAA LSAAAGALLA GQINDRIGLV DDLGFTSKRS 

1301 RNAQTGELNP AEQVLTVGKQ LTGKLYIGYE YGISSAEQSV KLIYRLTRAI 

1351 QAVARTGSRS SGGELTYTIR FDRLFGSDKK DSAGNGKGK* 



ggcggcaaac 
gggcacggtc 
tagacattac 
aacctgaaca 
ggaaatattg 
aaccgatgag 
ggcagcggca 
gctgcttgcc 
gctttaccag 
gccgaacagg 
cggctacgaa 
accggctgac 
tcgggcggcg 
ggacaaaaaa 



tgaccctgac 
cgcgtcatca 
caaaggcaca 
tccgcgccga 
ggcagcctca 
tgaaaaagac 
gcagcggcga 
gggcaaatca 
caagcgcagc 
tgctgaccgt 
tacggcatct 
ccgcgccata 
agctgacata 
gactccgcag 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 183>: 

m285.seq 

1 AT GACCG AT A CCGCACCGAC AGATACCGAT CCGACCGAAA ACGGCACGCG 

51 CAAAAT GCCG TCTGAACACC GCCCTACCCC GCCGGCAAAA AAACGCCGCC 

101 CGTTGCTGAA GCTGTCG3CG GCACTGCTGT CTGTCCTGAT TTTGGCAGTA 

151 TGTTTCCTCG GCTGGCTCGC CGGTACGGAA GCAGGTTTGC GCTTCGGGCT 

201 GTACCAAATC CCGTCTTGGT TCGGCGTAAA CATTTCCTCC CAAAACCTCA 

251 AAGGCACGCT GCTCGACGGC TTCGACGGCG ACAACTGGTC GATAGAAACC 

301 GAGGGGGCAG ACCTTAAAAT CAGCCGCTTC CGCTTCGCGT GGAAACCGTC 

351 CGAACTGATG CGCCGCAGCC TGCACATTAC CGAAATTTCC GCCGGCGACA 

401 TCGCCATCGT TACCAAACCG ACTCCGCCTA AAGAAGAACG CCCGCCGCTC 

4 51 AGCCTTCCCG ACAGCATAGA CCTGCCTGCC GCCGTCTATC TCGACCGCTT 

501 CGAGACGGGC AAAATCASCA TGGGCAAAGC CTTTGACAAA CAAACCGTCT 

551 ATCTCGAACG GCTGGATGCT TCATACCGTT ACGACCGCAA AGGACACCGC 

601 CTTGACCTGA AGGCCGCCGA CACGCCGTGG AGCAGTTCGT CGGGGGCGGC 

651 CTCGGTCGGC TTGAAAAAAC CGTTTGCCCT CGATACCGCC ATTTACACCA 

701 AAGGCGGACT CGAAGGCAAA ACCATACACA GTACGGCTCG GCTGAGCGGC 

751 AGCCTGAAGG ATGTGCGCGC CGAACTGGCG ATCGACGGCG GCAATATCCG 
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801 CCTCTCGGGA AAATCCGTCA 

851 CATTGGAAGA AGTACTGGTC 

901 GTGCCTTCCC TGCCCGATGC 

951 GTCGTTTTCA GACGGCATCG 

1001 CCAAAGCCGG CTTTGCCGAC 

1051 GGCGGCTTTG TCATCCGGCA 

1101 CGCCGCCCTG CTCGGACGGG 

1151 CCGAAAAAGA CATCCTCGAT 

1201 GAAGACGTAC TGCAAACCGC 

1251 CATCGGTGGC ACSACCGCCT 

1301 GCACGGCGCG CACGGACGGC 

1351 GGACAGCGGA AACTGGTGCT 

1401 CAGCCTGACC GCGCAAGGCT 

1451 AGCTGGACAT CCGTTCCCGC 

1501 CTTCCGGCAG GCAATATCAA 

1551 AAAAGAGAAA TTCACAGGCA 

1601 GCGT AC CG AT TGCCGGCAGT 

1651 CCGCGTGCCG CCGTCGATTT 

1701 CGGCGGCTTC GGCAAAAAAG 

1751 CCGATTTATC CCGTTTCGGT 

1801 GGACACCTTT CCGGTGATTT 

1851 CCTTTCCGGC GCGGCGCGCA 

193 1 GTTCGCTCGA TTTCACGCTC 

1951 CGCGCCGACA TCAAAGGCAG 

20D1 TGTCGATACC GCCGACCTGA 

2051 TCCGCACACA CGCCGCCATG 

2101 TTGGACGCTT CAGGCGGCAT 

2151 CATCGGCATC CTCGACATCG 

2201 GTATGACGCT CGAAGCCGGT 

2251 TGGCAGGCAA TGGGCGGCAG 

2301 AAAAACCGGC ATATCGGCAA 

2351 AGTTGCACAA TTTCTTCAAA 

24 01 GGCGACTGGG ATGTCGCCTA 

24 51 CAGCCGGCAA AGCGGCGATG 

2501 TGAACGCATT TTCCCTGAAA 

2551 CTGCTTGACG GCGGCGCGCG 

2601 CGCCAACGCC TTCGGCGGCA 

2651 TTACCGCCTC CCTTCCCGAC 

2701 GCCGCGCAAA ACATTACCGG 

2751 ACGGGTAGGC TCTCCGTCCG 

2801 ACGGGAAAAT CAACGGCAAC 

2851 ACCGCGCCTT TGGGCGGCAG 

2901 ATTCCGCAAC TTCCTACCGG 

2951 CCGCCGTAAC CCTCGGCGGC 

3001 ATCAACGGCG ACAAACTCTA 

3051 GGACAACGGC TCGCTGCGTT 

3101 ACAGCCTGAA ATTCCGGCAC 

3151 GGTATGGAAA ACAGCGGACC 

3201 ATACCGCATC CTGTCCCGCC 

3251 CCCGCCTGCG CTATTCGCCG 

3301 AAAACGGATC AGGGGCTGTT 

3351 CGGCGACGAT GTCGTCGTAT 

34 01 CGCTCCCCGT CAATATGAAC 

34 51 TTCGCCGGCT ACGGCGCGGA 

3501 CGCCCAATCG GGCGGAAGCG 

3551 AAGGGCGTTA TAAGGCATAC 

3601 GTCTCCTTTG TCGGCCCGCT 

3651 ACGCCGCCTT TCCCCCGTCG 

3701 ACAGCCCGCG CATTACGCTG 

3751 AAGCTCTCTT GGCTCATCCT 

3801 CAATGCCGCC CTGTCTGCAG 

3851 ACGACCGCAT CGGGCTG3TG 

3901 CGCAACGCGC AAACCGGCGA 

3951 CGGCAAACAA CTGACCG3CA 

4001 CCAGCGCGGA ACAGTCCGTC 

4051 CAGGCGGTTG CCCGTATCGG 



650 



TCCACCCGTT TGCCGAATCA TTGGATAAAA 
AAAGGGTTCA ACATCAATCC GGCCGCCTTC 
CGGACTGAAT TTCGACCTGA CCGCCATCCC 
CGCTGGAAGG TTCGCTCGAT TTGGAAAACA 
CGCAACGGCA TCCCCGTCCG TCAGGTTTTA 
GGACGGCACG GTGCATATCG GCAATACGTC 
GCGGCATCAG GCTGTCGGGC AAAATC G AC A 
TTAAATATAG GCATCAACTC CGTCGGCGCG 
GTTCAAAGGC AGGTTGGACG GCAGCATCGG 
CGCCCAAAAT CTCTTGGCAA CTCGGCATCG 
AGCCTCGCCA TTGCAAGCGA CCCAGCAAAC 
CGACACCGTC AAOATCGCCG CCGGGCAAGG 
ATCTCGAGCT GTTTAAAGAC CGCCTGCTCA 
GCATTCGACC CTTCGCGCAT CGATCCGCAA 
CGGCTCAATA AACCTTGCCG GCGAACTGGC 
AAATGCGGTT TTTACCCGGC ACGTTCAACG 
GCCGACATTG TTTACGAGTC CCGCCACCTT 
GCGGCTGGGG CGGAACATTA TTAAAACAGA 
GCGACCGGCT TAACCTCAAT ATCACCGCAC 
TTCGGACTCG CGGGGTCTTT AAATGTACGC 
GGACGGCGGC ATCCGAACCT TTGAAACCGA 
ACCTGCACAT CGGCAAGGCG GCAGACATCC 
AAAGGTTCGC CCGACACAAG CCGCCCGATA 
CCGCCTTTCG CTGTCGGGCG GAGCGGCGGT 
TGCTGGACGG CACGGGCGTG CAGCACCGCA 
ACGCTGGATG GCAAACCGTT CAAATTCGAT 
CAACAGGGAA CTTACCCGAT GGAAAGGCAG 
GCGGCGCATT C AACC TCAAG CTGCAAAACC 
GCGGAACGCG TGGCGGCAAG TGCGGCAAAT 
CCTCAACCTG CAACACTTTT CTTGGGATAA 
AAGGCGGCGC ACACGGTCTG CATATCGCCG 
CCGCCCTTCG AACACAATCT GGTTTTAAAC 
CGGGCGCAAC GCGCGCGGCT ACCTCAATAT 
CCGTATTGCC CGGCGGGCAG GCTTTGGGTT 
ACGCGCTTTC AAAACGACCG CATCGGAATC 
TTTCGGGCGG ATTAACGCCG ATTTGGGCAT 
ATATGGCAAA TGCACCGCTC GGCGGCAGGA 
TTGGGCGCAT TGAAGCCCTT TCTGCCCGCC 
CAGCCTGAAT GCCGCCGCGC AAATCGGCGG 
TCAATGCCGC CGTCAACGGC AGCAGCAACT 
ATCACCGTCG GGCAAAGCCG CTCTTTCGAT 
GCTCAACCTG ACCGTTGCCG ATGCCGAAGT 
TCGGACAAAC CGTCAAAGGC AGCCTGAATG 
AGCATCGCCG ATCCGCACTT GGGCGGCAGC 
TTACCGCAAC CAAACCCAAG GCATCATCTT 
CGCATATCGC GGGCAGGAAA TGGGTAATCG 
GAAGGGACGG CGGAACTCTC CGGTACGGTC 
CGATGTCGAT ATCGGCGCGG TGTTCGACAA 
CCAACCGCCG CCTGACGGTT TCCGGCAACA 
CAAAAAGGCA TATCCGTTAC CGGCATGATT 
CGGTTCGCAA AAATCCTCGA TGCCGTCCGT 
TAGGCGAAGT CAAAAAAGAG GCGGCGGCAC 
CTGACTTTAG ACCTCAATGA CGGCATCCGC 
CGTTACCATA GGCGGCAAAC TGACCCTGAC 
TACGGGGCGT GGGCACGGTC C GCGT CAT CA 
GGGCAGGATT TGGACATTAC CAAAGGCACG 
CAACGATCCC AACCTCAACA TCCGCGCCGA 
GTGCGGGCGT GGAAATATTG GGCAGCCTCA 
ACGGCAAACG AACCGATGAG TGAAAAAGAC 
CAACCGCGCC GGCAGCGGCA GCAGCGGCGA 
CCGCAGGTGC GCTGCTTGCC GGGCAAATCA 
GATGATTTGG GCTTTACCAG CAAGCGCAGC 
ACTCAACCCC GCCGAACAGG TGCTGACCGT 
AACTCTACAT CGGCTACGAA TACAGCATCT 
AAACTGATTT ACCGGCTGAC CCGCGCCATA 
CAGCCGTTCG TCGGGCGGCG AGCTGACATA 
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4101 CACCATACGT TTCGACCGCT TCTCCGGTTC GGACAAAAAA GACTCCGCCG 
4151 GAAACGGCAA AGGAAAATAA 

This corresponds to the amino acid sequence <SEQ ID 1 184; ORF 285>: 

m205.pep 

1 MTDTAPTDTD PTENGTRKMP SEHRPTPPAK KRRPLLKLS A ALLS VLI LAV 

51 CFLGWL AGTE AGLRFGLYQI PSWFGVNISS QNLKGTLLDG FDGDNWSIET 

101 EGADLKISRF RFAWKPSELM RRSLHITEIS AGDIAIYTKP TPPKEERPPL 

151 SLPDSIDLPA AVYLDRFETG KISMGKAFDK QTVYLERLDA SYRYDRKGHR 

201 LDLKAADTPW SSSSGAASVG LKKPFALDTA IYTKGGLEGK TIHSTARLSG 

251 SLKDVRAELA IDGGNIRLSG KSVIHPFAES LDKTLEEVLV KGFNINPAAF 

301 VPSLPDAGLN FDLTAIPSFS DGIALEGSLD LENT KAG FAD RNG I PVRQVL 

351 GGFVIRQDGT VHIGNTSAAL LGRGGIRLSG KIDTEKDILD LN I GIN SVGA 

401 EDVLQTAFKG RLDGSIGIGG TTASPKISWQ LGIGTARTDG SLAIASDPAN 

451 GQRKLVLDTV NIAAGQGSLT AQGYLELFKD RLLKLDIRSR AFDPSRIDPQ 

501 LPAGNINGSI NLAGELAKEK FTGKMRFLPG TFNGVPIAGS ADIVYESRHL 

551 PRAAVDLRLG RNIIKTDGGF GKKGDRLNLN ITAPDLSRFG 7GLAGSLNVR 

601 GHLSGDLDGG IRTFETDLSG AARNLHIGKA ADIRSLDFTL KGSPDTSRPI 

651 RADIKGSRLS LSGGAAWDT ADLMLDGTGV QHRIRTHAAM TLDGKPFKFD 

701 LDASGGINRE LTRWKGSIGI LDIGGAFNLK LQNRMTLEAG AERVAASAAN 

751 WQAMGGSLNL QHFSWDKKTG ISAKGGAHGL HIAELHNFFK PPFEHNLVLN 

801 GDWDVAYGRN ARGYLNISRQ SGDAVLPGGO ALGLNAFSLK TRFQNDRIGI 

851 LLDGGARFGR INADLGIANA FGGNMANAPL GGRITASLPD LGALKPFLPA 

901 AAQNITGSLN AAAQIGGRVG SPSVNAAVNG SSNYGKINGN ITVGQSRSFD 

951 TAPLGGRLNL TVADAEVFRN FLPVGQTVKG SLNAAVTLGG SIADPHLGGS 

1001 INGDKLYYRN QTQGIILDNG SLRSHIAGRK WVIDSLKFRH EGTAELSGTV 

1051 GMENSGPDVD IGAVFDKYRI LSRPNRRLTV SGNTRLRYSP QKGISVTGMI 

1101 KTDQGLFGSQ KSSMPSVGDD VWLGEVKKE AAAPLPVNMN LTLDLNDGIR 

1151 FAGYGADVTI GGKLTLTAQS GGSVRGVGTV RVIKGRYKAY GQDLDITKGT 

1201 VSFVGPLNDP NLNIRAERRL SPVGAGVEIL GSLNSPRITL TANEPMSEKD 

1251 KLSWLILNRA GSGSSGDNAA LSAAAGALLA GQINDR1GLV DDLGFTSKRS 

1301 RNAQTGELNP AEQVLTVGKQ LTGKLYIGYE YSISSAEQSV KLIYRLTRAI 

1351 QAVARIGSRS SGGELTYTIR FDRFSGSDKK DSAGNGKGK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ra285/g285 96.5% identity in 1389 aa overlap 

10 20 30 40 50 60 

MTDTAPTDTDPTENGTRKMPSEHRPTPPAKKRRPLLKLSAALLSVLILAVCFLGWLAGTE 
I I I I : I I I I I I I I I I I I I I II I I II : I I I I I I II I I I I I i I II I i I I I I I I I I I I : I I I I 
MTDTTPTDTDPTENGTRKMPSEHRPAPPAKKRRPLLKLSAALLSVLILAVCFLGWIAGTE 
10 20 30 40 50 60 

70 80 90 100 110 120 

AGLRFGLYQIPSWFGVNISSQNLKGTLLDGFDGDNWSIETEGADLKISRFRFAWKPSELM 

I 1 1 I I II I I I I I II 1 1 I I I I I I I I I I I M I I 1 1 I I I 1 1 1 I 1 1 I I I II I I I I I I 1 1 I I M I 
AGLRFGLYQI PSWFGVN I SSQNLKGTLLDGFDGDNWS I ETEGADLKISRFRFAWKPSELM 
70 80 90 100 110 120 

130 140 150 160 170 180 

RRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDSIDLPAAVYLDRFETGKISMGKAFDK 
I I I I I I 1 : I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I II I I I I M I I I I I I I : I II 
RRSLHITDISAGDIAIVTKPTPPKEERPPQGLPDSIDLPAAVYLDRFETGKISMGKTFDK 
130 140 150 160 170 180 

190 200 210 220 230 240 

QTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSGAASVGLKKPFALDTAIYTKGGLEGK 
I I i I I I II : I : I I I 1 I I I I I I I 1 ! I M I I I I I I I I : I I I I I M I | M M I I I I M I : I I : 
QTVYLERLNAAYRYDRKGHRLDLKAADTPWSSSSGSASVGLKKPFALDTAI YTKGGFEGE 
190 200 210 220 230 240 

250 260 270 280 290 300 

TIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIHPFAESLDKTLEEVLVKGFNINPAAF 



m285.pep 
g285 

m285 .pep 
g285 

m285 .pep 
g285 

m285.pep 
g285 

ra285 .pep 
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I I I I I I I I I I I I I I I I I I I : I I I II I I I I I I I I I I I I ! I I I I I I I i I I I I I M I I I I : | I 
g285 T IHSTARLSGSLKDVRAELTI DGGNIRLSGKSVIHPFAE5LDKTLEEVLVKGFN INPSAF 

250 260 270 280 290 300 



310 320 330 340 350 360 

m285.pep VPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTKAGFADRNGIPVRQVLGGFVIRQDGT 

1 1 1 1 1 1 1 1 ii 1 1 1 ii 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 

g285 VPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTKAGFADRNGIPVRQVLGGFVIRQDGT 
310 320 330 340 350 360 

370 380 390 400 410 420 

m285.pep VHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGINSVGAEDVLQTAFKGRLDGSIGIGG 
I I I I I I I I I I I II I 1 I I II II I I I I II II I I I I I I ! I I I I I I II II I I I I I I I I I I I I I I 
g285 VHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGINSVGAEDVLOTAFKGRLDGSIGIGG 
370 380 390 400 410 420 

430 440 450 460 470 480 

m285.pep TTAS PKI SWQLGIGTARTDGSLAIASDPANGQRKLVLDTVNIAAGQGSLTAQGYLELFKD 

I I II I I I II I I I I I I I I I 1 I I I I I I I I I I I II I : I I I I I : I I : I I I I I I II I I I I I I 
g285 TTAS PKI SWQLGTGTARTDGSLP IAS DPANEQRKLVFDTVNISAGEG3LTAQGYLELFKD 

430 440 450 460 470 480 



490 500 510 520 530 540 

n\285.pep RLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGELAKEKFTGKMRFLPGTFNGVPIAGS 
I I I I I I I I I I I I I I I I I I I I : 1 I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 11 I I I M 
g285 RLLKLDIRSRAFDPSRIDP2FPAGNINGSIHLAGELAKEKFTGKMRFLPGTFNGVPIAGS 
490 500 510 520 530 540 

550 560 570 580 590 600 

m285.pep ADIVYESRHLPRAAVDLRLGRNIIKTDGGFGKKGDRLNLNITAPDLSRFGFGLAGSLNVR 
I I I I I I I I I I I I I I I I I I I I I II : I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
g285 ADIVYESRHLPRAAVDLRLGRNIVKTDGGFGKKGDRLNLNITAPDLSRFGFGLAGSLNVR 
550 560 570 580 590 600 

610 620 630 640 650 660 

m285.pe>p GHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRSLDFTLKGSPDTSRPIRADIKGSRLS 
. I I I I I I I I I I I I I I I I 1 I I I : I I I I I I I II I I I I I I I II I I I I I I I I I : I I I I I I : I I I 
g285 GHLSGDLDGGIRTFETDLSGTARNLHIGKAADIRSLDFTLKGSPGTSRPMRADIKGGRLS 

610 620 630 640 650 660 

670 680 690 700 710 720 

m285.pep LSGGAAWDTADLMLDGTGVQHRIRTHAAMTLDGKPFKFDLDASGGINRELTRWKGSIGI 
I I 1 I I I I I I I I I I : I I I : I I I I I I I I I I I I I I I I I I : I I I I I II I M II I II I I I I I I 
g285 LSGGAAWDTAGLTLEGTGAQHRIRTHAAMTLDGKPFKLDLDASGGINRELTRWKGSIGI 

670 680 690 700 710 720 



730 740 750 760 770 780 

m285 . pep LDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMGGSLNLQHFSWDKKTGISAKGGAHGL 
I I I I I I I I I II I I I I I I I II I I: I I I I I II I I I I I I I I I I I I I I I I: I I I M I I I I I : I I 
g285 LDIGGAFNLKLQNRMTLEAGAEHVAASAANWQAMGGSLNLQHFSWDRPCTGISAKGGARGL 

730 740 750 760 770 780 



790 800 810 820 830 840 

m285 . pep H I AE LHN F FK P P FE HN L V LN G DW D V A YGRN ARG Y LN I S RQ SG DAV L PGG QALG LN AF S LK 

I I I I I I I I I I I I I I I I I I I I II I I I I I I :| I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
g285 HIAELHNFFKPPFEHNLVLNGDWDVAYGHNARGYLNISRQSGDAVLPGGQALGLNAFSLK 

790 800 810 820 830 840 



850 860 870 880 890 900 

m285 . pep T RFQN DR I G I LLDGG AR FGR I N A DLG I AN AFGG NMAN A PLGGRITASLPD LGALK P FLP A 

I I I I I I I I I I I I I I M I i I I I I I I I I I : I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 
g285 TRFQNDRIGILLDGGARFGRINADLGIGNAFGGNMANTPLGGRITASLPDLGALKPFLPA 

850 860 870 880 890 900 



m285.pep 



910 920 930 940 950 960 

AAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 
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II I I I I I I I I I : I I ! I I I I I I I I I II I I I I II I I I I I II I I I I I I 1 I I I I I I I i I I | | I | 

g285 AAQNITGSLNASAQIGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 
910 920 930 94 0 950 960 



970 980 990 1000 1010 1020 

m285.pep T VA D AE V FRN FL P VGQT VKG S LN AA VT LGG S I A D PH LGG S I NG DK L Y Y RNQT QG 1 1 L DNG 

II I I I I : I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I II I I I 
g2 8 5 TVADAEAFRN FLPVGQTVKGSLNAAVT LGGS I ADPHLGGS INGDKLYYRNQTQGI I LDNG 

970 980 990 1000 1010 1020 

1030 1040 1050 1060 1070 1080 

Iti285.pep SLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENSGPDVDIGAVFDKYRILSRPNRRLTV 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I : I II I I I I I I I I I I I I I I I I I I || I I M 1 
g285 SLRSHIAGRKWVIDSLKFRKEGTAELSGTVSMENSVPDVDIGAVFDKYRILSRPNRRLTV 

1030 1040 1050 1060 1070 1080 

1090 HOG 1110 1120 1130 1140 

m285 . pep SGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMPSVGDDV\ r VLGEVKKEAAAPLPVNMN 
I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I II 
g285 S GN TRL R Y S PQKG I S VT GMIKTDQGLFGSQKSSMPSVGD D VW LG E VKKEAAAS L P VNMN 

1090 HOC 1110 1120 1130 1140 



1150 116C 1170 1180 1190 1200 

m285.pep LTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVRGVGTVRVIKGRYKAYGQDLDITKGT 
I II I I I I I 1 I I : I i I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g285 LTLDLN DGIRF5GYGADVT I GGKLT LTAQPGGNVRGVGTVRVI KGRYKAYGQDLD I TKGT 

1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

m285.pep . V S FVGP LN D PN LN I RAERR L S P VGAG VE ILGSLN SPRIT LT AN E PM SEK DKL S WL I LN RA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 

g285 V S FVG P LN D PNLN I RAERR LS P VGAG VE I LG S LNS PRI T LT ANE PM SEK DKL S WL I LN RA 

1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

m285 . pep GSGSSGDNAALSAAAGALLAGOINDRIGLVDDLGFTSKRSRNAOTGELNPAEQVLTVGKQ 
I I I I I I I I I I I I I M I I I I I I I I I I 1 I t I i I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
g285 GSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGFTSKRSRNAQTGELNPAEQVLTVGKQ 

1270 1280 1290 1300 1310 1320 

1330 1340 1350 1360 1370 1380 

m285.pep LTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRFSGSDKK 
I I I I I I I I I I I : I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 
g285 LTGKLYIGYEYGISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRLFGSDKK 

1330 1340 1350 1360 1370 1380 



m2 8 5 . pep 
g285 



1390 
DSAGNGKGKX 
I I I I I I I I I I 
DSAGNGKGKX 



The following partial DNA sequence was 

a285.seq 

1 ATGACCGATA CCGCACCGAC AGATACCGAT 

51 CAAAATGCCG TCTGAACACC GCCCTACCCC 

101 CGCTGCTGAA GCTGTCGGCG GCACTGCTGT 

151 TGTTTCCTCG GCTGGCTCGC CGGCACGGAA 

201 GTACCAAATC CCGTCTTGGT TCGGCGTAAA 

251 AAGGCACGCT GCTCGACGGC TTCGACGGCG 

301 GAGGGGGCAG ACCTTAAAAT CAGCCGCTTC 

351 CGAACTGATG CGCCGCAGCC TGCACATTAC 

4 01 TCGCCATCGT TACCAAACCG ACTCCGCCTA 

4 51 AGCCTTCCCG ACAGCATAGA CCTGCCTGCC 

501 CGAGACGGGC AAAATCAGCA TGGGCAAAGC 

551 ATCTCGAACG GCTGGATGCT TCATACCGTT 

601 CTCGACCTGA AGGCTGCCGA CACGCCGTGG 



identified in N. meningitidis <SEQ ID 1 185>: 



CCGACCGAAA 
GCCGGCAAAA 
CTGTTCTGAT 
GCGGGTTTGC 
CATTTCCTCC 
ACAACTGGTC 
CGCTTCGCGT 
CGAAATTTCC 
AAGAAGAACG 
GCCGTCTATC 
CTTTGACAAA 
ACGACCGCAA 
AGCAGTTCGT 



ACGGCACGCG 
AAACGCCGCC 
TTTGGCAGTA 
GCTTCGGGCT 
CAAAACCTCA 
GATAGAAACC 
GGAAACCGTC 
GCCGGCGACA 
CCCGCCGCTC 
TCGACCGCTT 
CAAACCGTCT 
AGGACACCGC 
CGGGGTCAGC 
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651 CTCGGTCGGC TTGAAAAAAC CGTTTGCCCT CGATACCGCC ATTTACACCA 

701 AAGGCGGACT CGAAGGCAAA ACCATACACA GTACGGCTCG GCTGAGCGGC 

751 AGCCTGAAGG ATGTGCGCGC CGAACTGGCG ATCGACGGCG GCAATATCCG 

801 CCTCTCGGGA AAATCCGTCA TCCACCCGTT TGCCGAATCA TTGGATAAAA 

851 CATTGGAAGA AGTACTGGTC AAAGGGTTCA ACATCAATCC GTCCGCCTTC 

901 GTGCCTTCCC TGCCCGATGC CGGGCTGAAT TTCGACCTGA CCGCCATCCC 

951 GTCGTTTTCA GACGGCATCG CGCTGGAAGG CTCGCTCGAT TTGGAAAACA 

1001 CCAAAGCCGG CTTTGCCGAC CGCAACGGCA TCCCCGTCCG TCAGGTTTTA 

1051 GGCAGCTTTG TCATCCGGCA GGACGGCACG GTGCATATCG GCAATACGTC 

1101 CGTCGCCCTG CTCGGACGGG GCGGCATCAG GCTGTCGGGC AAAATCGACA 

1151 CCGAAAAAGA CATCCTCGAT TTAAATATAG GCATCAACTC CGTCGGCGCG 

1201 GAAGACGTAC TGCAAACCGC GTTCAAAGGC AGGTTGGACG GCAGCATCGG 

1251 CATCGGTGGC ACGACCGCCT CGCCCAAAAT CTCTTGGCAA CTCGGCATCG 

1301 GCACGGCGCG CACGGACGGC AGCCTCGCCA TTGCAAGCGA CCCCGCAAAC 

1351 C-GACAGCGGA AACTGGTGCT CGACACCGTC AACATCGCCG CCGGGCAAGG 

1401 CAGCCTGACC GCGCAAGGCT ATCTCGAGCT GTTTAAAGAC CGCCTGCTCA 

14 51 AGCTGGACAT CCGTTCCCGC GCATTCGACC CTTCGCGCAT CGATCCGCAA 

1501 CTTCCGGCAG GCAATATCAA CGGCTCAATA AACCTTGCCG GCGAACTGGC 

1551 AAAAGAGAAA TTCACAGGCA AAATGCGGTT TTTACCCGGC ACGTTCAACG 

1601 GCGTACCGAT TGCCGGCAGT GCCGACATTG TTTACGAGTC CCGCCACCTT 

1651 CCGCGTGCCG CCGTCGATTT GCGGCTGGGG CGGAACATTA TTAAAACAGA 

1701 CGGCGGCTTC GGCAAAAAAG GCGACCGGCT TAACCTCAAT ATCACCGCAC 

1751 CCGATTTATC CCGTTTCGGT TTCGGACTCG CGGGGTCTTT AAATGTACGC 

1801 GGACACCTTT CCGGCGATTT GGACGGTGGC ATCCGAACCT TTGAAACCGA 

1851 CCTTTCCGGC GCGGCGCGCA ACCTGCACAT CGGCAAGGCG GCAGACATCC 

1901 GTTCGCTCGA TTTCACGCTC AAAGGTTCGC CCGACACAAG CCGCCCGATA 

1951 CGCGCCGACA TCAAAGGCAG CCGCCTTTCG CTGTCGGGCG GAGCGGAGGT 

2001 TGTCGATACC GCCGACCTGA TGCTGGACGG CACGGGCGTG CAGCACCGCA 

2051 TCCGCACACA CGCCGCCATG ACGCTGGATG GCAAACCGTT CAAATTCGAT 

2101 TTGGACGCTT CAGGCGGCAT CAACAGGGAA CTTACCCGAT GGAAAGGCAG 

2151 CAT CGG CATC CTCGACATCG GCGGCGCATT CAACCTCAAG CTGCAAAACC 

2201 GTATGACGCT CGAAGCCGGT GCGGAACGCG TGGCGGCAAG TGCGGCAAAT 

2251 TGGCAGGCAA TGGGCGGCAG CCTCAACCTG CAACACTTTT CTTGGGATAA 

2301 AAAAACCGGC ATATCGGCAA AAGGCGGCGC ACACGGTCTG CATATCGCCG 

2351 AGTTGCACAA TTTCTTCAAA CCGCCCTTCG AACACAATCT GGTTTTAAAC 

2401 GGCGACTGGG ATGTCGCCTA CGGGCGAAAC GCGCGCGGCT ACCTCAATAT 

24 51 CAGCCGGCAA AGCGGCGATG CCGTATTGCC CGGCGGGCAG GCTTTGGGTT 

2501 TGAACGCATT TTCCCTGAAA ACGCGCTTTC AAAACGACCG TATCGGAATC 

2551 CTGCTTGACG GCGGCGCGCG TTTCGGGCGG ATTAACGCCG ATTTGGACAT 

2601 CGGCAACGCC TTCGGCGGCA ATATGGCAAA TGCACCGCTC GGCGGCAGGA 

2651 TTACCGCCTC CCTTCCCGAC TTGGGCACAT TGAAGCCCTT TCTGCCCGCC 

2701 GCCGCGCAAA ACATTACCGG CAGCCTGAAT GCCGCCGCGC AAATCGGCGG 

2751 ACGGGTCGGC TCTCCGTCCG TCAATGCCGC CGTCAACGGC AGCAGCAACT 

2801 ACGGGAAAAT CAACGGCAAC ATCACCGTCG GGCAAAGCCG CTCTTTCGAT 

2851 ACCGCGCCTT TGGGCGGCAG GCTCAACCTG ACCGTTGCCG ATGCCGAAGT 

2901 ATTCCGCAAC TTCCTACCGG TCGGACAAAC CGTCAAAGGC AGCCTGAATG 

2951 CCGCCGTAAC CCTCGGCGGC AGCATCGCCG ATCCGCACTT GGGCGGCAGC 

3001 ATCAACGGCG ACAAACTCTA TTACCGCAAC CAAACCCAAG GC AT CAT CTT 

3051 GGACAACGGC TCGCTGCGTT CGCATATCGC GGGCAGGAAA TGGGTAATCG 

3101 ACAG CCTGAA ATTCCGGCAC GAAGGGACGG CGGAACTCTC CGGTACGGTC 

3151 GGTATGGAAA ACAGCGGACC CGATGTCGAT ATCGGCGCGG TGTTCGACAA 

3201 ATACCGCATC CTGTCCCGCC CCAACCGCCG CCTGACGGTT TCCGGCAACA 

3251 CCCGCCTGCG CTATTCGCCG CAAAAAGGCA TATCCGTTAC CGGGATGATT 

3301 AAAACGGATC AGGGGCTGTT CGGTTCGCAA AAATCCTCGA TGCCGTCCGT 

3351 CGGCGACGAT GTCGTCGTAT TAGGCGAAGT CAAAAAAGAG GCGGCGGCAC 

3401 CGCTCCCCGT CAATATGAAC CTGACTTTAG ACCTCAATGA CGGCATCCGC 

3451 TTCGCCGGCT ACGGCGCGGA CGTTACCATA GGCGGCAAAC TGACCCTGAC 

3501 CGCCCAATCG GGCGGAAGCG TGCGGGGCGT GGGCACGGTC CGCGTCATCA 

3551 AAGGGCGTTA TAAGGCATAC GGGCAGGATT TGGACATTAC CAAAGGCACG 

3601 GTCTCCTTTG TCGGCCCGCT CAACGACCCC AACCTCAACA TCCGCGCCGA 

3651 ACGCCGCCTT TCCCCCGTCG GTGCGGGCGT GGAAATATTG GGCAGCCTCA 

3701 ACAGTCCGCG CATTACGCTG ACGGCAAACG AACCGATGAG TGAAAAAGAC 

3751 AAGCTCTCCT GGCTCATCCT CAACCGCGCC GGCAGTGGCA GCAGCGGCGA 

3801 CAATGCCGCC CTGTCCGCAG CCGCCGGCGC GCTGCTTGCC GGGCAAATCA 

3851 ACGACCGCAT CGGGCTGGTG GATGATTTGG GCTTTACCAG CAAGCGCAGC 

3901 CGCAACGCGC AAACCGGCGA ACTCAACCCC GCCGAACAGG TGCTGACCGT 
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3951 CGGCAAACAA CTGACCGGCA AACTCTACAT CGGCTACGAA TACAGCATCT 

4 001 CCAGCGCGGA ACAGTCCGTC AAACTGATTT ACCGGCTGAC CCGCGCCATA 

4 051 CAGGCGGTTG CCCGTATCGG CAGCCGTTCG TCGGGCGGCG AGCTGACATA 

4101 CACCATACGT TTCGACCGCT TCTCCGGTTC GGACAAAAAA GACTCCGCCG 
4151 GAAACAGCAA AGGAAAATAA 

This corresponds to the amino acid sequence <SEQ ID 1 186; ORF 285. a>: 

a285.pep 

1 MTDTAPTDTD PTENGTRKMP SEHRPTPPAK KRRPLLK LSA ALLSVLILAV 

51 CFLG WLAGTE AGLRFGLYQI PSWFGVNISS QNLKGTLLDG FDGDNWSIET 

101 EGADLKISRF RFAWKPSELM RRSLHITEIS AGDIAIVTKP TPPKEERPPL 

151 SLPDSIDLPA AVYLDRFETG KISMGKAFDK QTVYLERLDA SYRYDRKGHR 

201 LDLKAADTPW SSSSGSASVG LKKPFALDTA IYTKGGLEGK TIHSTARLSG 

251 SLKDVRAELA IDGGNIRLSG KSVIHPFAES LDKTLEEVLV KGFNINPSAF 

301 VPSLPDAGLN FDLTAIPSFS DGIALEGSLD LENTKAGFAD RNGIPVRQVL 

351 GSFVIRQDGT VKIGNTSVAL LGRGGIRLSG KIDTEKDILD LNIGINSVGA 

4 01 EDVLQTAFKG RLDGSIGIGG TTASPKISWQ LGIGTARTDG SLAIASDPAN 

4 51 GQRKLVLDTV NIAAGQGSLT AQGYLELFKD RLLKLDIRSR AFDPSRIDPQ 

501 LPAGNINGSI NLAGELAKEK FTGKMRFLPG TFNGVPIAGS ADIVYESRHL 

551 PRAAVDLRLG RNIIKTDGGF GKKGDRLNLN ITAPDLSRFG FGLAGSLNVR 

601 GHLSGDLDGG IRTFETDLSG AARNLHIGKA ADIRSLDFTL KGSPDTSRPI 

651 RADIKGSRLS LSGGAEWDT ADLMLDGTGV QHRIRTHAAM TLDGKPFKFD 

701 LDASGGINRE LTRWKGSIGI LDIGGAFNLK LQNRMTLEAG AERVAASAAN 

7 51 WQAMGGSLNL QHFSWDKKTG ISAKGGAHGL HIAELHNFFK PPFEKNLVLN 

801 GDWDVAYGRN ARGYLNISRQ SGDAVLPGGQ ALGLNAFSLK TRFQNDRIGI 

851 LLDGGAR^R INADLDIGNA FGGNMANAPL GGRITASLPD LGTLKPFLPA 

901 AAQNITGSLN AAAQIGGRVG SPSVNAAVNG SSNYGKINGN ITVGQSRSFD 

951 TAPLGGRLNL TVADAEVFRN FLPVGQTVKG SLNAAVTLGG SIADPHLGGS 

1001 INGDKLYYRN QTQGIILDNG SI.RSHTAGRK WVIDSLKFRH EGTAELSGTV 

1051 GMENSGPDVD IGAVFDKYRI LSRPNRRLTV SGNTRLRYSP QKGISVTGMI 

1101 KTDQGLFGSQ KSSMPSVGDD WVLGEVKKE AAAPLPVNMN LTLDLNDGIR 

1151 FAGYGADVTI GGKLTLTAQS GGSVRGVGTV RVIKGRYKAY GQDLDITKGT 

1201 VSFVGPLNDP NLNIRAERRL SPVGAGVEIL GSLNSPRITL TANEPMSEKD 

1251 KLSWLILNRA GSGSSGDNAA LSAAAGALLA GQINDRIGLV DDLGFTSKRS 

1301 RNAQTGELNP AEQVLTVGKQ LTGKLYIGYE YSISSAEQSV KLIYRLTRAI 

1351 QAVARIGSRS SGGELTYTIR FDRFSGSDKK DSAGNSKGK* 

m285/s285 99.4% identity in 1389 aa overlap 

1C 20 30 40 50 60 

m285.pep MT DT APT DT D PT EN GTRKM PS E HR PT PP AKKRR PL LKLS AAL L S VL I LAV C FLGWL AGT E 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I 
a2B5 MTDTAPTDTDPTENGTRKMPSEHRPTPPAKKRRPLLKLSAALLSVLILAVCFLGWLAGTE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m285.pep AGLRFGLYQI PSWFGVNISSQNLKGTLLDGFDGDNWSIETE3ADLKISRFRFAWKPSELM 

I I I I II M I I I I I I I I I I II I I I I I I I II I II I I I II I M I I I H I I I II I I I I I I I I I 
a285 AGLRFGLYQI PSWFGVTJ IS SQNLKGTLLDGFDGDNWSIETEGADLKISRFRFAWKPSELM 

70 80 90 100 110 120 

. 130 140 150 160 170 180 

m285 . pep RRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDSIDLPAAVYLDRFETGKISMGKAFDK 
I I I I I I I I II 1 1 I I ! 1 1 1 1 1 I I I I I I I I 1 1 I I 1 1 I 1 1 I I I I M I I I I I 1 1 I I 1 1 I I I M 1 
a285 RRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDSIDLPAAVYLDRFETGKISMGKAFDK 
13C 140 150 160 170 180 

190 200 210 220 230 240 

m285.pep QTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSGAASVGLKKPFALDTAIYTKGGLEGK 
I !! I I I I t II I I 1 I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I M M I I I I I I I I 1 I I I 
a285 QTVYLERLDAS YR YDRKGHRLDLKAADTPWSSSSGS AS VGLKKPFALDT A IYTKGGLEGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m285.pep TIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIHPFAESLDKTLEEVLVKGFNINPAAF 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I 
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a28S 



TIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIHPFAESLDKTLEEVLVKGFNINPSAF 
250 260 270 280 290 300 



310 320 330 340 350 360 

m285.pep VPSLPDAGLNFDLTAIPSFSDGIALEGSLDLENTKAGFADRNGIPVRQVLGGFVIRQDGT 
I I I 1 1! I I I I I 1 1 M I 1 1 II 1 1 I I I I I I I I I I II I I 1 1 I I I I 1 1 I 1 1 I I I I : I M M I I I 
a285 VPSLPDAGLNFDLTATPSFSDGIALEGSLDLENTKAGFADRNGIPVRQVLGSFVIRQDGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m285.pep VHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGINSVGAEDVLQTAFKGRLDGSIGIGG 
I I I I M I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II I \ I 
a285 VHIGNTSVALLGRGGIRLSGKIDTEKDILDLNIGINSVGAEDVLQTAFKGRLDGSIGIGG 

370 380 390 400 410 420 

430 440 450 460 470 480 

m285.pep TTASPKISWQLGIGTARTDGSLAIASDPANGQRKLVLDTVNIAAGQGSLTAQGYLELFKD 

1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 ! i 1 1 1 i 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii mi 1 1 

a285 T T AS PK I S WQLG I G TART DG S LA I AS D PAN GQRK LVL DT VN I AAGQG S L T AQG YLE L FKD 

430 440 450 460 470 480 

490 500 510 520 530 540 

m285.pep RLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGELAKEKFTGKMRFLPGTFNGVPIAGS 
I I I I I M I I I I 1 1 I I I I I I 1 1 1 I 1 1 I I I I 1 1 I I I I II I I I I I I I I I I I I I I I I I I I 1 1 I I 

a285 RLLKLDIRSRAFDPSR1DPQLPAGNINGSINLAGELAKEKFTGKMRFLPGTFNGVPIAGS 
490 500 510 520 530 540 

550 560 570 580 590 600 

m285.pep AD I VYE S RHL PRAAV DLRLGRN 1 1 KT D GG FGKKG DRLN LN I TAP DL S R FG FGL AG S LNVR 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 1 II I I 
a285 A D I VYE S RHL PRAA VDLRLG RN 1 1 KT DGG FGKKG DRLNLN I TAP DL 3RFG FGLAG S LNVR 

550 560 570 580 590 600 

610 620 630 640 650 660 

m285.pep GHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRSLDFTLKGSPDT3RPIRADIKGSRLS 
I 1 1 1 1 I 1 1 I I M I 1 1 I 1 1 II I It I I 1 1 I I I I I I I II I I I I I I I I I III I I I I I M I 1 1 I I 
a285 GHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRSLDFTLKGSPDT3RPIRADIKGSRLS 

610 620 630 640 650 660 

670 680 690 700 710 720 

m285 . pep LSGGAAWDTADLMLDGTGVQHRIRTHAAMTLDGKPFKFDLDASGGINRELTRV3KGSIGI 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I ! I I I I I I I I I I I I I I I I I I 
a285 LSGGAEWDTADLMLDGTGVQHRIRTHAAMTLDGKPFKFDLDASGGINRELTRWKGSIGI 

670 680 690 700 710 720 

730 740 750 760 770 780 

m285 . pep LDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMGGSLNLQHFSWDKKTGISAKGGAHGL 
I I I I I I I I I I I I I i I I 1 I I I I I I I I I I 1 II I I I I I I I I I 1 I I I I II I I I I I I I I I I I I I I 
a285 LDI GGAFNLKLQNRMTLEAGAERVAASAANWQAMGGSLNLQHFSWDKKTG I SAKGGAHGL 

730 740 750 760 770 780 

790 800 810 820 830 840 

m285 . pep HIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYLNISRQSGDAVLPGGQALGLNAFSLK 
I I I I I I I I I II I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II 
a285 HIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYLNISRQSGDAVLPGGQALGLNAFSLK 

790 800 810 820 830 840 

850 860 870 880 890 900 

m285 . pep TRFQNDRIGILLDGGARFGRINADLGIANAFGGNMANAPLGGRITA3LPDLGALKPFLPA 
I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 
a285 TRFQNDRIGILLDGGARFGRINADLDIGNAFGGNMANAPLGGRITASLPDLGTLKPFLPA 

850 860 870 880 890 900 



910 920 930 940 950 960 

m285.pep AAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 
I I I 1 1 I I I I 1 1 I I I I I ! I I 1 1 1 1 1 I I I I I I I I I I I I I M I I I 1 1 ! 1 1 M I M 1 I I I I I I I 
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a285 AAQNITGSLNAAAOZGGRVGSPSVNAAVNGSSNYGKINGNITVGQSRSFDTAPLGGRLNL 
910 920 930 940 950 960 

970 983 990 1000 1010 1020 

m285.pep TVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADPHLGGSINGDKLYYRNQTQGIILDNG 

I | | M I I I I I I I I I : II I I I I I I I I t I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I 
a 285 TVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADPHLGGSINGDKLYYRNQTQGIILDNG 

970 983 990 1000 1010 1020 

1030 1043 1050 1060 1070 1080 

m2 85 . pep SLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENSGPDVDIGAVFDKYRILSRPNRRLTV 

II M 1 1 1 1 1 1 1 II 1 : 1 1 I 1 1 1 1 1 I 1 1 II 1 1 1 M I 1 1 l I 1 1 1 1 1 1 1 II I II I 1 1 1 1 1 I l i I 

a 285 SLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENSGPDVDIGAVFDKYRILSRPNRRLTV 
1030 1043 1050 1060 . 1070 1080 

109C 1103 1110 1120 1130 1140 

m2 85 . pep SGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMPSVGDDVWLGEVKKEAAAPLPVNMN 
I || I | | | | II II I I I I I I I I I I I I I I I I II I I I II I I II I I I I I I I I I I I I I I I I I I I I I 
a 285 SGNTRLRYS PQKGI S VTGMI KTDQGL FGSQKS SMP S VGDDVWLGE VKKEAAAPLPVNMN 

109C 1103 1110 1120 1130 1140 

1150 1163 1170 1180 1190 1200 

m285.pep LTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVRGVGTVRVIKGRYKAYGQDLDITKGT 
I | I M I I I I II I I I ! I I ! I I I I I I I I I II I I I I I I I I I I I I I I M I II I I I II I I I I I I I 
a 285 LTLDLNDGIRFAGYGADVTIGGKLTLTAQSGGSVRGVGTVRVIKGRYKAYGQDLDITKGT 

1150 1163 1170 1180 1190 1200 

121C 1223 1230 1240 1250 1260 

m285.pep VSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNSPRITLTANEPMSEKDKLSWLILNRA 

1 1 1 1 1 1 1 1 1 1 ll 1 1 i l ll 1 1 1 1 1 1 1 1 1 i [ II II 1 1 1 1 1 1 1 i 1 1 II I II 1 1 1 1 1 1 l II 1 1 1 

a 285 VSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNSPRITLTANEPMSEKDKLSWLILNRA 
1210 1223 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

m285 . pep G S G S SG DNAAL S AAAG ALLAGQ I NDR I G LV D D LG FT S KR S RN AQT GE LN P AEQVLT VGKQ 

M I ! I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I 
a285 GSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGFTSKRSRNAQTGELNPAEQVLTVGKQ 

1270 1280 1290 1300 1310 1320 

1330 1340 1350 1360 1370 1380 

m285.pep LTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRFSGSDKK 
I || || I I I I I II I I I 1 I I I I I I I I I II II I I I I I I I I I I I II I I I I I I I I II I I I I I I I 
a 285 LTGKLYIGYEYSISSAEQSVKLIYRLTRAIQAVARIGSRSSGGELTYTIRFDRFSGSDKK 

1330 1340 1350 1360 1370 1380 



1390 

m2 8 5 . pep DSAGNGKGKX 
I I I I I: I I I I 
a285 DSAGNSKGKX 
1390 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 187>: 

g285-l.seq 

1 CTGAAGC7GT CGGCGGCACT GCTGTCTGTC CTGATTTTGG CAGTATGTTT 

51 CCTCGGC7GG ATCGCCGGTA CGGAAGCAGG TTTGCGCTTC GGGCTGTACC 

101 AAATCCCGTC CTGGTTCGGC GTAAACATTT CCTCCCAAAA CCTCAAAGGC 

151 ACACTGCTCG ACGGCTTCGA CGGCGACAAC TGGTCGATAG AAACCGAGGG 

201 GGCAGACCTT AAAATCAGCC GCTTCCGCTT CGCGTGGAAA CCGTCCGAAC 

251 TGATGCGCCG CAGCCTGCAC ATCACCGACA TCTCCGCCGG CGACATCGCC 

301 ATCGTAACCA AACCGACTCC GCCTAAAGAA GAACGCCCGC CTCAAGGCCT 

351 GCCCGACAGC ATAGACCTGC CCGCCGCCGT CTATCTCGAC CGCTTCGAGA 

401 CGGGCAAAAT CAGCATGGGC AAAACCTTTG ACAAACAAAC CGTCTATCTC 

4 51 GAACGCCTCA ACGCGGCATA CCGTTACGAC CGTAAAGGGC ACCGCCTCGA 

501 CCTGAAGGCC GCCGACACGC CGTGGAGCAG TTCGTCGGGG TCAGCCTCGG 
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551 TCGGCTTGAA AAAACCGTTT GCCCTCGATA CCGCCATTTA CACCAAAGGC 

601 GGATTCGAAG GCGAAACCAT ACACAGTACG GCGCGGCTGA GCGGCAGCCT 

651 GAAGGATGTG CGCGCCGAAC TGACGATCGA CGGCGGCAAT ATCCGCCTCT 

701 CGGGAAAATC CGTCATCCAC CCGTTTGCCG AATCATTGGA TAAAACATTG 

751 GAAGAAGTAC TGGTCAAAGG ATTCAACATC AATCCGTCCG CCTTCGTGCC 

801 TTCCCTGCCC GATGCCGGGC TGAATTTCGA CCTGACCGCC ATCCCGTCGT 

851 TTTCAGACGG CATCGCGCTG GAAGGCTCGC TCGATTTGGA AAACACCAAA 

901 GCCGGCTTTG CCGACCGCAA CGGCATCCCC GTCCGTCAGG TTTTGGGCGG 

951 CTTTGTCATC CGGCAGGACG GCACGGTGCA TATCGGCAAT ACGTCCGCCG 

1001 CCCTGCTCGG ACGGGGCGGC ATCAGGCTGT CGGGCAAAAT CGACACCGAA 

1051 AAAGACATCC TTGATTTAAA TATAGGCATC AACTCCGTCG GCGCGGAAGA 

1101 CGTGCTGCAA ACCGCGTTCA AAGGCAGGTT GGACGGCAGC ATCGGCATCG 

1151 GCGGCACGAC CGCCTCGCCC AAAATCTCTT GGCAACTCGG CACCGGCACG 

1201 GCACGCACGG ACGGCAGCCt cgcCATCGCA AGCGAcCCCG CAAACGAACA 

1251 GCGGAAACTG GTGTTCGACA CCGTCAACAT CTCCGCCGGG GAAGGCAGCC 

1301 TGACCGCGCA AGGCTATCTC GAGCTGTTTA AAGACCGCCT GCTCAAGCTG 

1351 GACATCCGTT CCCGCGCATT CGACCCTTCG CGCATCGATC CGCAATTTCC 

1401 GGCAGGCgat atCAACGGTT CGATTCATCT TGCCGGTGAA CTGGCAAAAG 

1451 AGAAATTTAC GGGCAAAATG CGTTTTTTGC CCGGTACGTT CAACGGCGTG 

1501 CCGATTGCCG GCAGCGCCGA CATTGTTTAC GAGTCCCGCC ACCTTCCGCG 

1551 CGCCGCCGTC GATTTGCGGT TGGGGCGGAA CATCGTCAAA ACAGACGGCG 

1601 GCTTCGGCAA AAAAGGCGAC CGGCTTAACC TCAATATCAC CGCACCCGAT 

1651 TTATCCCGTT TCGGTTTCGG ACTCGCGGGG TCTTTAAATG TACGCGGACA 

1701 CCTTTCCGGC GATTTGGACG GCGGCATCCG AACCTTTGAA ACCGACCTTT 

1751 CCGGCACGGC GCGCAACTTA CACATCGGCA AAGCGGCAGA CATCCGTTCG 

1801 CTCGATTTTA CCCTCAAAGG CTCACCCGGC ACAAGCCGCC CGATGCGCGC 

1851 CGATATCAAG GGCGGCCGCC TTTCCCTGTC GGGCGGCGCG GCGGTTGTCG 

1901 ATACCGCCGG CCTGACGCTG GAAGGTACGG GCGCGCAGCA CCGCATCCGC 

1951 ACACACGCCG CCATGACGCT GGACGGCAAA CCGTTCAAAC TCGATTTGGA 

2001 CGCTTCAGGC GGCATCAACA GGGAACTTAC CCGATGGAAA GGCAGCATCG 

2051 GCATCCTCGA CATCGGCGGC GCATTCAACC TCAAGCTGCA AAACCGTATG 

2101 ACGCTCGAAG CCGGTGCGGA ACACGTGGCG GCAAGTGCGG CAAATTGGCA 

2151 GGCAATGGGC GGCAGCCTCA ACCTGCAACA CTTTTCTTGG GACAGGAAAA 

2201 CCGGCATATC GGCAAAAGGC GGCGCACGCG GCCTGCACAT CGCCGAGTTG 

2251 CACAATTTCT TCAAACCGCC CTTCGAACAC AATCTGGTTT TAAACGGCGA 

2301 CTGGGATGTC GCCTACGGGC ACAACGCGCG CGGCTACCTC AATATCAGCC 

2351 GGCAAAGCGG CGATGCCGTA TTGCCCGGCG GGCAGGCTTT GGGTTTGAAC 

2401 GCATTTTCCC TGAAAACGCG CTTTCAAAAC GACCGCATCG GAATCCTGCT 

2451 TGACGGCGGC GCGCGTTTCG GACGGATTAA CGCCGATTTG GGCATCGGCA 

2501 ACGCCTTCGG CGGCAATATG GCAAATACAC CGCTCGGCGG CAGGATTACA 

2551 GCCTCCCTTC CCGACTTGGG CGCATTGAAG CCCTTTCTGC CCGCCGCCGC 

2601 GCAAAACATT ACCGGCAGCC TGAATGCCTC CGCGCAAATC GGCGGACGGG 

2651 TAGGCTCTCC GTCCGTCAAT GCCGCCGTCA ACGGTAGCAG CAACTACGGG 

2701 AAAATCAACG GCAATATCAC CGTCGGGCAA AGCCGCTCCT TCGATACCGC 

2751 ACCTTTGGGC GGCAGGCTCA ACCTGACCGT TGCCGATGCC GAAGCATTCC 

2801 GCAACTTCCT ACCGGTCGGA CAAACCGTCA AAGGCAGCCT GAATGCCGCC 

2851 GTAACCCTCG GCGGCAGCAT CGCCGACCCG CACTTGGGCG GCAGTATCAA 

2901 CGGCGACAAG CTCTATTACC GCAACCAAAC CCAAGGCATC ATCTTGGACA 

2951 ACGGCTCGCT GCGTTCGCAT ATTGCAGGCA GGAAATGGGT AATCGACAGC 

3001 CTGAAATTCC GGCACGAAGG GACGGCGGAA CTCTCCGGCA CGGTCAGCAT 

3051 GGAAAACAGC GTGCCCGATG TCGATATCGG CGCGGTGTTC GACAAATACC 

3101 GCATCCTGTC CCGCCCCAAC CGCCGCCTGA CGGTTTCCGG CAACACCCGC 

3151 CTGCGCTATT CGCCGCAAAA AGGCATATCC GTTACCGGTA TGATTAAAAC 

3201 TGATCAGGGG CTGTTCGGTT CGCAAAAATC CTCGATGCCG TCCGTCGGCG 

3251 ACGATGTCGT CGTATTGGGC GAAGTCAAGA AAGAGGCGGC GGCATCGCTC 

3301 CCCGTCAATA TGAACCTGAC TTTAGACCTC AATGACGGCA TCCGCTTCTC 

3351 CGGCTACGGC GCGGACGTTA CCATAGGCGG CAAACTGACC CTGACCGCGC 

3401 AACCGGGCGG AAATGTGCGT GGGGTGGGCA CGGTCCGCGT CATCAAAGGG 

3451 CGTTACAAAG CATACGGGCA GGATTTAGAC ATTACCAAAG GCACAGTCTC 

3501 CTTTGTCGGC CCGCTCAACG ACCCCAACCT GAACATCCGC GCCGAACGCC 

* 3551 GCCTTTCCCC CGTCGGTGCG GGCGTGGAAA TATTGGGCAG CCTCAACAGC 

3601 CCGCGCATTA CGCTGACGGC AAACGAACCG ATGAGTGAAA AAGACAAGCT 

3651 CTCCTGGCTC ATCCTCAACC GTGCCGGCAG CGGCAGCAGC GGCGACAATG 

3701 CCGCCCTGTC CGCAGCCGCA GGCGCGCTGC TTGCCGGGCA AATCAACGAC 

3751 CGCATCGGGC TGGTGGATGA TTTGGGCTTT ACCAGCAAGC GCAGCCGCAA 

3801 CGCGCAAACC GGCGAACTCA ACCCCGCCGA ACAGGTGCTG ACCGTCGGCA 

3851 AACAACTGAC CGGCAAACTC TACATCGGCT ACGAATACGG CATCTCCAGC 

3901 GCGGAACAGT CCGTCAAACT GATTTACCGG CTGACCCGCG CCATACAGGC 

3951 GGTTGCCCGT ATCGGCAGCC GTTCGTCGGG CGGCGAGCTG ACATACACCA 

4001 TACGTTTCGA CCGCCTCTTC GGTTCGGACA AAAAAGACTC CGCAGGAAAC 

4051 GGCAAAGGGA AATAA 

This corresponds to the amino acid sequence <SEQ ID 1 188; ORF 285-1 .ng>: 

g285-l.pep 
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1 LKLSAALLSV LILAVCFLGW IAGTEA GLRF GLYQIPSWFG VNISSQNLKG 

51 TLLDGFDGDN WSIETEGADL KISRFRFAWK PSELMRRSLH ITDISAGDIA 

101 IVTKPTPPKE ERPPQGLPDS IDLPAAVYLD RFETGKISMG KTFDKQTVYL 

151 ERLNAAYRYD RKGHRLDLKA ADTPWSSSSG SASVGLKKPF ALDTAIYTKG 

201 GFEGETIHST ARLSGSLKDV RAELTIDGGN IRLSGKSVIH PFAESLDKTL 

251 EEVLVKGFNI NPSAFVPSLP DAGLNFDLTA IPSFSDGIAL EGSLDLENTK 

301 AGFADRNGIP VRQVLGGFVI RQDGTVHTGN TSAALLGRGG IRLSGKIDTE 

351 KDILDLNISI NSVGAEDVLQ TAFKGRLDGS IGIGGTTASP KISWQLGTGT 

401 ARTDGSLAIA SDPANEQRKL VFDTVNISAG EGSLTAQGYL ELFKDRLLKL 

451 DIRSRAFDPS RIDPQFPAGD INGSIHLAGE LAKEKFTGKM RFLPGTFNGV 

501 PIAGSADIVY ESRHLPRAAV DLRLGRNIVK TDGGFGKKGD RLNLNITAPD 

551 LSRFGFGLAG SLNVRGHLSG DLDGGIRTFE TDLSGTARNL HIGKAADIRS 

601 LDFTLKGSPG TSRPMRADIK GGRLSLSGGA AVVDTAGLTL EGTGAQHRIR 

651 THAAMTLDGK PFKLDLDASG GINRELTRWK GSIGILDIGG AFNLKLQNRM 

701 TLEAGAEHVA ASAANWQAMG GSLNLQHFSW DRKTGI SAKG GARGLH1AEL 

751 HNFFKPPFEH NLVLNGDWDV AYGHNARGYL NISRQSGDAV LPGGQALGLN 

801 AFSLKTRFQN DRIGILLDGG ARFGRINADL GIGNAFGGNM ANTPLGGRIT 

851 ASLPDLGALK PFLPAAAQNI TGSLNASAQI GGRVGSPSVN AAVNGSSNYG 

901 KINGNITVGQ SRSFDTAPLG GRLNLTVADA EAFRNFLPVG QTVKGSLNAA 

951 VTLGGSIADP HLGGSINGDK LYYRNQTQGI ILDNGSLRSH IAGRKWVIDS 

1001 LKFRHEGTAE LSGTVSMENS VPDVDIGAVF DKYRILSRPN RRLTVSGNTR 

1051 LRYSPQKGIS VTGMIKTDQG LFGSQKSSMP SVGDDVWLG EVKKEAAASL 

1101 PVNMNLTLDL NDGIRFSGYG ADVTIGGKLT LTAQPGGNVR GVGTVRVIKG 

1151 RYKAYGQDLD ITKGTVSFVG PLNDPNLNIR AERRLSPVGA GVEILGSLNS 

1201 PRITLTANEP MSEKDKLSWL ILNRAGSGSS GDNAALSAAA GALLAGQIND 

1251 RIGLVDDLGF 7SKRSRNAQT GELNPAEQVL TVGKQLTGKL YIGYEYGISS 

1301 AECSVKLIYR LTRAIQAVAR IGSRSSGGEL TYTIRFDRLF GSDKKDSAGN 

1351 GKGK* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1 189>: 

m285-l. seq 

1 CTGAAGCTGT CGGCGGCACT GCTGTCTGTC CTGATTTTGG CAGTATGTTT 

51 CCTCGGCTGG CTCGCCGGTA CGGAAGCAGG TTTGCGCTTC GGGCTGTACC 

101 AAATCCCGTC TTGGTTCGGC GTAAACATTT CCTCCCAAAA CCTCAAAGGC 

151 ACGCTGCTCG ACGGCTTCGA CGGCGACAAC TGGTCGATAG AAACCGAGGG 

201 GGCAGACCTT AAAATCAGCC GCTTCCGCTT CGCGTGGAAA CCGTCCGAAC 

251 TGATGCGCCG CAGCCTGCAC ATTACCGAAA TTTCCGCCGG CGACATCGCC 

301 ATCGTTACCA AACCGACTCC GCCTAAAGAA GAACGCCCGC CGCTCAGCCT 

351 TCCCGACAGC ATAGACCTGC CTGCCGCCGT CTATCTCGAC CGCTTCGAGA 

401 CGGGCAAAAT CAGCATGGGC AAAGCCTTTG ACAAACAAAC CGTCTATCTC 

451 GAACGGCTGG ATGCTTCATA CCGTTACGAC CGCAAAGGAC ACCGCCTTGA 

501 CCTGAAGGCC GCCGACACGC CGTGGAGCAG TTCGTCGGGG GCGGCCTCGG 

553 TCGGCTTGAA AAAACCGTTT GCCCTCGATA CCGCCATTTA CACCAAAGGC 

601 GGACTCGAAG GCAAAACCAT ACACAGTACG GCTCGGCTGA GCGGCAGCCT 

651 GAAGGATGTG CGCGCCGAAC TGGCGATCGA CGGCGGCAAT ATCCGCCTCT 

701 CGGGAAAATC CGTCATCCAC CCGTTTGCCG AATCATTGGA TAAAACATTG 

751 GAAGAAGTAC TGGTCAAAGG GTTCAACATC AATCCGGCCG CCTTCGTGCC 

801 TTCCCTGCCC GATGCCGGAC TGAATTTCGA CCTGACCGCC ATCCCGTCGT 

851 TTTCAGACGG CATCGCGCTG GAAGGTTCGC TCGATTTGGA AAACACCAAA 

901 GCCGGCTTTG CCGACCGCAA CGGCATCCCC GTCCGTCAGG TnTAGGCGG 

951 CTTTGTCATC CGGCAGGACG GCACGGTGCA TATCGGCAAT ACGTCCGCCG 

1001 CCCTGCTCGG ACGGGGCGGC ATCAGGCTGT CGGGCAAAAT CGACACCGAA 

1051 AAAGACATCC TCGATTTAAA TATAGGCATC AACTCCGTCG GCGCGGAAGA 

1101 CGTACTGCAA ACCGCGTTCA AAGGCAGGTT GGACGGCAGC ATCGGCATCG 

1151 GTGGCACGAC CGCCTCGCCC AAAATCTCTT GGCAACTCGG CATCGGCACG 

1201 GCGCGCACGG ACGGCAGCCT CGCCATTGCA AGCGACCCAG CAAACGGACA 

1251 GCGGAAACTG GTGCTCGACA CCGTCAACAT CGCCGCCGGG CAAGGCAGCC 

1301 TGACCGCGCA AGGCTATCTC GAGCTGTTTA AAGACCGCCT GCTCAAGCTG 

1351 GACATCCGTT CCCGCGCATT CGACCCTTCG CGCATCGATC CGCAACTTCC 

1401 GGCAGGCAAT ATCAACGGCT CAATAAACCT TGCCGGCGAA CTGGCAAAAG 

1451 AGAAATTCAC AGGCAAAATG CGGTTTTTAC CCGGCACGTT CAACGGCGTA 

1501 CCGATTGCCG GCAGTGCCGA CATTGTTTAC GAGTCCCGCC ACCTTCCGCG 

1551 TGCCGCCGTC GATTTGCGGC TGGGGCGGAA CATTATTAAA ACAGACGGCG 

1601 GCTTCGGCAA AAAAGGCGAC CGGCTTAACC TCAATATCAC CGCACCCGAT 

1651 TTATCCCGTT TCGGTTTCGG ACTCGCGGGG TCTTTAAATG TACGCGGACA 

1701 CCTTTCCGGT GATTTGGACG GCGGCATCCG AACCTTTGAA ACCGACCTTT 

1751 CCGGCGCGGC GCGCAACCTG CACATCGGCA AGGCGGCAGA CATCCGTTCG 

1801 CTCGATTTCA CGCTCAAAGG TTCGCCCGAC ACAAGCCGCC CGATACGCGC 

1851 CGACATCAAA GGCAGCCGCC TTTCGCTGTC GGGCGGAGCG GCGGTTGTCG 

1901 ATACCGCCGA CCTGATGCTG GACGGCACGG GCGTGCAGCA CCGCATCC3C 

195] ACACACGCCG CCATGACGCT GGATGGCAAA CCGTTCAAAT TCGATTTGGA 

2001 CGCTTCAGGC GGCATCAACA GGGAACTTAC CCGATGGAAA GGCAGCATCG 

2051 GCATCCTCGA CATCGGCGGC GCATTCAACC TCAAGCTGCA AAACCGTATG 

2101 ACGCTCGAAG CCGGTGCGGA ACGCGTGGCG GCAAGTGCGG CAAATTGGCA 
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2151 GGCAATGGGC GGCAGCCTCA ACCTGCAACA CTTTTCTTGG GATAAAAAAA 

2201 CCGGCATATC GGCAAAAGGC GCCCCACACG GTCTCCATAT CGCCGAGTTG 

2251 CACAATTTCT TCAAACCGCC CTTCGAACAC AATCTGGTTT TAAACGGCGA 

2301 CTGGGATGTC GCCTACGGGC GCAACGCGCG CGGCTACCTC AATATCAGCC 

2351 GGCAAAGCGG CGATGCCGTA TTGCCCGGCG GGCAGGCTTT GGGTTTGAAC 

2401 GCATTTTCCC TGAAAACGCG CTTTCAAAAC GACCGCATCG GAATCCTGCT 

2451 TGACGGCGGC GCGCGTTTCG GGCGGATTAA CGCCGATTTG GGCATCGCCA 

2501 ACGCCTTCGG CGGCAATATG GCAAArGCAC CGCTCGGCGG CAGGATTACC 

2551 GCCTCCCTTC CCGACTTGGG CGCATTGAAG CCCTTTCTGC CCGCCGCCGC 

2601 GCAAAACATT ACCGGCAGCC TGAATGCCGC CGCGCAAATC GGCGGACGGG 

2651 TAGGCTCTCC GTCCGTCAAT GCCGCCGTCA ACGGCAGCAG CAACTACGGG 

2701 AAAATCAACG GCAACATCAC CGTCGGGCAA AGCCGCTCTT TCGATACCGC 

2751 GCCTTTGGGC GGCAGGCTCA ACCTGACCGT TGCCGATGCC GAAGTATTCC 

2801 GCAACTTCCT ACCGGTCGGA CAAACCGTCA AAGGCAGCCT GAATGCCGCC 

2851 GTAACCCTCG GCGGCAGCAT CGCCGATCCG CACTTGGGCG GCAGCATCAA 

2901 CGGCGACAAA CTCTATTACC GCAACCAAAC CCAAGGCATC ATCTTGGACA 

2951 ACGGCTCGCT GCGTTCGCAT ATCGCGGGCA GGAAATGGGT AATCGACAGC 

3001 CTGAAATTCC GGCACGAAGG GACGGCGGAA CTCTCCGGTA CGGTCGGTAT 

3051 GGAAAACAGC GGACCCGATG TCGATATCGG CGCGGTGTTC GACAAATACC 

3101 GCATCCTGTC CCGCCCCAAC CGCCGCCTGA CGGTTTCCGG CAACACCCGC 

3151 CTGCGCTATT CGCCGCAAAA AGGCATATCC GTTACCGGGA TGATTAAAAC 

3201 GGATCAGGGG CTGTTCGGTT CGCAAAAATC CTCGATGCCG TCCGTCGGCG 

3251 ACGATGTCGT CGTATTAGGC GAAGTCAAAA AAGAGGCGGC GGCACCGCTC 

3301 CCCGTCAATA TGAACCTGAC TTTAGACCTC AATGACGGCA TCCGCTTCGC 

3351 CGGCTACGGC GCGGACGTTA CCATAGGCGG CAAACTGACC CTGACCGCCC 

3401 AATCGGGCGG AAGCGTACGG GGCGTGGGCA CGGTCCGCGT CATCAAAGGG 

3451 CGTTATAAGG CATACGGGCA GGATTTGGAC ATTACCAAAG GCACGGTCTC 

3501 CTTTGTCGGC CCGCTCAACG ATCCCAACCT CAACATCCGC GCCGAACGCC 

3551 GCCTTTCCCC CGTCGGTGCG GGCGTGGAAA TATTGGGCAG CCTCAACAGC 

3601 CC GC GC ATT A CGCTGACGGC AAACGAACCG ATGAGTGAAA AAGACAAGCT 

3651 CTCTTGGCTC ATCCTCAACC GCGCCGGCAG CGGCAGCAGC GGCGACAATG 

3701 CCGCCCTGTC TGCAGCCGCA GGTGCGCTGC TTGCCGGGCA AATCAACGAC 

3751 CGCATCGGGC TGGTGGATGA TTTGGGCTTT ACCAGCAAGC GCAGCCGCAA 

3801 CGCGCAAACC GGCGAACTCA ACCCCGCCGA ACAGGTGCTG ACCGTCGGCA 

3851 AACAACTGAC CGGCAAACTC TACATCGGCT ACGAATACAG CATCTCCAGC 

3901 GCGGAACAGT CCGTCAAACT GATTTACCGG CTGACCCGCG CCATACAGGC 

3951 GGTTGCCCGT ATCGGCAGCC GTTCGTCGGG CGGCGAGCTG ACATACACCA 

4001 TACGTTTCGA CCGCTTCTCC GGTTCGGACA AAAAAGACTC CGCCGGAAAC 

4051 GGCAAAGGAA AATAA 

This corresponds to the amino acid sequence <SEQ ID 1 190; ORF 285- 1>: 

m285-i.pep 

1 LKLSAALLSV LILAVCFLGW LAGTEA GLRF GLYQIPSWFG VNISSQNLKG 

51 TLLDGFDGDN WSIETEGADL KISRFRFAWK PSELMRRSLH ITEISAGDIA 

101 IVTKPTPPKE ERPPLSLPDS IDLPAAVYLD RFETGKISMG KAFDKQTVYL 

151 ERLDASYRYD RKGHRLDLKA ADTPWSSSSG AASVGLKKPF ALDTAIYTKG 

201 GLEGKTIHST ARLSGSLKDV RAELAIDGGN IRLSGKSVIH PFAESLDKTL 

251 EEVLVKGFNI NPAAFVPSLP DAGLNFDLTA IPSFSDGIAL EGSLDLENTK 

301 AGFADRNGIP VROVLGGFVI RQDGTVHIGN TSAALLGRGG IRLSGKIDTE 

35i KDILDLNIGI NSVGAEDVLQ TAFKGRLDGS IGIGGTTASP KISWQLGIGT 

401 ARTDGSLAIA SDPANGQRKL VLDTVNIAAG QGSLTAQGYL ELFKDRLLKL 

451 UIKSRAFDPS R1DPQLPAGN INGSINLAGE LAKEKFTGKM RFLPGTFNGV 

501 PIAGSADIVY ESRHLPRAAV DLRLGRNIIK TDGGFGKKGD RLNLNITAPD 

551 LSRFGFGLAG SLNVRGHLSG DLDGGIRTFE TDLSGAARNL HIGKAADIRS 

601 LDFTLKGSPD TSRPIRADIK GSRLSLSGGA AWDTADLML DGTGVQHRIR 

651 THAAMTLDGK PFKFDLDASG GINRELTRWK GSIGILDIGG AFNLKLQNRN 

701 TLEAGAERVA ASAANWQAMG GSLNLQHFSW DKKTGISAKG GAHGLHIAEL 

751 HNFFKPPFEH NLVLNGDWDV AYGRNARGYL NISRQSGDAV LPGGQALGLN 

801 AFSLKTRFQN DRIGILLDGG ARFGRINADL GIANAFGGNM ANAPLGGRIT 

851 ASLPDLGALK PFLPAAAQNI TGSLNAAAQI GGRVGSPSVN AAVNGSSNYG 

901 KINGNITVGQ SRSFDTAPLG GRLNLTVADA EVFRNFLPVG QTVKGSLNAA 

951 VTLGGSIADP HLGGSINGDK LYYRNQTQGI ILDNGSLRSH IAGRKWVIDS 

1001 LKFRHEGTAE LSGTVGMENS GPDVDIGAVF DKYRILSRPN RRLTVSGNTR 

1051 LRYSPQKGIS VTGMIKTDQG LFGSQKSSNP SVGDDVVVLG EVKKEAAAPL 

1101 PVNMNLTLDL NDGIRFAGYG ADVTIGGKLT LTAQSGGSVR GVGTVRVIKG 

1151 RYKAYGODLD ITKGTVSFVG PLNDPNLNIR AERRLS PVGA GVEILGSLNS 

1201 PR1TLTANEP MSEKDKLSWL ILNRAGSGSS GDNAALSAAA GALLAGQIND 

1251 RIGLVDDLGF TSKRSRNAOT GELNPAEQVL TVGKQLTGKL YIGYEYSISS 

1301 AEQSVKLIYR LTRAIQAVAR IGSRSSGGEL TYTI RFDRFS GSDKKDSAGN 

1351 GKGK* 

g2B5-l/m285-l 96.5% identity in 1354 aa overlap 
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g285-l.pep LKLSAALLSVLILAVCFLGWIAGTEAGLRFGLYQIPSWFGVNISSQNLKGTLLDGFDGDN 

I I I I M I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

m285-l LKLSAALLSVLILAVCFLGWLAGTEAGLRFGLYQIPSWFGVNISSQNLKGTLLDGFDGDN 
10 20 30 40 50 60 



70 80 90 100 110 120 

g285-l . pep WSIETEGADLKISRFRFAWKPSELMRRSLHITDISAGDIAIVTKPTPPKEERPPQGLPDS 
I I I I i I I I I i I I I I I f I II II I 1 I I I I I I I II : I I I I I I I I I I M I I II II I I I Mill 
m285-l WSIETEGADLKISRFRFAWKPSELMRRSLHITEISAGDIAIVTKPTPPKEERPPLSLPDS 

70 80 90 100 110 120 

130 140 150 160 170 180 

g285-l .pep IDLPAAVYLDRFETGKISMGKTFDKQTVYLERLNAAYRYDRKGHRLDLKAADTPWSSSSG 
I I I I I I I I I I I! I I I I I I 1 I I :l I I I I I i I ! I I : I : I I I I I I I I I I I I I I I I I I I I t I I I 
m285-l IDLPAAVYLDRFETGKISMGKAFDKQTVYLERLDASYRYDRKGHRLDLKAADTPWSSSSG 

130 140 150 160 170 180 



190 200 210 220 230 240 

g2 8 5-1. pep SASVGLKKPFALDTAI YTKGGFEGETIHSTARLSGSLKDVRAELTI DGGNIRLSGKSVIH 
: I I I I I ! I I I I I I I I I I I I 1 1 : I I : I 1 1 I I I 1 I I I I I I I I I I I I : I I II I I I I I I I I I I I 
m28 5-1 AASVGLKKPFALDTAI YTKGGLEGKTIHSTARLSGSLKDVRAELAI DGGNIRLSGKSVIH 

190 200 210 220 230 240 



250 260 270 280 290 300 

g28 5- 1 . pep PFAESLDKTT.EEVLVKGFHINPSAFVPSLPDAGLNFDLTAI PSFSDGIALEGSLDLENTK 
I I I I I I I I t I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
m285-l PFAES LDKTLEEVLVKGFHINPAAFV PS LPDAGLNFDLTA I PSFSDGIALEGSLDLENTK 

250 260 270 280 290 300 

310 320 330 340 350 360 

g285-l .pep AGFADRNGIPVRQVLGGFVT RQDGTVHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
m285-l AGFADRNGIPVRQVLGGFVIRQDGTVHIGNTSAALLGRGGIRLSGKIDTEKDILDLNIGI 

310 320 330 340 350 360 



370 380 390 400 410 420 

g285-l .pep NSVGAEDVLQTAFKGRLDGSIGIGGTTASPKISWQLGTGTARTDGSLAIASDPANEQRKL 
I I I 1 I I I I I I I I I I I I I M I I I I I I I i I I I II I I I I I I I I I I I I I I I I I I I I I I I I II 
m285-l NSVGAEDVLQTAFKGRLDGSIGIGGTTASPKISWQLGIGTARTDGSLAIASDPANGQRKL 

370 380 390 400 410 420 



430 440 450 460 470 480 

g285-l .pep VFDTVNISAGEGSLTAQGYLELFKDRLLKLDIRSRAFDPSRIDPQFPAGDINGSIHLAGE 
I : I I I I I : I I : I I I I I I I ■ I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I : I I I I I : I I I I 
m285-l VLDTVNIAAGQGSLTAQGYLELFKDRLLKLDIRSRAFDPSRIDPQLPAGNINGSINLAGE 

430 440 450 460 470 480 



490 500 510 520 530 540 

g285-l .pep LAKEKFTGKMRFLPGTFNGVPIAGSADIVYESRHLPRAAVDLRLGRNIVKTDGGFGKKGD 
I I I I ! I I II I M I I I I I I M I I I I I I I I 1 I I I I I I I ! ! I I I I I I I I I I : I I I I I I I I I I I 
m285-l LAKEKFTGKMRFLPGTFNGVPIAGSADIVYESRHLPRAAVDLRLGRNIIKTDGGFGKKGD 

490 500 510 520 530 540 



550 560 570 580 590 600 

g285-l . pep RLNLNITAPDLSRFGFGLAGSLNVRGHLSGDLDGGIRTFETDLSGTARNLHIGKAADIRS 
I I I I i II I I I I I I I I I I I i I I I I II I I I I I I I I I I I I II I i I I I I : I I I I I I I I M ! I I I 
m2 8 5- 1 RLNLNITAPDLSRFGFGLAGSLNVRGHLSSDLDGGI RTFETDLSGAARHLHIGKAADI RS 

550 560 570 580 590 600 



610 620 630 640 650 660 

g285-l.pep LDFTLKG S PGT S RPMRADI KGGRL S L S GG AA V V DT AG LTLEGTG AQHR I RTHAAMT LDGK 
I I I I I I I I I I I I I : I II ! I I : I I I I I I I II I I I I I I I : I I I : I I I I I I I I I I I I I I I 
m285-l LDFTLKGSPDTSRPIRADIKGSRLSLSGGAAWDTADLMLDGTGVQHRIRTHAAMTLDGK 

610 620 630 640 650 660 

670 680 690 700 710 720 

g285-l .pep PFKLDLDASGGINRELTRWKGSIC-ILDIGGAFNLKLQNRMTLEAGAEHVAASAANWQAMG 
I I I : I III I I I I I I 1 I I M I I I I I I i I I I I I i I I I | I | | | | | | | I | l : | | | | | | j | | | | | 
m285-l PFKFDLDASGGINRELTRWKGSIGILDIGGAFNLKLQNRMTLEAGAERVAASAANWQAMG 
670 680 690 700 710 720 



730 74U 750 760 770 780 

g285-l . pep GSLNLQHFSWDRKTGISAKGGARGLHIAELHNFFKPPFEHNLVLNGDWDVAYGHNARGYL 
I I I M I I II I I : I I M I I I I I I : I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I :|| | | I I 
m285-l GSLNLQHFSWDKKTGISAKGGAHGLHIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYL 
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730 740 750 760 770 780 

790 800 810 820 820 840 

g2 85-l .pep N I S RQS G DA V L PGGQA LG LN AFS LK T R FQN D RI G I LL DG G ARFG R I N A DLG I GNA FGGN M 
I I I I I I I I I I I I I I I I I I I I I I I I I i I ! I • I I I I I I I I I I I I I I I t I I 1 I I I : I I f I I 1 I 
m285-l NISRQSGDAVLPGGQALGLNAFSLKTRFQNDRIGILL.DGGARFGRINADLGIANAFGGNM 
790 BOO 810 820 830 840 

850 860 870 880 890 900 

g2 85-1. pep ANTPIX5GRITASLPDLGALKPFLPAAAQNITGSLNASAQIGGRVGSPSVNAAVN3SSWYG 
I I : I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I r I I ! I I I I I I I I I I II I I I I I I I I 
m285-l ANAPLGGRITASLPDLGALKPFLPAAAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYG 

850 860 870 880 890 900 

910 920 930 940 950 960 

g285-l . pep KINGNITVGQSRSFDTAPLGGRLNLTVADAEAFRNFLPVGQTVKGSLNAAVTLGGSIADP 
I } I I I I I I I I I I I I I I I i I I I I I i I I I I I I I : I M II I I I I I I I I I I I I I II I I I I I I I S 
m285-l KINGNITVGQSRSFDTAPLGGRLNLTVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADP 

910 920 930 940 950 960 

970 980 990 1000 1010 1020 

g285-l.pcp HLGGSINGDKLYYRNQTQGIILDNGSLRSHIAGRKWVIDSLKFRHEGTAELSGTVSMENS 
! I I I I I I I I I I (I I I I I I II I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I I I : I I II 
m2 8 5- 1 HLGGSINGDKLYYRNOTQGI ILDNGSLRSHI AGRKWVI DSLKFRHEGTAELSGTVGMENS 

970 980 990 1000 1010 1020 

1030 1040 1050 1060 1070 1080 

g2 85-1. pep VPDVDIGAVFDKYRILSRPNRRLTVSGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMP 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
m2 8 5 - 1 GPDVDI GAVFDKYRILSRPNRRLTVSGNTRLRYSPQKG I S VTGMIKTDQGLFGSQKSSMP 

1030 1040 1050 1060 1070 1080 

1090 1100 1110 1120 1130 1140 

g2 8 5-1. pep SVGDDVVVLGEVKKEAAASLPVNMNLTLDLNDGIRFSGYGADVTIGGKLTLTAQPGGNVR 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I 1 I I I II I I I I I I II I I : I I 
m2 8 5- 1 SVGDDVVVLGEVKKEAAAPLPVNMNLTLDLNDGI RFAG YGADVT IGGKLTLTAQSGGSVR 

1090 1100 1110 1120 1130 1140 

1150 1160 1170 1180 1190 1200 

g2 85-1. pep GVGTVRVIKGRYKAYGQDLDITKGTVSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNS 
I I I I t I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I II I I I I I I I I ! I I I I I I 1 1 i I I I I 
m285-l GVGTVRVIKGRYKAYGQDLDITKGTVSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNS 

1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

g285-l.pep PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGF 
I I I I ! I ! I I I I I I M I i i I I I I I i I I I I I II I I I I I I I ! I I I I I 1 I t I I I I I Ml I I I I I 
IU285-1 PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRI GLVDDLGF 

1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

g285-l.pep TSKRSRNAQTGELNPAEQVLTVGKQLTGKLYIGYEYGISSAEQSVKLIYRLTRAIQAVAR 
I I I I I I I M 1 1 I 1 I I I I I I I 1 1 I 1 1 I I 1 1 1 1 I I 1 1 I : I i 1 1 1 1 1 1 1 1 1 I I I I 1 1 1 1 1 1 1 I 
m285-l TSKRSRNAQTGELNPAEQVLTVGKQLTGKLY1GYEYSISSAEQSVKLIYRLTRAIQAVAR 

1270 1280 1290 1300 1310 1320 

1330 1340 1350 

g285-l -pep I GSRS SGGELT YT IRFDRLFGS DKKDS AGNGKGK 
I I I I I I I I I I I I I I I I I I : I I I I i t I S I I I 1 i I 
m285-l IGSRSSGGELTYTIRFDRFSGSDKKDSAGNGKGKX 

1330 1340 1350 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 191>: 

a285-l.seq 

1 CTGAAGCTGT CGGCGGCACT GCTGTCTGTT CTGATTTTGG 

51 CCTCGGCTGG CTCGCCGGCA CGGAAGCGGG TTTGCGCTTC 

101 AAATCCCGTC TTGGTTCGGC GTAAACATTT CCTCCCAAAA 

151 ACGCTGCTCG ACGGCTTCGA CGGCGACAAC TGGTCGATAG 

201 GGCAGACCTT AAAATCAGCC GCTTCCGCTT CGCGTGGAAA 

251 TGATGCGCCG CAGCCTGCAC ATTACCGAAA TTTCCGCCGG 

301 ATCGTTACCA AACCGACTCC GCCTAAAGAA GAACGCCCGC 

351 TCCCGACAGC ATAGACCTGC CTGCCGCCGT CTATCTCGAC 

401 CGGGCAAAAT CAGCATGGGC AAAGCCTTTG ACAAACAAAC 

4 51 GAACGGCTGG ATGCTTCATA CCGTTACGAC CGCAAAGGAC 



CAGTATGTTT 
GGGCTGTACC 
CCTCAAAGGC 
AAACCGAGGG 
CCGTCCGAAC 
CGACATCGCC 
CGCTCAGCCT 
CGCTTCGAGA 
CGTCTATCTC 
ACCGCCTCGA 
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501 CCTGAAGGCT GCCGACACGC CGTGGAGCAG TTCGTCGGGG TCAGCCTCGG 

551 TCGGCTTGAA AAAACCGTTT GCCCTCGATA CCGCCATTTA CACCAAAGGC 

601 GGACTCGAAG GCAAAACCAT ACACAGTACG GCTCGGCTGA GCGGCAGCCT 

651 GAAGGATGTG CGCGCCGAAC TGGCGATCGA CGGCGGCAAT ATCCGCCTCT 

701 CGGGAAAATC CGTCATCCAC CCGTTTGCCG AATCATTGGA TAAAACATTG 

751 GAAGAAG7AC TGGTCAAAGG GTTCAACATC AATCCGTCCG CCTTCGTGCC 

801 TTCCCTGCCC GATGCCGGGC TGAATTTCGA cctgaccgcc ATCCCGTCGT 

851 TTTCAGACGG CATCGCGCTG GAAGGCTCGC TCGATTTGGA AAACACCAAA 

901 GCCGGCTTTG CCGACCGCAA CGGCATCCCC GTCCGTCAGG TTTTAGGCAG 

951 CTTTGTCATC CGGCAGGACG GCACGGTGCA TATCGGCAAT ACGTCCGTCG 

1001 CCCTGCTCGG ACGGGGCGGC ATCAGGCTGT CGGGCAAAAT CGACACCGAA 

1051 AAAGACATCC TCGATTTAAA TATAGGCATC AACTCCGTCG GCGCGGAAGA 

1101 CGTACTGCAA ACCGCGTTCA AAGGCAGGTT GGACGGCAGC ATCGGCATCG 

1151 GTGGCACGAC CGCCTCGCCC AAAATCTCTT GGCAACTCGG CATCGGCACG 

1201 GCGCGCACGG ACGGCAGCCT CGCCATTGCA AGCGACCCCG CAAACGGACA 

1251 GCGGAAACTG GTGCTCGACA CCGTCAACAT CGCCGCCGGG CAAGGCAGCC 

1301 TGACCGCGCA AGGCTATCTC GAGCTGTTTA AAGACCGCCT GCTCAAGCTG 

1351 GACATCCGTT CCCGCGCATT CGACCCTTCG CGCATCGATC CGCAACTTCC 

1401 GGCAGGCAAT ATCAACGGCT CAATAAACCT TGCCGGCGAA CTGGCAAAAG 

14 51 AGAAATTCAC AGGCAAAATG CGGTTTTTAC CCGGCACGTT CAACGGCGTA 

1.501 CCGATTGCCG GCAGTGCCGA CATTGTTTAC GAGTCCCGCC ACCTTCCGCG 

1551 TGCCGCCGTC GATTTGCGGC TGGGGCGGAA CATTATTAAA ACAGACGGCG 

1601 GCTTCGGCAA AAAAGGCGAC CGGCTTAACC TCAATATCAC CGCACCCGAT 

1651 TTATCCCGTT TCGGTTTCGG ACTCGCGGGG TCTTTAAATG TACGCGGACA 

1701 CCTTTCCGGC GATTTGGACG GTGGCATCCG AACCTTTGAA ACCGACCTTT 

1751 CCGGCGCGGC GCGCAACCTG CACATCGGCA AGGCGGCAGA CATCCGTTCG 

1801 CTCGATTTCA CGCTCAAAGG TTCGCCCGAC ACAAGCCGCC CGATACGCGC 

1851 CGACATCAAA GGCAGCCGCC TTTCGCTGTC GGGCGGAGCG GAGGTTGTCG 

1901 ATACCGCCGA CCTGATGCTG GACGGCACGG GCGTGCACCA CCGCATCCGC 

1951 ACACACGCCG CCATGACGCT GGATGGCAAA CCGTTCAAAT TCGATTTGGA 

2001 CGCTTCAGGC GGCATCAACA GGGAACTTAC CCGATGGAAA GGCAGCATCG 

2051 GCATCCTCGA CATCGGCGGC GCATTCAACC TCAAGCTGCA AAACCGTATG 

2101 ACGCTCGAAG CCGGTGCGGA ACGCGTGGCG GCAAGTGCGG CAAATTGGCA 

2151 GGCAATGGGC GGCAGCCTCA ACCTGCAACA CTTTTCTTGG GATAAAAAAA 

2201 CCGGCATATC GGCAAAAGGC GGCGCACACG GTCTGCATAT CGCCGAGTTG 

2251 CACAATTTCT TCAAACCGCC CTTCGAACAC AATCTGGTTT TAAACGGCGA 

2301 CTGGGATGTC GCCTACGGGC GAAACGCGCG CGGCTACCTC AATATCAGCC 

2351 GGCAAAGCGG CGATGCCGTA TTGCCCGGCG GGCAGGCTTT GGGTTTGAAC 

2401 GCATTTTCCC TGAAAACGCG CTTTCAAAAC GACCGTATCG GAATCCTGCT 

2451 TGACGGCGGC GCGCGTTTCG GGCGGATTAA CGCCGATTTG GACATCGGCA 

2501 ACGCCTTCGG CGGCAATATG GCAAATGCAC CGCTCGGCGG CAGGATTACC 

2551 GCCTCCCTTC CCGACTTGGG CACATTGAAG CCCTTTCTGC CCGCCGCCGC 

2601 GCAAAACATT ACCGGCAGCC TGAATGCCGC CGCGCAAATC GGCGGACGGG 

2651 TCGGCTCTCC GTCCGTCAAT GCCGCCGTCA ACGGCAGCAG CAACTACGGG 

2701 AAAATCAACG GCAACATCAC CGTCGGGCAA AGCCGCTCTT TCGATACCGC 

2751 GCCTTTGGGC GGCAGGCTCA ACCTGACCGT TGCCGATGCC GAAGTATTCC 

2801 GCAACTTCCT ACCGGTCGGA CAAACCGTCA AAGGCAGCCT GAATGCCGCC 

2851 GTAACCCTCG GCGGCAGCAT CGCCGATCCG CACTTGGGCG GCAGCATCAA 

2901 CGGCGACAAA CTCTATTACC GCAACCAAAC CCAAGGCATC ATCTTGGACA 

2951 ACGGCTCGCT GCGTTCGCAT ATCGCGGGCA GGAAATGGGT AATCGACAGC 

3001 CTGAAATTCC GGCACGAAGG GACGGCGGAA CTCTCCGGTA CGGTCGGTAT 

3051 GGAAAACAGC GGACCCGATG TCGATATCGG CGCGGTGTTC GACAAATACC 

3101 GCATCCTGTC CCGCCCCAAC CGCCGCCTGA CGGTTTCCGG CAACACCCGC 

3151 CTGCGCTATT CGCCGCAAAA AGGCATATCC GTTACCGGGA TGATTAAAAC 

3201 GGATCAGGGG CTGTTCGGTT CGCAAAAATC CTCGATGCCG TCCGTCGGCG 

3251 ACGATGTCGT CGTATTAGGC GAAGTCAAAA AAGAGGCGGC GGCACCGCTC 

3301 CCCGTCAATA TGAACCTGAC TTTAGACCTC AATGACGGCA TCCGCTTCGC 

3351 CGGCTACGGC GCGGACGTTA CCATAGGCGG CAAACTGACC CTGACCGCCC 

3401 AATCGGGCGG AAGCGTGCGG GGCGTGGGCA CGGTCCGCGT CATCAAAGGG 

34 51 CGTTATAAGG CATACGGGCA GGATTTGGAC ATTACCAAAG GCACGGTCTC 

3501 CTTTGTCGGC CCGCTCAACG ACCCCAACCT CAACATCCGC GCCGAACGCC 

3551 GCCTTTCCCC CGTCGGTGCG GGCGTGGAAA TATTGGGCAG CCTCAACAGT 

3601 CCGCGCATTA CGCTGACGGC AAACGAACCG AT GA GTG AAA AAGACAAGCT 

3651 CTCCTGGCTC ATCCTCAACC GCGCCGGCAG TGGCAGCAGC GGCGACAATG 

3701 CCGCCCTGTC CGCAGCCGCC GGCGCGCTGC TTGCCGGGCA AATCAACGAC 

3751 CGCATCGGGC TGGTGGATGA TTTGGGCTTT ACCAGCAAGC GCAGCCGCAA 

3801 CGCGCAAACC GGCGAACTCA ACCCCGCCGA ACAGGTGCTG ACCGTCGGCA 

3851 AACAACTGAC CGGCAAACTC TACATCGGCT ACGAATACAG CATCTCCAGC 

3901 GCGGAACAGT CCGTCAAACT GATTTACCGG CTGACCCGCG CCATACAGGC 

3951 GGTTGCCCGT ATCGGCAGCC GTTCGTCGGG CGGCGAGCTG ACATACACCA 

4001 TACGTTTCGA CCGCTTCTCC GGTTCGGACA AAAAAGACTC CGCCGGAAAC 

4051 AGCAAAGGAA AATAA 



This corresponds to the amino acid sequence <SEQ ID 1 192; ORF 285-1. a>: 
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a285~l .pep 



LKLSAALLSV LILAVCFLGW LAGTEAGLRF 



TLLDGFDGDN 
IVTKPTPPKE 
ERLDASYRYD 
201 GLEGKTIHST 
251 EEVLVKGFNI 
AGFADRNGI P 
KD1LDLNIGI 
401 ARTDGSLAIA 
451 DIRSRAFDPS 
PIAGSADIVY 
LSRFGFGLAG 
LDFTLKGSPD 
THAAMTLDGK 
TLEAGAERVA 
HNFFKPPFEH 
AFSLKTRFQN 
ASLPDLGTLK 
KINGNITVGQ 
VTLGGSIADP 
LKFRHEGTAE 
LRYSPQKGIS 
PVNMNLTLDL 
RYKAYGQDLD 
PRITLTANEP 
RIGLVDDLGF 
1301 AEQSVKLIYR 
1351 SKGK* 



51 
101 
151 



301 
351 



501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 



WSIETEGADL 
ERPPLSLPDS 
RKGHRLDLKA 
ARLSGSLKDV 
NPSAFVPSLP 
VRQVLGSFVI 
NSVGAEDVLQ 
SDPANGQRKL 
RIDPQLPAGN 
ESRHLPRAAV 
SLNVRGHLSG 
TSRPIRADIK 
PFKFDLDASG 
ASAANWQAMG 
NLVLNGDWDV 
DRIGILLDGG 
PFLPAAAQNI 
SRSFDTAPLG 
HLGGSINGDK 
LSGTVGMENS 
VTGMIKTDQG 
NDGIRFAGYG 
ITKGTVSFVG 
MSEKDKLSWL 
TSKRSRNAQT 
LTRAIQAVAR 



K1SRFRFAWK 
IDLPAAVYLD 
ADTPWSSSSG 
RAELAIDGGN 
DAGLNFDLTA 
RQDGTVHIGN 
TAFKGRLDGS 
VLDTVNIAAG 
INGSINLAGE 
DLRLGRNI I K 
DLDGGIRTFE 
GSRLSLSGGA 
GINRELTRWK 
GSLNLQHFSW 
AYGRNARGYL 
ARFGRINADL 
TGSLNAAAQI ' 
GRLNLTVADA 
LYYRNQTQGI 
GPDVDIGAVF 
LFGSQKSSMP 
ADVTIGGKLT 
PLNDPNLNIR 
ILNRA3SGSS 
GELNPAEQVL 
IGSRSSGGEL 



GLYQIPSWFG 
PSEXMRRSLH 
RFETGKISMG 
SASVGLKKPF 
IRLSGKSVIH 
IPSFSDGIAL 
TSVALLGRGG 
IGIGGTTASP 
QGSLTAQGYL 
LAKEKFTGKM 
TDGGFGKKGD 
TDLSGAARNL 
EVVDTADLML 
GSIGILDIGG 
DKKTGISAKG 
NISRQSGDAV 
DIGNAFGGNM 
GGRVGSPSVN 
EVFRNFLPVG 
ILDNGSLRSH 
DKYRILSRPN 
SVGDDVVVLG 
LTAQSGGSVR 
AERRLSPVGA 
GDNAALSAAA 
TVGKQLTGKL 
TYTIRFDRFS 



VNISSQNLKG 
ITEISAGDIA 
KAFDKQTVYL 
ALDTAIYTKG 
PFAESLDKTL 
EGSLDLEKTK 
IRLSGKIDTE 
KISWQLGIGT 
ELTKDRLLKL 
RFLPGTFKGV 
RLNLNITAPD 
H1GKAADIRS 
DGTGVQHRIR 
AFNLKLQKRM 
GAHGLHIAEL 
LPGGQALGLN 
ANAPLGGF.IT 
AAVNGSSKYG 
QTVKGSLKAA 
IAGRKWV1DS 
RRLTVSGKTR 
EVKKEAAAPL 
GVGTVRVIKG 
GVEILGSLNS 
GALLAGQIND 
YIGYEYSISS 
GSDKKDSAGN 



a285-l/m285-l 



99.3% identity in 1354 aa overlap 



10 20 30 40 50 60 

a2 8 5-1. pep LKLSAALLSVLILAVCFLGWLAGTEAGLRFGLYQIPSWFGVNISSQNLKGTLLDGFDGDN 

I i I I 1 I I I I I I I I I I I I I I I I I I I i I I I I I I I 1 1 I I 11 I I I I I I 1 1 I I I I t i 1 1 (! I I I I 

m265-l LK LS AALLS VL I L A VC FLGWL AG TEAGLRFG LYQI PS W FGVN I S S QNLKGT LLDG FDG DN 

10 20 30 40 50 60 

70 80 90 100 110 120 

a2 35-1. pep wsietegadlkisrfrfawkpselmrrslhiteisagdiaivtkptppkeerpplslpds 
1 1 1 1 1 1 i i ! i 1 1 1 i 1 1 1 i 1 1 i I i i i i i i 1 1 1 i i 1 1 i i i i ] I i i i i i i i i t i i i i i i i i i i 
m285-l wsietegadlkisrfrfakkpselmrrslhiteisagdiaivtkptppkeerpplslpds 
70 80 90 100 110 120 

130 140 150 160 170 180 

a2 8 5- 1 . pep I dlpaavyldrfetgki smgkafdkqtvylerldasyry drkghrldlkaadtpwss ssg 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 ] 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 

ra285-l idlpaavyldrfetgkismgkafdkqtvylerldasyrydrkghrldlkaadtpwssssg 

130 140 150 160 ^ 170 180 

190 200 210 220 230 240 

a285-l.pep SASVGLKKPFALDTAI YTKGGLEGKTI HSTARLSGSLKDVRAELAI DGGNI RLSGKS VI H 
: I I I I I I I I I I I I ! I I I I I I I I I I I I I ( I I I I I I I I I I I li I I ! I I I I 1 1 ! I I I 1 I I I I I 
m285-l AASVGLKKPFALDTAIYTKGGLEGKTIHSTARLSGSLKDVRAELAIDGGNIRLSGKSVIH 

190 200 210 220 230 240 

250 260 270 280 290 300 

a285-l.pep P FAE S LDKTLE EVLVKG FN I N PS AFV PS L ? DAGLN FDLT AI PS FS DG I ALEG S LDLENT K 
I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I II 
m285-l P FAE S LDKTLE EVLV KG FN I N PAAFV P S LPDAGLN FD LT AI PS FS DG I A LEG S LDLENT K 

250 260 270 280 290 300 

310 320 330 340 350 360 

a285-l.pep AG FADRN G I PV RQ VLG S FV I RQDCT VH I CNTSVAL LG RCGIRLSGKIDTEKDILDLNICI 
1 I I I I I I I I I I I I II I : I I I I II I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I ! I I 
ro28 5- 1 AGFADRNGI PV RQVLGG FV I RQDGT VH I GNT S AAL LG RGG I RLSGKIDTEKDILDLNIGI 

310 320 330 340 350 360 

370 380 390 400 410 420 

a285-l.pep NSVGAEDVLQTAFKGRLDGSIGIGGTTASPKISWQLGIGTARTDGSLAIASDPANGQRKL 
I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I 
m285-l NSVGAEDVLQTAFKGRLDGSIGIGGTTAS PKISWQLGIGTARTDGSLA1ASDPANGQRKL 

370 380 390 400 410 420 



430 



440 



4 50 



460 



410 



480 
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a2 8 5- 1 . pep VLDTVN I AAGQGSLTAQGYLELFKDRLLKLDI RSRAFDPSRI DPQLPAGNINGSINLAGE 
I I I I I I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I f I I I I I i I I I I I I I I I I II I I I I 
m285-l VLDTVN I AAGQGSLTAQG YLELFKDRLLKLD I RSRAFDPSRI DPQLPAGNI NGSI N LAG E 

430 440 450 460 470 480 



490 500 510 520 530 540 

a2 85-1. pep LA KEKFTG KMRFL PGT FNGV P I AG SAD IVYESRHLP RAA VDL RLGRN I 1 KT DGG FGKKG D 
I I I I I I II I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I 
m2 85-l LAKEKFTGKMRFLPGTFNGVPIAGSAD3VYESRHLPRAAVDLRLGRNIIKTDGGFGKKGD 

490 500 510 520 530 540 



550 560 570 560 590 600 

a2 85-1. pep RLNLNITAPDLSRFGFGLAGSLNVRGHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRS 
i I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I IE I I I I I I I I M I I I I I I I I I I I I I I I 
m285-l RLNLNITAPDLSRFGFGLAGSLNVRGHLSGDLDGGIRTFETDLSGAARNLHIGKAADIRS 

550 560 570 580 590 600 



610 620 630 640 650 660 

a285-l.pep LDFTLKGSPDTSRPIRADIKGSRLSLSGGAEVVDTADLMLDGTGVOHRIRTHAAMTLDGK 
I I I I I I I I I I I I I I I I I I I i i I I I I I I I I I I I I I I I I I I I t I I I I I I I I t I I I I I I I f I 
m285~l LDFTLKGSPDTSRPIRADIKGSRLSLSGGAAVVDTADLMLDGTGVQHRIRTHAAMTLDGK 

610 620 630 640 650 660 



670 680 690 700 710 720 

a285-l.pep PFKFDLDASGGIN RELT RWKGS I G I L DI GG A FNL KLQN RMTLEAGAERVAAS AANWQAMG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
m285-l PFKFDLDASGGINRELTRWKGSIGILDIGGAFNLKLQNRMTLEAGAERVAAS AANWQAMG 

670 680 690 700 710 720 



730 740 750 760 770 780 

a285-l . pep GSLNLQHFSWDKKTGISAKGGAHGLHIAELHNFFKPPFEHNLVLNGDWDVAYGRNARGYL 
11 I I I I I I I I I I I I I I I I i I I I I I ; I I I M I I I I I I I M I I I i I M I I I I I I I I I I I I I I 
m285-l CSLNLQHFSWDKKTGISAKGGAHGLHIAELHNFFKPPFEHNLVLNGDWDVAYGRMARGYL 

730 740 750 760 770 780 



790 800 B1C 820 830 840 

a285-l . pep NISRQ3GDAVLPGGQALGLNAFSLKTRFQKDRIGILLDGGARFGRINADLDIGNAFGGNN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I : I I I I I I I 
Itl285-1 NISRQSGDAVLPGGQALGLNAFSLKTRFQKDRIGILLDGGARFGRINADLGIANAFGGNM 

790 800 81C 820 830 840 



850 860 87C 880 890 900 

a285-l . pep ANAPLGGRITASLPDLGTLKPFLPAAAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYG 
I I I I 1 1 I I I I ! I I I I I I : I I 1 1 I I I I I I I 1 1 i 1 1 I I 1 1 I I I I I 1 1 I I I 1 1 1 1 1 I I 1 1 I I I 
m285-l ANAPLGGRITASLPDLGALKPFLPAAAQNITGSLNAAAQIGGRVGSPSVNAAVNGSSNYG 

850 860 870 880 890 900 



910 920 930 940 950 960 

a2 8 5-1 .pep KINGNITVGQSRSFDTAPLGGRLNLTVADAEVFRNFLPVGQTVKGSLNAAVTLGGSIADP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
1*285-1 K I NGN I T VGQS RS FDT A PLGGRLN LT V AD AEV FRNFL P VGQTVKGS LN AAVTLGGS I ADP 

910 920 930 940 950 960 



970 980 990 1000 1010 1020 

a2 8 5-1. pep HLGGSINGDKLYYRNQTQGIILDNGSLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENS 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I II I I 
m285-l HLGGSINGDKLYYRNQTQGIILDNGSLRSHIAGRKWVIDSLKFRHEGTAELSGTVGMENS 

970 980 990 1000 1010 102C 



1030 1040 1050 1060 1070 1080 

a285-l . pep GPDVDrGAVFDKYRILSRPNRRLTVSGNTRLRYSPQKGISVTGMIKTDQGLFGSQKSSMP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
n\285-l GPDVDIGAVFDKYRILSRPNRRXTVSGNTRLRYSPQKGISVTGMIKTDOGLFGSOKSSMP 

1030 1040 1050 1060 1070 1080 



1090 1100 1110 1120 1130 114C 

a285 - 1 . pep SVGDDWVLGEVKKEAAAPLPVNMNLTLDLMDGIRFAGYGADVTIGGKLTLTAQSGGSVF. 

I I I I I I I I I I I I I I I I I I M I I t I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m285-l SVGDDWVLGEVKKEAAAPLPVNMNLTLDLNDGIRFAGYGADVTIGGKLTLTAOSGGSVR 

1090 1100 1110 1120 1130 1140 



1150 1160 1170 1180 1193 1200 

a285-l . pep GVGTVRVI KGRYK AYGQDLD1 TKGT VS FVGPLNDPNLN I RAERRLS PVGAGVE ILGSLN S 
I M I I I I I I j I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I i I I I I I I I I M I I I I I I I I 
m285- 1 GVGTVRVI KGRYKAYGQDLD1TKGTVSFVGPLNDPNLNIRAERRLSPVGAGVEILGSLNS 
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1150 1160 1170 1180 1190 1200 

1210 1220 1230 1240 1250 1260 

a2 8 5-1. pep PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGF 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : 
m2 85-l PRITLTANEPMSEKDKLSWLILNRAGSGSSGDNAALSAAAGALLAGQINDRIGLVDDLGF 

1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

a2 8 5- 1 . pep TSKRSRNAQTGELNPAEQVLTVGKQLTGKLYIGYEYSISSAEQS vkli yrltraiqavar 

i i t i i i i i i i i i i i i I i i i i i i i i i i i i i i i i i i i i i i i I i i i i i i i i i i i i i i I i i i i i 

m285-l T S KRS RN AQTGELN PAEQVLTV GKQLTG KL Y IGYEYSISS AEQS VK L I YRLTRA I QA V A R 

1270 1280 1290 1300 1310 1320 

1330 1340 1350 

a285-l .pep IGSRSSGGELTYTIRFDRFSGSDKKDSAGNSKGKX 
I I I i I i I i I I I I I II t I I ! I I I I I I I I I I I: 1 I I I 
IU285-1 IGSRSSGGELTYTIRFDRFSGSDKKDSAGNGKGKX 

1330 1340 1350 



The following partial DNA sequence was 

g286.seq 

1 atgcagaaca ccggtaccat 

51 ggctttattt ttctttccgc 

101 aaaacaaggc ggcgggtttc 

151 gaatcagtca aattaaaacc 

201 cagtgaaatc aaagatatgg 

251 agcaggaaga ggttttggat 

301 gcaccggaca acgttaaaac 

351 caaggtcagc ctgacggaaa 

4 01 cgggcccgcg caccaaaatc 

451 atcctttcag acggcaacct 

501 ctggcagcag ccggtaggca 

551 gcaaaacttc cgtcctcggc 

601 aagctcggca acacccgggc 

651 tttgaacgtc gtcgtggaca 

701 aaatcaccgg cacacagcgt 

751 cgcttccaac cgggcacgcc 

801 acaggcgctc gaacaaaacg 

851 acttcgaccg cctcccaagg 

901 cgaggtcaaa cgccacaaac 

951 acggtttggg cggcaaaatc 

1001 ggctatatcg gctcggtcgt 

1051 tgccgccggc atcagccagc 

1101 gcaacgtttc ctacaaccgt 

1151 ttctccggcg gcatctggta 

1201 gctgggggcg gaatttctcg 

1251 tcgatttggg caacagccac 

1301 cagctgctca acaacgtgct 

1351 caaaatcggg acgactttgg 

14 01 gcacctctgc ccgcgcaggt 

14 51 ggcacgttca tcatacgcgg 

1501 tgccgatgtc ccctcggggc 

1551 tgcgcggtta cgaacttga 



identified in N. gonorrhoeae <SEQ ID 1 193>: 



gatgatcaaa 
acgcatacgc 
ccattgttca 
caaattcccc 
tcgaagaaca 
aaggaacaga 
aatgctccgc 
aagacggagc 
cccaacgtcg 
cgccgaatac 
gcgattUcga 
gcggtaacgc 
ggccgtcaac 
gcggccgccc 
taccccgaac 
ctacgacctc 
ggcattattc 
ggaccgcgtc 
tcgaaaccgg 
gcctacgact 
ctgggatatg 
cgcgcaacta 
tcgaccaccc 
tgtgcgcgac 
cagaaggccg 
gccacgatgc 
gcaccccgaa 
gcacattcct 
tatttcttca 
acaagcgggt 
tgatgttccg 



ccgaccgccc 
gcctgccgcc 
aaagcaaaag 
gtccgcatcg 
cctgccgctc 
cgggattcct 
agcaaaggct 
ttatacggtg 
gcgtcgccat 
taccgcaacg 
tcaggacag t 
gcaaaggcta 
cccgataccg 
cattgccttc 
aaaccgtctc 
gacctgctgc 
cggcgcgtcc 
cccgtcaaag 
catccgcctc 
attacaacct 
gacaaatacg 
tcggggcaac 
aaaacctcga 
cgcgcgggca 
gaaaatcccc 
tgaccgcctc 
aacggccatt 
gtcctccacc 
cgcccgaaaa 
tacaccgttg 
cagcggcggc 



tgctcctgcc 
gacct-tccg 
ccccgacacc 
acacgcagga 
atcacgcagc 
tgccgaagaa 
atttcagcag 
cacatcacac 
cctcggcgac 
cgctggaaaa 
tyggaaaaca 
cccgcttgcc 
ccaccgccga 
ggcgactttg 
cggcctggcg 
tcgacttcca 
gtacaagccg 
tcagcgtaac 
gattcggaat 
cttcaacaaa 
aaaccacgct 
tactggacaa 
aaaacgcgcc 
tcgatgccag 
cgctcggatg 
ttggaaacgc 
acctcgacgg 
ccgctaatcc 
caaaaaactc 
cacgcgacaa 
ccgtcttccg 



This corresponds to the amino acid sequence <SEQ ID 1 194; ORF 286.ng>: 

g286.pep 

1 MQNTGTMMIK PTALLLPALF FFPHAYAPAA DLSENKAAGF ALFKSKSPDT 

51 ESVKLKPKFP VRIDTQDSEI KDMVEEHLPL ITQQQEEVLD KEQTG FLAEE 

101 APDNVKTMLR SKGYFSSKVS LTEKDGAYTV HITPGPRTKI ANVGVAILGD 

151 ILSDGNLAEY YRNALENWQQ PVGSDFDQDS WENSKTSVLG AVTRKGYPLA 

201 KLGNTRAAVN PDTATADLNV VVDSGRPIAF GDFEITGTQR YPEQTVSGLA 

251 RFQPGTPYDL DLLLDFQQAL EQNGHYSGAS VQADFDRLPR GPRPRQSQRN 

301 RGQTPQTRNR HPPRFGIRFG RQNRLRLLQP LQQRLYRLGR LGYGQIRNHA 

351 CRRHQPAAQL SGQLLDKQRF LQPFDHPKPR KTRLLRRHLV CARPRGHRCQ 

4 01 AGGGISRRRP EN PRLGCRFG QQPRHDADRL LETPAAQQRA APRKRPLPRR 
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451 QNRDDFGHIP VLHRANPHLC PRRLFLHARK QKTRHVHHTR TSGLHRCTRQ 

501 CRCPLGADVP QRRRVFRARL RT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 195>: 

m286. seq 

1 ATGCACGACA CCCGTACCAT GATGATCAAA CCGACCGCCC TGCTCCTGCC 

51 GGCTTTATTT TTCTTTCCGC ACGCATACGC GCCTGCCGCC GACCTTTCCG 

101 AAAACAAGGC GGCGGGTTTC GCATTGTTCA AAAACAAAAG CCCCGACACC 

151 GAATCAGTCA AATTAAAACC CAAATTCCCC GTCCTCATCG ACACGCAGGA 

201 CAGTGAAATC AAAGATATGG TCGAAGAACA CCTGCCGCTC ATCACGCAGC 

251 AGCAGGAAGA AGTATTGGAC AAGGAACAGA CGGGCTTCCT CGCCGAAGAA 

301 GCGCCGGACA ACGTTAAAAC GATGCTCCGC AGCAAAGGCT ATTTCAGCAG 

351 CAAAGTCAGC CTGACGGAAA AAGACGGAGC TTATACGGTA CACATCACAC 

401 CGGGCCCGCG CACCAAAATC GCCAACGTCG GCGTCGCCAT CCTCGGCGAC 

451 ATCCTTTCAG ACGGCAACCT CGCCGAATAC TACCGCAACG CGCTGGAAAA 

501 CTGGCAGCAG CCGGTAGGCA GCGATTTCGA TCAGGACAGT TGGGAAAACA 

551 GCAAAACTTC CGTCCTCGGC GCGGTAACGC GCAAAGCCTA CCCGCTTGCC 

601 AAGCTCGGCA ATACGCAGGC GGCCGTCAAC CCCGATACCG CCACCGCCGA 

651 TTTGAACGTC GTCGTGGACA GCGGCCGCCC CATCGCCTTC GGCGACTTTG 

701 AAATCACCGG CACACAGCGT TACCCCGAAC AAATCGTCTC CGGCCTTGCG 

751 CGTTTCCAGC CCGGTATGCC GTACGACCTC GACCTGCTGC TCGACTTCCA 

801 ACAGGCGCTC GAACAAAACG GGCATTATTC CGGCGCGTCC GTACAAGCCG 

851 ACTTCGACCG CCTCCAAGGC GACCGCGTCC CCGTCAAAGT CAGCGTAACC 

901 GAGGTCAAAC GCCACAAACT CGAAACCGGC ATCCGCCTCG ATTCGGAATA 

951 CGGTTTGGGC GGCAAAATCG CCTACGACTA TTACAACCTC TT CAACAAAG 

1001 GCTATATCGG TTCGGTCGTC TGGGATATGG ACAAATACGA AACCACGCTT 

1051 GCCGCCGGCA TCAGCCAGCC GCGCAACTAT CGGGGCAACT ACTGGACAAG 

1101 CAACGTTTCC TACAACCGTT CGACCACCCA AAACCTCGAA AAACGCGCCT 

1151 TCTCCGGCGG CGTCTGGTAT GTGCGCGACC GCGCGGGCAT CGATGCCAGG 

1201 CTGGGGGCGG AATTTCTCGC AGAAGGCCGG AAAAT CCCCG GCTCGGCTGT 

1251 CGATTTGGGC AACAGCCACG CCACGATGCT GACCGCCTCT TGGAAACGCC 

1301 AGCTGCTCAA CAACGTGCTG CATCCCGAAA ACGGCCATTA CCTCGACGGC 

1351 AAAATCGGTA CGACTTTGGG CACATTCCTG TCCTCCACCG CGCTGATCCG 

1401 CACCTCTGCC CGTGCAGGTT ATTTCTTCAC GCCCGAAAAC AAAAAACTCG 

1451 GCACGTTCAT CATACGCGGA CAAGCGGGTT ACACCGTTGC CCGCGACAAT 

1501 GCCGACGTTC CTTCAGGGCT GATGTTCCGC AGCGGCGGCG CGTCTTCCGT 

1551 GCGCGGTTAC GAACTCGACA GCATCGGACT TGCCGGCCCG AACGGATCGG 

1601 TCCTGCCCGA ACGCGCCCTC CTGGTGGGCA GCCTGGAATA CCAACTGCCG 

1651 TTTACGCGCA CCCTTTCCGG CGCGGTGTTC CACGATATGG GCGATGCCGC 

1701 CGCCAATTTC AAACGTATGA AGCTGAAACA CGGTTCGGGA CTGGGCGTGC 

1751 GCTGGTTCAG CCCGCTTGCG CCGTTTTCCT TCGACATCGC CTACGGGCAC 

1801 AGCGATAAGA AAATCCGCTG GCACATCAGC TTGGGAACGC GCTTCTAA 

This corresponds to the amino acid sequence <SEQ ID 1 196; ORF 286>: 

m286.pep 

1 MHDTRTMMIK PTALLLPALF FFPHAYAPA A DLSENKAAGF ALFKNKSPDT 

51 ESVKLKPKFP VLI DTQDSEI KDMVEEHLPL ITQQQEEVLD KEQTGFLAEE 

101 APDNVKTMLR SKGYFSSKVS LTEKDGAYTV HITPGPRTKI ANVGVAILGD 

151 ILSDGNLAEY YRNALENWQQ PVGSDFDQDS WENSKTSVLG AVTRKAYPI.A 

201 KLGNTQAAVN PDTATADLNV WDSGRPIAF GDFEITGTQR YPEQIVSGLA 

251 RFQPGMPYDL DLLLDFQQAL EQNGHYSGAS VQADFDRLQG DRVPVKVSVT 

301 EVKRHKliETG IRLDSEYGLG GKIAYDYYNL FNKGYIGSVV WDMDKYETTL 

351 AAGISQPRNY RGNYWTSNVS YNRSTTQNLE KRAFSGGVWY VRDRAGIDAR 

401 LGAEFLAEGR KIPGSAVDLG NSHATMLTAS WKRQLLNNVL HPENGHYLDG 

451 KIGTTLGTFL SSTALIRTSA RAGYFFTPEN KKLGTFIIRG QAGYTVARDN 

501 ADVPSGLMFR SGGASSVRGY ELDSIGLAGP NGSVLPERAL LVGSLEYQLP 

551 FTRTLSGAVF HDMGDAAANF KRMKLKHGSG LGVRWFSPLA PFSFDIAYGH 

601 SDKKIRWHIS LGTRF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m286/g286 95.9% identity in 293 aa overlap 
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10 20 30 40 50 60 

m286.pep MHDTRTMMIKPTALLLPALFFFPHAYAPAADLSENKPAGFALFKNKSPDTESVKLKPKFP 
1 : : I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I 1 I I : I I I I I I I I I I I I I I I 
g286 MQNTGTMMIKPTALLLPALFFFPHAYAPAADLSENKAAGFALFKSKSPDTESVKLKPKFP 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 8 6 . pep VLI DTQDSEIKDMVEEHLPLITQQQEEVLDKEQTGFLAEEAPDNVKTMLRSKGYFS SKVS 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
g 2 8 6 VRI DTQDSE I XDMVEEHLPL I TQQQEE VLDKEQTGFLAEEAP DNVKTMLRSKG YFS SKV S 

70 80 90 100 110 120 



130 140 150 160 170 180 

m2B 6 . pep LTEKDGAYTVHITPGPRTKIANVGVAILGDILSDGNLAEYYRNALENWQQPVGSDFDQDS 
I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I i I I I I I I I I I II I I I I I I I I I I I I i I I I I 
g286 LTEKDGAYTVHITPGPRTKIANVGVAILGDILSDGNLAEYYRNALEKWQQPVGSDFDQDS 

130 140 150 160 170 180 



190 200 210 220 230 240 

m2 8 6 . pep WENSKTSVLGAVTRKAYPLAKLGNTQAAVNPDTATADLNWDSGRPIAFGDFEITGTQR 
I I I I I I I I I I I I I I I : I I I I I I I I I : I I I I I I I I I I I i I I I I I I I I I I II I I I I M I I I I 
g286 WENSKTSVLGAVTRKGYPLAKLGNTRAAVNPDTATACLNW\^DSGRPIAFGDFEITGTQR 

190 200 210 220 230 240 



250 260 270 280 290 299 

m28 6 . pep YPEQIVSGLARFQPGMPYDLDLLLDFQQALEQNGHYSGASVQADFDRL-QGDRVPVKVSV 

Nil I I II I I I I ! I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I : I ! 
g286 YPEQTVSGLARFQPGTPYDLDLLLDFQQALEQNGHYSGASVQADFDRLPRGPRPRQSQRN 

250 260 270 280 290 300 

300 310 320 330 340 350 359 

m286 . pep TEVKRHKLETGIRLDSEYGLGGKIAYDYYNLFNKGYIGSWWDMDKYETTLAAGISQPRN 



g286 RGQTPQTRNRH PPRFG I RFGRQNRLRLLQPLQQRLYRLGRLGYGQI RNHACRRHQPAAQL 

310 320 330 340 350 360 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 197>: 

a286. seq 

1 ATGCACGACA CCCGTACCAT GATGATTAAA CCGACCGCCC TGCTCCTGCC 

51 GGCTTTATTT TTCTTTCCGC ACGCATACGC GCCTGCCGCC GACCTTTCCG 

101 AAAACAAGGC GGCGGGTTTC GCATTGTTCA AAAACAAAAG CCCCGACACC 

151 GAATCAGTTA AATTAAAACC CAAATTCCCC GTCCGCATCG ACACGCAGGA 

201 TAGTGAAATC AAAGATATGG TCGAAGAACA CCTGCCGCTC ATCACGCAGC 

251 AGCAGGAAGA AGTATTGGAC AAGGAACAGA CGGGCTTCCT CGCCGAAGAA 

301 GCACCGGACA ACGTTAAAAC AATGCTCCGC AGCAAAGGCT ATTTCAGCAG 

351 CAAAGTCAGC CTGACGGAAA AAGACGGAGC TTATACGGTA CACATCACAC 

401 CGGGCCCGCG CACCAAAATC GCCAACGTCG GCGTCGCCAT CCTCGGCGAC 

4 51 ATCCTTTCAG ACGGCAACCT CGCCGAATAC TACCGCAACG CGCTGGAAAA 

501 CTGGCAGCAG CCGGTAGGCA GTGATTTCGA TCAGGACAGT TGGGAAAACA 

551 GCAAAACTTC CGTCCTCGGC GCGGTAACGC GCAAAGCCTA CCCGCTTGCC 

601 AAGCTCGGCA ACACCCGGGC GGCCGTCAAC CCCGATACCG CCACCGCCGA 

651 TTTGAACGTC GTCGTGGACA GCGGCCGCCC CATCGCCTTC GGCGACTTTG 

701 AAATTACCGG CACGCAGCGT TACCCCGAAC AAATCGTCTC CGGCTTGGCG 

751 CGCTTCCAAC CGGGCACGCC CTACGACCTC GACCTGCTGC TCGACTTCCA 

801 ACAGGCGCTC GAACAAAACG GGCATTATTC CGGCGCGTCC GTACAAGCCG 

851 ACTTCGACCG CCTCCAAGGC GACCGCGTCC CCGTCAAAGT CAGCGTAACC 

901 GAGGTCAAAC GCCACAAGCT CGAAACCGGC ATCCGCCTCG ATTCGGAATA 

951 CGGTTTGGGC GGCAAAATCG CCTACGACTA TTACAACCTC TTCAACAAAG 

1031 GCTATATCGG TTCGGTCGTC TGGGATATGG ACAAATACGA AACCACGCTT 

1051 GCCGCCGGCA TCAGCCAGCC GCGCAACTAT CGGGGCAACT ACTGGACAAG 

1131 CAACGTTTCC TACAACCGTT CGACCACCCA AAACCTCGAA AAACGCGCCT 

1151 TCTCCGGCGG CATCTGGTAT GTGCGCGACC GCGCGGGCAT CGATGCCAGG 

1201 CTGGGGGCGG AGTTTCTCGC AGAAGGCCGG AAAATCCCCG GCTCGGATAT 

1251 CGATTTGGGC AACAGCCACG CCACGATGCT GACCGCCTCT TGGAAACGCC 

1301 AGCTGCTCAA CAACGTGCTG CATCCCGAAA ACGGCCATTA CCTCGACGGC 
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1351 
1401 
1451 
1501 
1551 
1603 
1651 
1701 
1751 
1801 



AAAATCGGTA 
CACCTCTGCC 
GCACGTTCAT 
GCCAACGTTC 
GCGCGGTTAC 
TCCTGCCCGA 
TTTACGCGCA 
CGCCAATTTC 
GCTGGTTCAG 
AGCGACAAGA 



CGACTTTGGG 
CGCGCAGGTT 
CATACGCGGA 
CTTCAGGGCT 
GAACTCGACA 
ACGCGCCCTC 
CCCTTTCCGG 
AAACGTATGA 
CCCGCTCGCG 
AAATCCGCTG 



669 

CGCATTCCTG 
ATTTCTTCAC 
CAAGCGGGTT 
GATGTTCCGC 
GCATCGGGCT 
7TGGTGGGCA 
CGCGGTGTTC 
AGCTGAAACA 
CCGTTTTCCT 
GCACATCAGC 



TCCTCCACCG 
GCCCGAAAAC 
ACACCGTTGC 
AGCGGCGGCG 
TGCCGGCCCG 
GCCTGGAATA 
CACGATATGG 
CGGTTCGGGA 
TCGACATCGC 
TTGGGAACGC 



CGCTGATCCG 
AAAAAACTCG 
CCGCGACAAT 
CGTCTTCCGT 
AACGGATCGG 
CCAACTGCCG 
GCGACGCCGC 
CTGGGCGTGC 
CTACGGGCAC 
GCTTCTAA 



This corresponds to the amino acid sequence <SEQ ID 1 198; ORF 286.a>: 

a286.pep 

1 MHDTRTMMIK PTALLLPALF FFPHAYAPA A DLSENKAAGF ALFKNKSPDT 

51 ESVKLKPKFP VRIDTQDSEI KDMVEEHLPL ITQQQEEVLD KEQTGFLAEE 

101 APDNVKTMLR SKGYFSSKVS LTEKDGAYTV HITPGPRTKI ANVGVAILGD 

151 ILSDGNLAEY YRNALENWQQ PVGSDFDQDS WENSKTSVLG AVTRKAYPLA 

201 KLGNTRAAVN PDTATADLNV WDSGRPIAF GDFEITGTQR YPEQIVSGLA 

251 KFQPGTPYDL DLLLDFQQAL EQNGHYSGAS VQADFDRLQG DRVPVKVSVT 

301 EVKRHKLETG IRLDSEYGLG GKIAYDYYNL FNKGYIGSVV WDMDKYETTL 

351 AAGISQPRNY RGNYWTSNVS YNRSTTQNLE KRAFSGGIWY VRDRAGIDAR 

4 01 LGAEFLAEGR KIPGSDIDLG NSHATMLTAS WKRQLLNNVL HPENGHYLDG 

451 KIGTTLGAFL SSTALIRTSA RAGYFFTPEN KKLGTFI IRG QAGYTVARDN 

501 ANVPSGLMFR SGGASSVRGY ELDSIGLAGP NGSVLPERAL LVGSLEYQLP 

551 FTRTLSGAVF HDMGDAAANF KRMKLKHGSG LGVRWFSPLA PFS FDIAYGH 

601 SDKKIRWHIS LGTRF* 

m286/a286 



98.7% identity in 615 aa overlap 



50 



60 



m286.pep 
a286 

m286.pep 
a286 

ra286.pep 
a286 

m28 6.pep 
a286 



10 20 30 40 

MHDTRTMMIKPTALLLPALFFFPHAYAPAADLSENKAAGFALFKNKSPDTESVKLKPKFP 

I 1 1 I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I II I 1 I I I I I I I i I I I I I I I I I 1 1 I I I ! I 

MHDTRTMMIKPTALLLPALFFFPHAYAPAADLSENKAAGFALFKNKSPDTESVKLKPKFP 
10 20 30 40 50 60 

70 80 90 100 110 120 

VLIDTQDSEIKDMVEEHLPLITQQQEEVLDKEQTGFLAEEAPDNVKTMLRSKGYFSSKVS 
I I I I I I I I I II I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I 
VRIDTQDSEIKDMVEEHLPLITQQQEEVLDKEQTGFLAEEAPDNVKTMLRSKGYFSSKVS 

70 80 90 100 110 120 

130 140 150 160 170 180 

LTEKDGAYTVHITPGPRTKIANVGVAILGDILSDGNLAEYYRNALENWQQ PVGSDFDQDS 
I I I I II I I I I I II I I I II I I I I I I 1 I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I 
LTEKDGAYTVHITPGPRTKIANVGVAILGDILSDGNLAEYYRNALENWQQPVGSDFDQDS 

130 140 150 160 170 180 

190 200 210 220 230 240 

WENSKTSVLGAVTRKAYPIiAKLGNTQAAVNPDTATADLNW/DSGRPIAFGDFEITGTQR 
I I I I II I I I I I I I I I I I I I ! I I I I I : I I I I I I I I II I I 1 I I I I I II I I I I I I I I I I I I I I 
WENSKTSVLGAVTRKAYPLAKLGNTRAAVNPDTATADLNW^/DSGRPIAFGDFEITGTQR 

190 200 210 220 230 240 



250 260 270 280 290 300 

ra286.pep YPEQIVSGLARFQPGMPYDLDLLLDFQQALEQNGHYSGASVQADFDRLQGDRVPVKVSVT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a286 YPEQIVSGLARFQPGTPYDLDLLLDFQQALEQNGHYSGASVQADFDRLQGDRVPVKVS^/T 

250 260 270 280 290 300 

310 320 330 340 350 360 

m286.pep EVKRHKLETGIRLDSEYGLGGKIAYDYYNLFNKGYIGSWWDMDKYETTLAAGISQPRNY 
I 1 1 I I II I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
a286 EVKRHKLETGIRLDSEYGLGGKIAYDYYNLFNKGYIGSWWDMDKYETTLAAG1SQPRNY 
310 320 330 340 350 360 



370 



380 



390 



400 



410 



420 
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m286.pep RGNYWTSNVSYNRSTTQNLEKRAFSGGVWYVRDRAGIDARLGAEFIAEGRKIPGSAVDLG 
I I I I I I I I I 1 I I I I I I I I I I I I I I i I I : I I I I I I I I I I I I I I I I I I I I I I II II I 'III 
a 286 RGNYWTSNVSYNRSTTQNLEKRAFSGGIWYVRDRAGIDARLGAEFLAEGRKIPGSDIDLG 
370 380 390 400 410 420 

430 440 450 460 470 480 

m286.pep NSHATMLTASWKRQLLNNVLHPENGHYLDGKIGTTLGTFLSSTALIRTSARAGYFFTPEN 
I | | I | I I I I I I I I I I I I I I II I I I II I I I I I I I I I II : I I I I ! I I I I I I I I I I I I I I I I I 
a286 NSflATMLTASWKRQLLNNVLHPENGHYLDGKIGTTLGAFLSSTALIRT SARAGYFFTPEN 

430 440 450 460 470 480 

490 500 510 520 530 540 

m286.pep KKLGTFIIRGQAGYTVARDNADVPSGLMFRSGGASSVRGYE1DSIGLAGPNGSVLPERAL 
I | | I I | I I ( 1 I I I I I I I I I I I: I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I 
a286 KKLGTFIIRGQAGYTVARDNANVPSGLMFRSGGASSYRGYELDSIGLAGPNGSVLPERAL 

490 500 510 520 530 540 

550 560 570 580 590 600 

m286 . pep LVGSLEYQLPFTRTLSGAVFHDMGDAAANFKRMKLKHGSGLGVRWFSPLAPFSFDIAYGH 

I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 

a 286 LVGSLEYQLPFTRTLSGAVFHDMGDAAANFKRMKLKHGSGLGVRWFSPLAPFSFDIAYGH 

550 560 570 580 590 600 

610 

m28 6 . pep S DKKIRWH I SLGTRFX 

I I I I I I I I I I I I I I I I 
a286 S DKKI RWH I SLGTRFX 

610 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 99>: 

g287.seq 

1 atgtttaaac gcagtgtgat tgcaatggct tgtatttttc ccctttcagc 

51 ctgtgggggc ggcggtcgcg gatcgcccga tgtcaagtcg gcggacacgc 

101 cgtcaaaacc ggccgccccc gttgttgctg aaaatgccgg ggaaggggty 

151 ctgccgaaag aaaagaaaga tgaggaggca gcgggcggtg cgccgcaagc 

201 cgatacgcag gacgcaaccg ccggagaagg cagccaagat atggcggcag 

25.1 tt.tcggcaga aaatacaggc aatggcggtg cggcaacaac ggacaacccc 

301 aaaaatgaag acgcgggggc gcaaaatgat atgccgcaaa atgccgccga 

351 atccgcaaat caaacaggga acaaccaacc cgccggttct tcagattccg 

401 cccccgcgtc aaaccctgcc cctgcgaatg gcggtagcga ttttggaagg 

451 acgaacgtgg gcaattctgt tgtgattgac ggaccgtcgc aaaatataac 

501 gttgacccac tgtaaaggcg attcttgtaa tggtgataat ttattggatg 

551 aagaagcacc gtcaaaatca gaatttgaaa aattaagtga tgaagaaaaa 

601 attaagcgat ataaaaaaga cgagcaacgg gagaattttg tcggtttggt 

651 tgctgacagg gtaaaaaagg atggaactaa caaatatatc atcttctata 

701 cggacaaacc acctactcgt tctgcacggt cgaggaggtc gcttccggcc 

751 gagattccgc tgattcccgt caatcaggcc gatacgctga ttgtggatgg 

801 ggaagcggtc agcctgacgg ggcattccgg caatatcttc gcgcccgaag 

851 ggaattaccg gtatctgact tacggggcgg aaaaattgcc cggcggatcg 

901 tatgccctcc gtgtgcaagg cgaaccggca aaaggcgaaa -gcttgttgg 

951 cacggccgtg tacaacggcg aagtgctgca tttccatatg gaaaacggcc 

1001 gtccgtaccc gtccggaggc aggtttgccg caaaagtcga -ttcggcagc 

1051 aaatctgtgg acggcattat cgacagcggc gatgatttgc atatgggtac 

1101 gcaaaaattc aaagccgcca -cgatggaaa cggctttaag gggacttgga 

1151 cggaaaatgg cggcggggat gtttccggaa ggttttacgg cccggccggc 

1201 gaggaagtgg cgggaaaata cagctatcgc ccgacagatg ctgaaaaggg 

1251 cggattcggc gtgtttgccg gcaaaaaaga tcgggattga 

This corresponds to the amino acid sequence <SEQ ID 1200; ORF 287.ng>: 

g287 .pep 

1 MFKRSVIAMA CIFPLSA CGG GGGGSPDVKS ADTPSKPAAP VVAENAGEGV 

51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP 

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR 
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151 TNVGNSWID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLS DEEK 

201 IKRYKKDEQR ENFVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA 

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS 

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS 

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG 

4 01 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1201>: 

m287 . seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGGCAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

201 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGGATG 

4 01 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 

451 GACGGAATGC AGGGGGACGA T^CGTCGGCA GGCGGGCAAA ATGCCGGCAA 

501 TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 

551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC 

601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC 

651 G CAAAAT AT A ACGTTGACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA 

7 01 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT 

751 GATGCAGACA AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT 

801 TGTCGGTTTG GTTGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA 

851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT 

901 GCACGGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA - 

951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC 

1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC 

1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTCAAGGCGA 

1101 ACCGGCAAAA GGCGAAA7GC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG 

1151 TACTGCATTT CCATACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG 

1201 TTTGCCGCAA AAGTCGA7TT CGGCAGCAAA TCTGTGGACG GCATTATCGA 

1251 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAATTCAAA GCCGCCATCG 

1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT 

1351 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG 

1401 CTATCGCCCG ACAGATGCGG AAAAGGGCGG ATTCGGCGTG TTTGCCGGCA 

1451 AAAAAGAGCA GGATTGA 

This corresponds to the amino acid sequence <SEQ ID 1202; ORF 287>: 

m287.pep 

1 MFKRSVIAMA CI FALSA CGG GGGGSPDVKS ADTLSKPAAP WS EKE TEAK 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGN7AA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLH FHTE NGRPYPTRGR 

4 01 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 

451 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m287/g287 70.1% identity in 499 aa overlap 

10 20 30 40 49 

m287.pep M FKR S V I AMAC I FALS ACGGG GGG S P D VKS ADT L S K PAAP VVS E KETEA 

i I i 1 I I I I I I I I I I I I I J I I I I I I I I I ! I I I I II I I I I I I : I I : II 

g287 MFKRSVIAMACIFPLSACGGGGCGSPDVKS/IDTPSKPAAPWAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 
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50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSC'GQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 

I I I I : I I : : : I I I I i ! I I I I I I I I I I I : I : I I I I I 1 I I I I I I I I I I I 
g 287 AGGAPQADTQD — ATAGEGSQDMAAV S AENTGNGGAATT DN PKNEDAGAQN DMPQNAA — 

70 80 90 100 110 

110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 

g287 



170 180 190 200 210 220 229 

m287 .pep AQGANQAGNNQAAGS S DPI PASNPAPANGGSNFGRVDLANGVLI DGPSQNITLTHCKGDS 

:: I I I : I I I I I I I I I I I I 1 I I I I I I I I : I I I : : : : I : I : I I I I I I I I I 11 I I I I I I 
g 287 -ESANQTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWI DGPSQNITLTHCKGDS 

120 130 140 150 160 170 



230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYI IFYKP 

1:1:1:1111: I I II I I I ! I : I I : I I I I : : : I I I I I M I : I I : I I I I I 
g287 CNGDNLLDEEAPSKSEFEKliSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTD 
180 190 200 210 220 230 



290 300 310 320 330 340 349 

m287 .pep KPTSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 
II : I I I I I M I I I I I : 1 I I I II I I f I I ! 1 1 i I I I I I I I I I I I I I II I I I I I I I I 

g287 KPPT RSARSRRSLPAEI PLI PVNQADTLI VDGEAVSLTGHSGN IFAPEGN YRYLT 

240 250 260 270 280 290 



350 360 370 380 390 400 409 

m287 . pep YGAEKLPGG SYALRVQGE PAKGEMLAGAAVYNGEVLHFHTENGRP YPTRGRFAAKVDFGS 

I I I I I I I I I I I I I I I I I I I I I I I II : I : I I I I I I II I II I I I I I I I : I I I I I I I I I I I 
g287 YGAEKLPGGSYALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS 
300 310 320 330 340 350 



410 420 430 440 450 460 469 

m287.pep KSVDGIIDSGDDLHMGTQKFKT^AIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 
I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I ! I I I I : I I I I I : I I I I I I I I I I I I I I I I 
g287 KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 
360 370 380 390 400 410 



470 480 489 

m2 8 7 . pep PTDAEKGGFGVFAGKKEQDX 
I li III MM I HI li::M 
g287 PTDAEKGGFGVFAGKKDRDX 
420 430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1203>: 
a287.seq 

1 ATGTTT7AAAC GCAGTGTGAT TGCAATGGCT TGTATTGTTG CCCTTTCAGC 

51 CTGTGGGGGC GGCGGTGGCG GATCGCCCGA TGTTAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTACTG AAGATGTCGG GGAAGAGGTG 

151 CTGCCGAAAG AAAAGAAAGA TGAGGAGGCG GTGAGTGGTG CGCCGCAAGC 

201 CGATACGCAG GACGCAACCG CCGGAAAAGG CGGTCAAGAT ATGGCGGCAG 

251 TTTCGGCAGA AAATACAGGC AATGGCGGTG CGGCAACAAC GGATAATCCC 

301 GAAAATAAAG ACGAGGGACC GC7AAAATGAT ATGCCGCAAA ATGCCGCCGA 

351 TACAGATAGT TCGACACCGA ATCACACCCC TGCACCGAAT ATGCCAACCA 

401 GAGATATGGG AAACCAAGCA CCGGATGCCG GGGAATCGGC ACAACCGGCA 

451 AACCAACCGG ATATGGCAAA TGCGGCGGAC GGAATGCAGG GGGACGATCC 

501 GTCGGCAGGG GAAAATGCCG GCAATACGGC AGATCAAGCT GCAAATCAAG 

551 CTGAAAACAA TCAAGTCGGC GGCTCTCAAA ATCCTGCCTC TTCAACCAAT 

601 CCTAACGCCA CGAATGGCGG CAGCGATTTT GGAAGGATAA ATGTAGCTAA 

651 TGGCATCAAG CTTGACAGCG GTTCGGAAAA TGTAACGTTG ACACATTGTA 

701 AAGACAAAGT ATGCGATAGA GATTTCTTAG ATGAAGAAGC ACCACCAAAA 

751 TCAGAATTTG AAAAATTAAG TGATGAAGAA AAAATTAATA AATATAAAAA 



WO 99/57280 PCT/US99/09346 



673 



801 AGACGAGCAA CGAGAGAATT TTGTCGGTTT GGTTGCTGAC AG GG TAG AAA 

851 AGAATGGAAC TAACAAATAT GTCATCATTT ATAAAGACAA GTCCGCTTCA 

901 TCTTCATCTG CGCGATTCAG GCGTTCTGCA CGGTCGAGGC GGTCGCTTCC 

951 GGCCGAGATG CCGCTGATTC CCGTCAATCA GGCGGATACG CTGATTGTCG 

1001 ATGGGGAAGC GGTCAGCCTG ACGGGGCATT CCGGCAATAT CTTCGCGCCC 

1051 GAAGGGAATT ACCGGTATCT GACTTACGGG GCGGAAAAAT TGTCCGGCGG 

1101 ATCGTATGCC CTCAGTGTGC AAGGCGAACC GGCAAAAGGC GAAATGCTTG 

1151 CGGGCACGGC CGTGTACAAC GGCGAAGTGC TGCATTTCCA TATGGAAAAC 

1201 GGCCGTCCGT CCCCGTCCGG AGGCAGGTTT GCUGCAAAAG TCGATTTCGG 

1251 CAGCAAATCT GTGGACGGCA TTATCGACAG CGGCGATGAT TTGCATATGG 

1301 GTACGCAAAA ATTCAAAGCC GTTATCGATG GAAACGGCTT TAAGGGGACT 

1351 TGGACGGAAA ATGGCGGCGG GGATGTTTCC GGAAGGTTTT ACGGCCCGGC 

14 01 CGGCGAAGAA GTGGCGGGAA AATACAGCTA TCGCCCGACA GATGCGGAAA 

14 51 AGGGCGGATT CGGCGTGTTT GCCGGCAAAA AAGAGCAGGA TTGA 



This corresponds to the amino acid sequence <SEQ ID 1204; ORF 287.a>: 

a287 .pep 

1 MFKRSVIAMA CIVALSAC GG GGGGSPDVKS ADTLSKPAAP WTEDVGEEV 

51 LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 

101 ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 

151 NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNPASSTN 

201 PNATNGGSDF GRINVANGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 

251 SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS 

301 SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 

351 EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN 

401 GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT 

4 51 WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 



m287/a287 77.2% identity in 501 aa overlap 

10 20 30 40 49 

m287.pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

IMIEIIiilll M I I I I II I i M I I I I I M I I I ! I I i I I I : I |: || 

a287 MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVATEDVGEEVLPKEKKDEEA 

10 20 30 40 50 60 



50 60 70 80 90 100 109 

m2 8 7 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 
1111:1 I : : : I : I I I I I II I I I I I I II : I : I I I : I : 1 1 I I I I I I I I I I 
a287 VSGAPQADTQ — DATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADT 

70 80 90 100 110 



110 120 130 140 150 160 169 

m2 87 . pep DSSTPNHT PD PNMLAGNMENQAT DAGE S SQ PAN Q PDMAN AADGMQG D DP SAGGQN AGNTA 

I I I I I I I I I ill : : I III I I I I I : I 1 I I I I II I I II I I I I I II I I I I : I I II I I 
a287 DSSTPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAG-F.NAGNTA 
120 130 140 150 160 170 



170 180 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANG\'LIDGPSQNITLTHCKGDS 

1:1111 I I I : : I I : : I : : I I : I I I I : I I I : : : I ! I : : I : I : I : I I I I I I 
a287 DQ AANQ AE NNQVGG SQN P AS S TN PN AT NGG S DFG R I NV ANG I KL D S G S EN VT LTHCK DKV 

180 190 200 210 220 230 



230 240 250 260 270 280 289 

m287 .pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLYADSVQMKGINQYIIFYKP 

I : : I I II I : I I I I I I I I I : I I : : I I I I : : : I I I I I I f I : : I I : I : I : I I 
a287 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQREKFVGLVADRVEKNGTNKYVIIYKD 
240 250 260 270 280 290 



290 300 310 320 330 340 

m287.pep KP-- TSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a287 KSASSSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
300 310 320 330 340 350 



WO 99/57280 



PCT/US99/09346 



674 



350 360 370 380 390 400 

m287 . pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

I I ! I I I I I I II I I i I I I I I ! I I I ! I > I : I i I I I I i I I I I I I I I I I : MINIMI 
a 287 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
360 370 380 390 400 410 

410 420 430 440 450 460 

m287 . pep GSKS VDG 1 1 DSGDDLHMGTQK FKAAI DGNG FKGTWTENG SGDVSGKFYGPAGEEVAGKYS 

I I ! I I I I I I I I I I I I II II II I I I : M I I I I I I I M I I I : M I I I : I I II I M I I I I I M 
a 287 GSKSVDGI I DSGDDLHMGTQK FKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS 

420 430 440 450 460 470 

470 480 489 

m287 . pep YRPTDAEKGGFGVFAGKKEQDX 

I I I I 11 I I I I I I I I I I I II I I i 
a287 YRPTDAEKGGFGVFAGKKEQDX 
480 490 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 1205>: 

g288 .saq 

1 atgcacaccg gacaggcggt aagccgggtt ctgtctcgga cagtcattcc 

51 tctaggcata ccgttgccgg tatgctcaag caacctaccc gaacgctcgg 

101 cgggcagcgt cattgcgttc tgtttggtct tgctccgaat ggggtttggc 

151 ctgccgcata ttgttaccaa atgcgcggtg cgcccttacc gcaccttttc 

201 acccttgcct gtgctgccaa agcagccatc ggcggttttg ctttctgttc 

251 cactttccgt cgcgttaccg cgcccggccg ttaaccggca ttctaccctg 

301 cggagcccgg actttcctcc ccgtatgcct tacgcgatac gcggcgactg 

351 tctgcccgtc ccgtgtgcgg cgcggattat aacacgaaac gcaaaaatgc 

401 cgtctgaaac ggtacaggtt tcagacggca tacagcctaa actacacacc 

•451 ctgtttcagg ctggcttcga tgaagccgtc caagtcgccg tccaatacgg 

501 ctttgtggtt gccgacttcg tagcctgtac gcaagtcttt gatgcgtga 

This corresponds to the amino acid sequence <SEQ ID 1206; ORF 288.ng>: 

g288.pep 

1 MHTGQAVSRV LSRTVIPLGI PLPVCSSNLP ERS AGSVIAF CLVLLRMGFG 

51 LPHIVTKCAV RPYRTFSPLP VLPKQPSAVL LSVPLSVALP RPAVNRHSTL 

101 RSPDFPPRMP YAIRGDCLPV PCAARIITRN AKMPSETVQV SDGIQPKLHT 

151 LFQAGFDEAV QVAVQYGFVV ADFVACTQVF DA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1207>: 

GACAGGCGGT AAGCCGGGTT CTGTCTCGGA CAGTCATTCC 
CCGTTACCGG TATGCTCAAG CAACCTACCC GAACGCTCGG 
CATTGCGTTC TGTTTGGTCT TGCTCCGAAT GGGGTTTGGC 
TTGTTACCAA ATGCGCGGTG CGCCCTTACC GCACCTTTTC 
GTGCTGCCAA AGCAGCCATC GGCGGTTTTG CTTTCTGTTC 
CGCGTTACCG CGCCCGGCCG TTAACCGGCA TTCTACCCTG 
ACTTTCCTCC CCGTATGCCT TACGCGATAC GCGGCGACTG 
CCGTGTGCGG CGCGGATTAT AACACGAAAC ACAAAAATGC 
GGTACAGGTT TCAGACGGCA TACAGCCTAA ACTACACGCC 



This corresponds to the amino acid sequence <SEQ ID 1208; ORF 288>: 

m288 .pep 

1 MHTGQAVSRV LSRTVIPLGI PLPVCSSNLP ERS AGSVIAF CLVLLRMGFG 

51 LPHIVTKCAV RPYRTFSPLP VLPKQPSAVL LSVPLSVALP RPAVNRHSTL 

101 RSPDFPPRMP YAIRGDCLPV PCAARIITRN TKMPSETVQV SDGIQPKLHA 

151 LFQAGFDEAV QVAIQYGFGV ADFVACTQVF DT* 



m288.seq 




1 


ATGCACACCG 


51 


TCTAGGCATA 


101 


CGGGCAGCGT 


151 


CTGCCGCATA 


201 


ACCCTTACCT 


251 


CACTTTCCGT 


301 


CGGAGCCCGG 


351 


TCTGCCCGTC 


401 


CGTCTGAAAC 


451 


CTGTTTCAGG 


501 


CTTTGGTGTT 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 



PO7US99/09346 



m288/g288 97.8% identity in 181 aa overlap 



10 20 30 40 50 60 

m288 . pep MHTGQAVSRVLSRTVIPLGIPLPVCS SNLPERSAGSVIAFCLVLLRMGFGLPHIVTKCAV 

I I It I II I M I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I 
g288 MHTGQAVSRVLSRTVIPLGIPLPVCSSNLPERSAGSVIAFCLVLLRMGFGLPHIVTKCAV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m288 . pep RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFPPRMPYAIRGDCLPV 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II II I I I I I I I II I I I I I 
g288 RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFPPRMPYAIRGDCLPV 

70 80 90 100 110 120 

130 140 150 160 170 180 

m288 . pep PCAARI I TRN T KM P S ET VQ V S DG I QPKLH AL FQ AG F DEAVQV A I Q Y G FG V AD FVACT QV F 

I I I I I I I I I I : I I I ! I ! II I I I I I I I I I I : I I I I I I I I I I II I : I I I I I II I I I I I I I I 
g288 PCAARI ITRNAKMPSETVQVSDGIQPKLHTLFQAGFDEAVQVAVQYGFVVADFVACTQVF 

13C 140 150 160 170 180 



m288.pep DTX 
I : I 

g288 DAX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1209>: 

a288. seq 

1 ATGCACACCG GACAGGCGGT AAGCCGGGTT CTGTCTCGGA "AGTCATTCC 

51 TCTAGGCATA CCGTTGCCGG TATGCTCAAG CAACCTACCC GAACGCTCGG 

101 CGGGCAGCGT CATTGCGTTC TGTTTGGTCT TGCTCCGAAT GGGGTTTGGC 

151 CTGCCGCATA TTGTTACCAA ATGCGCGGTG CGCCCTTACC GCACCTTTTC 

201 ACCCTTGCCT GTGCTGCCAA AGCAGCCATC GGCGGTTTTG CTTTCTGTTC 

251 CACTTTCCGT CGCGTTACCG CGCCCGGCCG TTAACCGGCA TTCTACCCTG 

201 CGGAGCCCGG ACTTTCCTCC CCGTATGCCT TACGCGATAC GCGGCGACTG 

351 TCTGCCCGTC CCGTGTGCGG CGCGGATTAT AACACGAAAC GCAAAAATGC 

401 CGTCTGAAAC GGTACAGGTT TCAGACGGCA TACAGCCTAA ACTACACGCC 

451 CTGTTTCAGG CTGGCTTCGA TAAAGCCGTC CAAGTCGCCG TCCAATACGG 

501 CTTTGGTGTT GCCGACTTCG TAGCCTGTGC GCAAGTCTTT AATGCGTGA 

This corresponds to the amino acid sequence <SEQ ID 1210; ORF 288. a>: 
a288.pep 

1 MHTGQAVSRV LSRTVIPLGI PLPVCSSNLP ERSA GSVIAF CLVLLRMGFG 

51 LPHIVTKCAV RPYRTFSPLP VLPKOPSAVL LSVPLSVALP RPAVNRHSTL 

101 RSPDFPPRMP YAIRGDCLPV PCAARI I TRN AKMPSETVQV SDGIQPKLHA 

151 LFQAGFDKAV QVAVQYGFGV ADFVACAQVF NA* 



n\288/a288 97.2% identity in 181 aa overlap 

1C 20 30 40 50 60 

m288.pep MHTGQAVSRVLSRTVIPLGIPLPVCSSNLPERSAGSVIAFCLVLLRMGFGLPHIVTKCAV 
I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I t I I I I 1 I : I li II I I I I 
a288 MHTGQAVSRVLSRTVIPLGIPLPVCSSNLPERSAGSVIAFCLVLLRMGFGLPHIVTKCAV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m2 88 . pep RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFFPRMPYAIRGDCLPV 
I I I I M I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I || 
a288 RPYRTFSPLPVLPKQPSAVLLSVPLSVALPRPAVNRHSTLRSPDFPPRMPYAIRGDCLPV 

70 80 90 100 110 120 



m288.pep 



13G 140 150 160 170 180 

PCAARIITRNTKMPSETVQVSDGIQPKLHALFQAGFDEAVQVAIQYGFGVADFVACTQVF 
I I I I I I I I I I : I I I ; I I I I I I I I I I I i I I I I I I I t I I : I I I I I : I I I I I I : I I I I I : I I I 



WO 99/57280 



PCT/US99/09346 



676 



a288 PCAARIITRNAKMPSETVQVSDGIQPKLHALFQAGFDKAVQVAVQYGFGVADFVACAQVF 
130 140 150 160 170 100 



m288.pep DTX 
a288 NAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121 1>: 

g290.seq 

1 atggcaaaaa tgatgaaatg ggcggctgtt gcggcggtcg 

51 ggtttggggc ggatggtctt atctgaagcc cgaaccgcag 

101 ttacggaagc ggtcaggcgc ggcgatatca gccggacggt 

151 ggcgagattt cgccgtccaa cctggtatcg gtcggcgcgc 

201 gcagattaaa aagctttatg tcaaactcgg gcaacaggtc 

251 atttgattgc ggaaatcaat tcgaccacgc agaccaacac 

301 gaaaaatcca aattggaaac gtatcaggcg aagctggtgt 

351 tgcattgggc agcgcggaaa aaaaatataa gcgtcaggcg 

4 01 aggatgatgc gacctctaaa gaagatttgg aaagcgcgca 

451 gccgccgcca aagccaatgt: tgccgagttg aaggctttaa 

501 caaaatttcc atcaataccg ccgagtcgga tttgggctac 

551 ccgcgacgat ggacggcacg gtggtggcga ttcccgtgga 

601 actgtgaacg cggcgcagtc tacgccgacg attgtccaat 

651 ggatatgatg ttgaacaaaa tgcagattgc cgagggcgat 

701 tgaaggcggg gcaggatatt tcgtttacga ttttgtccga 

751 ccgattaagg cgaagctcga cagcgtcgac cccgggctga 

801 gtcgggcggc tacaacagca gtacggatac ggcttccaat 

851 attatgcccg ttcgtttgtg ccgaatccgg acggcaaact 

901 atgacgacgc agaatacggt tgaaatcgac ggtgtgaaaa 

951 tattccgtcg ctgaccgtga aaaatcgcgg cggcaaggcg 

1001 tgttgggtgc ggacggcaag gcagtggaac gcgaaatccg 

1051 aaagacagta tgaataccga agtgaaaagc gggttgaaag 

1101 agtggtcatc tccgaaataa ccgccgccga gcagcaggaa 

1151 gcgccctagg cggcccgccg cgccgataa 

This corresponds to the amino acid sequence <SEQ ID 1212; ORF 290.ng>: 

g2 90.pep 

1 MAKMMKWAAV AAVAAA AVWG GWSYLKPEPQ AAYITEAVRR GDISRTVSAT 

51 GEISPSNLVS VGAQASGQIK KLYVKLGQQV KKGDLIAEIN STTQTNTIDM 

101 EKSKLETYQA KLVSAQIALG SAEKKYKRQA ALWKDDATSK EDLESAQDAL 

151 AAAKANVAEL KALIRQSKIS INTAESDLGY TRITATMDGT WAIPVEEGQ 

201 TVNAAQSTPT IVQLANLDMM LNKMQIAEGD ITKVKAGQDI SFTILSEPDT 

251 PIKAKLDSVD PGLTTMSSGG YNSSTDTASN AVYYYARSFV PNPDGKLATG 

301 MTTQNTVE3D GVKNVLLIPS ITVKNRGGKA FVRVLGADGK AVEREIRTGM 

351 KDSMNTEVKS GLKEGDKWI SEITAAEQQE SGERALGGPP RR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 121 3>: 

m290.seq (partial) 

1 . . GTATCGGTCG GCGCGCAGGC ATCGGGGCAG ATTAAGATAC TTTATGTCAA 

51 ACTCGGGCAA CAGGTTAAAA AGGGCGATTT GATTGCGGAA ATCAATTCGA 

101 CCTCGCAGAC CAATACGCTC AATACGGAAA AATCCAAGTT GGAAACGTAT 

151 CAGGCGAAGC TGGTGTCGGC ACAGATTGCA TTGGGCAGCG CGGAGAAGAA 

201 ATATAAGCGT CAGGCGGCGT TATGGAAGGA AAACGCGACT TCCAAAGAGG 

251 ATTTGGAAAG CGCGCAGGAT GCGTTTGCCG CCGCCAAAGC CAATGTTGCC 

301 GAGCTGAAGG CTTTAATCAG ACAGAGCAAA ATTTCCATCA ATACCGCCGA 

351 GTCGGAATTG GGCTACACGC GCATTACCGC AACGATGGAC GGCACGGTGG 

4 01 TGGCGATTCT CGTGGAAGAG GGGCAGACTG TGAACGCGGC GCAGTCTACG 

4 51 CCGACGATTG TCCAATTGGC GAATCTGGAT ATGATGTTGA ACAAAATGCA 

501 GATTGCCGAG GGCGATATTA CCAAGGTGAA GGCGGGGCAG GATATTTCGT 

551 TTACGATTTT GTCCGAACCG GATACGCCGA TTAAGGCGAA GCTCGACAGC 

601 GTCGACCCCG GGCTGACCAC GATGTCGTCG GGCGGTTACA ACAGCAGTAC 

651 GGATACGGCT TCCAATGCGG TCTACTATTA TGCCC3TTCG TTTGTGCCGA 



cggcggcagc 
gctqcttata 
ttccgcgacg 
aggcttcggg 
aaaaagggcg 
gatcgatatg 
ccgcacagat 
gcgttgtgga 
ggatgcgett 
tcagacagag 
acgcgcatta 
agaggggcag 
tggcgaatct 
attaccaagg 
accggatacg 
ccacgatgtc 
ccggtctatt 
cgccacgggg 
atgtgttgct 
ttcgtacgcg 
gaccggtatg 
agggggacaa 
agcggcgaac 
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701 
751 
801 
851 
901 
951 
1001 



ATCCGGACGG 
ATCGACGGCG 
TCGCGGCGGC 
CGGAACGCGA 
AAAAGCGGGT 
CGCCGAGCAA 
GATAA 



CAAACTCGCC 
TGAAAAATGT 
AAGGCGTTTG 
AATCCGGACC 
TGAAAGAGGG 
CAGGAAAGCG 



677 



ACGGGGATGA 
GCTGATTATT 
TGCGCGTGTT 
GGTATGAGAG 
GGACAAAGTG 
GCGAACGCGC 



CGACGCAGAA 
CCGTCGCTGA 
GGGTGCGGAC 
ACAGTATGAA 
GTCATCTCCG 
CCTAGGCGGC 



TACGGTTGAA 
CCGTGAAAAA 
GGCAAGGCGG 
TACCGAAGTA 
AAATAACCGC 
CCGCCGCGCC 



This corresponds to the amino acid sequence <SEQ ID 1214; ORP 290>: 

m290.pep (partial) 

1 . . VSVGAQASGQ IKILYVKLGQ QVKKGDLIAE INSTSQTNTL NTEKSKLETY 

51 QAKLVSAQIA LGSAEKKYKR QAALWKENAT SKEDLESAQD AFAAAKANVA 

101 ELKALIRQSK ISINTAESEL GYTRITATMD GTVVAILVEE GQTVNAAQST 

151 PTIVQLANLD MMLNKMQIAE GDITKVKAGQ DISFTILSEP DTPIKAKLDS 

201 VDPGLTTMSS GGYNSSTDTA SNAVYYYARS FVPNPDGKLA TGMTTQNTVE 

251 IDGVKNVLII PSLTVKNRGG KAFVRVLGAD GKAAEREIRT GMRDSMNTEV 

301 KSGLKEGDKV VISEITAAEQ QESGERALGG PPRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m290/g290 96.1% identity in 334 aa overlap 

10 20 30 

m290 . pep VSVGAQASGQ I KILYVKLGQQVKKGDL I AE 

I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
g290 PQAAYITEAVRRGDISRTVSATGEISPSNLVSVGAQASGQIKKLYVKLGQQVKKGDLIAE 
30 40 50 60 70 80 

40 50 60 70 80 90 

m290 . pep IN ST SQTNTLNTEKSKLETYQAKLVSAQ I ALGSAEKKYKRQAALWKEN AT SKEDLESAQD 

I I I I : I I I I : : I I I I I I I I I I I I I II I I I I I I I I I I I I I ! I I I I I : : I I I I I I I I I I I I 
g290 INSTTQTNTIDMEKS KLET YQ AKL V S AQ I A LG S AEKKYKRQ AALWKD DAT S KE D LE S AQD 

90 100 110 120 130 140 

100 110 120 130 140 150 

m290.pep AFAAAKANVAELKALIRQSKISINTAESELGYTRITATMDGTWAI LVEEGQTVNAAQST 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I : I I 11 I II I I I I I I I I I I I I I II I I I I I I I I 
g290 ALAAAKANVAELKALIRQSKISINTAESDLGYTRITATMDGTWAIPVEEGQTVNAAQST 
150 160 170 180 190 200 

160 170 180 190 200 210 

m290.pep PTIVQLANLDMMLNKMQIAEGDITKVKAGQDISFTILSEPDTPIKAKLDSVDPGLTTMSS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 1 I I I I I II I I I I I I II I I I I I I I I 
g290 PTIVQIANIiDMMIiNKMQIAEGDITKVKAGQDISFTILSEPDTPIKAKLDSVDPGLTTMSS 
210 220 230 240 250 260 

220 230 240 250 260 270 

m290 . pep GGYNSSTDTASNAVYYYARSFVPNPDGKLATGMTTQNTVEIDGVKNVLII PSLTVKNRGG 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I : I I I I I I I 1 I I I 
g290 GG YN S STDTASNAV Y YYARSFVPN PDGKLATGMTTQNT VE I DGVKN VLLI PSLTVKNRGG 

270 280 290 300 310 320 

280 290 300 310 320 330 

m290 . pep KAFVRVLGADGKAAEREIRT3MRDSMNTEVKSGLKEGDKWISEITAAEQQESGERALGG 
I I I I I I I I I I I I I : I I II I I I I : I II I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
g290 KAFVRVLGADGKAVEREIRTSMKDSMNTEVKSGLKEGDKVVISEITAAEQQESGERALGG 
330 340 350 360 370 380 



ra290.pep PPRRX 
Mill 

g290 PPRRX 
390 
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The following partial DNA sequence was 

a290.seq 

1 ATGGCAAAAA TGATGAAATG GGCGGCTGTT 

51 GGTTTGGGGC GGATGGTCTT ATCTGAAGCC 

101 TTACGGAAAC GGTCAGGCGC GGCGACATCA 

151 GGGGAGATTT CGCCGTCCAA CCTGGTATCG 

201 GCAGATTAAG AAACTTTATG TCAAACTCGG 

251 ATTTGATTGC GGAAATCAAT TCGACCTCGC 

301 GAAAAATCCA AATTGGAAAC GTATCAGGCG 

351 TGCATTGGGC AGCGCGGAGA AGAAATATAA 

4 01 AGGATGATGC GACCGCTAAA GAAGATTTGG 

4 51 GCCGCCGCCA AAGCCAATGT TGCCGAGCTG 

5C1 CAAAATTTCC ATCAATACCG CCGAGTCGGA 

551 CCGCAACGAT GGACGGCACG GTGGTGGCGA 

6C1 ACTGTGAACG CGGCGCAGTC TACGCCGACG 

651 GGATATGATG TTGAACAAAA TGCAGATTGC 

7 01 TGAAGGCGGG GCAGGATATT TCGTTTACGA 

7 51 CCGATTAAGG CGAAGCTCGA CAGCGTCGAC 

801 GTCGGGCGGC TACAACAGCA GTACGGATAC 

851 ATTATGCCCG TTCGTTTGTG CCGAATCCGG 

901 ATGACGACGC AGAATACGGT TGAAATCGAC 

951 TATTCCGTCG CTGACCGTGA AAAATCGCGG 

1001 TGTTGGGTGC AGACGGCAAG GCGGCGGAAC 

1051 AGAGACAGTA TGAATACCGA AGTAAAAAGC 

1101 AGTGGTCATC TCCGAAATAA CCGCCGCCGA 

1151 GCGCCCT AGG CGGCCCGCCG CGCCGATAA 



identified in N. meningitidis <SEQ ID 121 5>: 



GCGGCGGTCG 
CGAGCCGCAG 
GCCGGACGGT 
GTCGGCGCGC 
GCAACAGGTT 
AGACCAATAC 
AAGCTGGTGT 
GCGTCAGGCG 
AAAGCGCACA 
AAGGCTCTAA 
ATTGGGCTAC 
TTCTCGTGGA 
ATTGTCCAAT 
CGAGGGCGAT 
TTTTGTCCGA 
CCCGGGC7GA 
GGCTTCCAAT 
ACGGCAAACT 
GGTGTGAAAA 
CGGCAGGGCG 
GCGAAATCCG 
GGGTTGAAAG 
GCAGCAGGAA 



CGGCGGCAGC 
GCTGCTTATA 
TTCTGCAACA 
A3GCATCGGG 
AAAAAGGGCG 
GCTCAATACG 
CGGCACAGAT 
GCGTTGTGGA 
GGATGCGCTT 
TCAGACAGAG 
ACGCGCATTA 
AGAGGGGCAG 
TGGCGAATCT 
ATTACCAAGG 
ACCGGATACG 
CCACGATGTC 
GCGGTCTACT 
CGCCACGGGG 
ATGTGCTGAT 
TTTGTGCGCG 
GACCGGTATG 
AGGGGGACAA 
AGCGGCGAAC 



This corresponds to the amino acid sequence <SEQ ID 1216; ORF 290.a>: 
a290.pep 

1 MAKMMKWAAV AAVAAA AVWG GWSYLKPEPQ AAYITETVRR GDISRTVSAT 

51 GEISPSNLVS VGAQASGQIK KLYVKLGQQV KKGDLIAEIN STSQTNTLNT 

101 EKSKLETYQA KLVSAQIALG SAEKKYKRQA ALWKDDATAK EDLESAQDAL 

151 AAAKANVAEL KALIRQSKIS INTAESELGY TRITATMDGT VVAI LVEEGQ 

201 TVNAAQSTPT IVQLANLDMM LNKMQIAEGD ITKVKAGQDI SFTILSEPDT 

251 PIKAKLDSVD PGLTTMSSGG YNSSTDTASN AVYYYARSFV PNPDGKLATG 

301 MTTQNTVEID GVKNVLIIPS LTVKNRGGRA FVRVLGADGK AAEREIRTGM 

351 RDSMNTEVKS GLKEGDKWI SEITAAEQQE SGERALGGPP RR* 



m290/a290 



98.2% identity in 334 aa overlap 



10 20 30 

IU290 . pep VSVGAQASGQIKILYVKLGQQVKKGDLIAE 

I I I I I I I I II I I I I I I II I I I I I I 1 I I I I 
a290 PQAAYITETVRRGDISRTVSATGEISPSNLVSVGAQASGQIKKLYVKLGQQVKKGDLIAE 
30 40 50 60 70 80 

40 50 60 70 80 90 

m2 90 . pep INSTSQTNTLNTEKSKLETYQAKLVSAQIALGSAEKKYKRQAALWKENATSKEDLESAQD 

I I I I I 1 1 I I I I I I I I I I I I II I 1 1 I I I I I II I II I I I 1 1 I I II I I I :: II: 1 1 I I I I II I 
a290 INSTSQTNTLNTEKSK LET YQAKLV S AQI ALG S AEKKY KRQ AA LW K DD AT AKE DLE S AQ D 

90 100 110 120 130 140 

100 110 120 130 140 150 

m290.pep AFAAAKANVAELKALIRQSKISINTAESELGYTRITATMDGTWAILVEEGQTVNAAQST 
I : I II I I I I I I I I I I I I I I II II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I | I 
a290 ALAAAKANVAELKALIRQSKISINTAESELGYTRITATMDGTWAILVEEGQTVNAAQST 
150 160 170 -v 180 190 200 

160 170 180 190 200 210 

m290 . pep PTIVQLANLDMMLNKMQIAEGDITKVKAGQDISFTILSEPDTPIKAKLDSVDPGLTTMSS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | | | | | 

a290 ptivqlanldmmlnkmoiaegditkvkagqdisftilsepdtpikakldsvdpglt™ss 

210 220 230 240 250 260 



220 



230 



240 



250 



260 



270 
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m290.pep 



GGYNSSTDTASNAVYYYARSFVPNPDGKLATGMTTQNTVEIDGVKNVLIIPSLTVKNRGG 
I I M I I I I I I I I I I I I I I I i I I I I I I I I I I II I I I I I I I II I ! I I I I I I I I I M M I II I 
GGYNSSTDTASNAVYYYARSFVPNPDGKLATGMTTQNTVEIDGVKN\'LIIPSLTVKNRGG 
270 280 290 300 310 320 



a290 



m290.pep 



a290 



280 290 300 310 320 330 

KAFVRVLGADGKAAEREIRTGMRDSMNTEVKSGLKEGDKWISEITAAEQQESGERALGG 
: I I I I I I I I I I I I I I I I I I . I I II 1 I I I I I I I I I I I I I I II I I II ! I I I I I I I I I I I | I | 
RAFVRVLGADGKAAEREIRTGMRDSMNTEVKSGLKEGDKWrSEITAAEQQESGERALGG 
330 340 350 360 370 380 



m290.pep 



PPRRX 
I II I I 
PPRRX 
390 



a290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1217>: 

g292 .seq 



1 atgaaaacca agttaatcaa aatcttgacc ccctttaccg tcctgccgct 

51 gctggcttgc gggcaaacgc ccgtttccaa tgccaacgcc gaatccgccg 

101 tcaaagccga atccgccggc aaatccgttg ccgcttcttt gaaagcgcgt 

151 ttggaaaaaa cctattccgc ccaagatttg aaagtgttga gcgtcagcga 

201 aacaccggtc aaaggcattt acgaagtcgt cgtcagcggc aggcagatta 

251 tctacaccga tgccgaaggc ggctatatgt tcgtcggcga actcatcaac 

301 atcgacacgc gcaaaaacct gaccgaagaa cgcgccgccg atttgaacaa 

351 aatcgacttc gcctccctgc ctttggacaa agccatcaaa gaagtacgcg 

401 gcaacggcaa gctgaaagtc gccgtcttct ccgaccccga ttgtccgttc 

451 tgcaaacgct tggaacatga gtttgaaaaa atgaccgacg tgacggttta 

501 cagctttatg atgcccattg ccggcctgca cccagatgcc gcgcgcaagg 

551 cgcaaatctt atggtgtcag cccgaccgtg ccaaagcgtg gacggattgg 

601 atgcgtaaag gcaaattccc ggtcggcggc agcatctgcg acaatcccgt 

651 cgcggaaacc acttccttgg gcgaacagtt cggcttcaac ggcacgccga 

701 cccttcgtct tccccaacgg gcgcacccaa agcggttaca gcccgatgcc 

751 ccaactggag gaaatcatcc gcaaaaacca gcagtaaacc cgcaatga 



This corresponds to the amino acid sequence <SEQ ID 1218; ORF 292.ng>: 

g292.pep 



1 MKTKLIKILT PFTVLPLLA C GQTPVSNANA ESAVKAESAG KSVAASLKAR 

51 LEKTYSAQDL KVLSVSETPV KGIYEWVSG RQIIYTDAEG GYMFVGELIN 

101 I DTRKNLTEE RAADLNKIDF ASLPLDKAIK EVRGNGKLKV AVFSDPDCPF 

151 CKRLEHEFEK MTDVTVYSFM MPIAGLHPDA ARKAQILWCQ PDRAKAWTDW 

201 MRKGKFPVGG SICDNPVAET TSLGEQFGFN GTPTLRLPQR AHPKRLQPDA 

251 PTGGNHPQKP AVNPQ* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1219>: 

m292.seq 



1 ATGAAAACCA AGTTAATCAA AATCTTGACC CCCTTTACCG TCCTCCCGCT 

51 GCTGGCTTGC GGGCAAACGC CCGTTTCCAA TGCCAACGCC GAACCCGCCG 

101 TCAAAGCCGA GTCCGCCGGC AAATCCGTTG CCGCCTCTTT GAAAGCGCGT 

151 TTGGAAAAAA CCTATTCCGC CCAAGATTTG AAAGTGTTGA GCGTCAGCGA 

201 AACACCGGTC AAAGGCATTT ACGAAGTCGT CGTCAGCGGC AGGCAGATTA 

251 TCTACACCGA TGCCGAAGGC GGCTATATGT TCGTCGGCGA ACTCATCAAC 

301 ATCGACACGC GCAAAAACCT GACCGAAGAA CGCGCCGCCG ATTTGAACAA 

351 AATCGACTTC GCCTCCCTGC CTTTGGACAA AGCCATCAAA GAAGTGCGCG 

401 GCAACGGCAA GCTGAAAGTC GCCGTCTTCT CCGACCCCGA TTGTCCGTTC 

451 TGCAAACGCT TGGAACACGA GTTTGAAAAA ATGACCGACG TGACGGTTTA 

501 CAGCTTTATG ATGCCCATTG CCGGCCTGCA CCCCGATGCC GCGCGCAAGG 

551 CGCAAATCTT ATGGTGTCAG CCCGACCGCG CCAAAGCGTG GACGGATTGG 

601 ATGCGTAAAG GCAAATTCCC GGTCGGCGGC AGCATCTGCG ACAATCCCGT 

651 CGCGGAAACC ACTTCCTTGG GCGAACAATT CGGCTTCAAC GGCACGCCGA 

701 CCCTCGTCTT CCCCAACGGG CGCAGCCAAA GCGGCTACAG CCCGATGCCC 
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751 CAACTGGAGG AAATCATCCG CAAAAATCAA TAA 



This corresponds to the amino acid sequence <SEQ ID 1220; ORF 292>: 

ra2 92.pep 

1 MKTKLIKILT PFTVLPLLA C GQTPVSNANA EPAVKAESAG KSVAASLKAR 

51 LEKTYSAQDL KVLSVSETPV KGIYEVWSG RQIIYTDAEG GYMFVGELIN 

101 IDTRKNLTEE RAADLNKIDF ASLPLDKAIK EVRGNGKLKV AVFSDPDCPF 

151 CKRLEHEFEK MTDVTVYSFM MPIAGLHPDA ARKAQILWCQ PDRAKAWTDW 

201 MRKGKFPVGG SICDNPVAET TSLGEQFGFN GTPTLVFPNG RSQSGYSPMP 

251 QLEEIIRKNQ * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m292/g292 98.7% identity in 238 aa overlap 



10 20 30 40 50 60 

m292 . pep MKTKLIKILTPFTVLPLLACGQTPVSNANAEPAVKAESAGKSVAASLKAR LEKTYSAQDL 

I I I I I I I 1 M I I I II I I I I - I I I I f I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I 
g292 MKTKLIKILTPFTVLPLLACGQTPVSNANAESAVKAESAGKSVAASLKAR LEKTYSAQDL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 92 .pep KVLSVSETPVKGIYEVWSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 
I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I II I I II I I I I I I 1 I I I I I I || I I ! I 
g292 KVLSVSETPVKGIYEWVSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 

70 80 90 100 110 120 



130 140 150 160 170 180 

m292.pep A S L P L DKA I KE VRGN GKLK V AV FS D PDC P FCKRLEH E FE KMT DVT VY S FMM P I AG LH P DA 

I I I I I I I I II I I I II II I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I II I I I ! I 
g292 ASLPLDKAIKEVRGNGKLKVAVFSDPDCPFCKRLEHEFEKMTDVTVYSFMMPIAGLHPDA 

130 140 150 160 170 180 



190 200 210 220 230 240 

m292.pep ARKAQILWCQPDRAKAWTDWMRKGKFPVGGSICDNPVAETTSLGEQFGFNGTPTLVFPWG 

I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I ! I I I I I I I I I I I I I : I : 
g292 ARKAQILWCQPDRAKAWTDWMRKGKFPVGGSICDNPVAETTSLGEQFGFNGTPTLRLPQR 

190 200 210 220 230 240 



250 ■ 260 

m292 .pep RSQSGYSPMPQLEEIIRKNQX 

g2 92 AH PKRLQPDAPTGGNHPQKPAVN PQX 

250 260 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1221>: 

a292.seq 

1 ATGAAAACCA AGTTAATCAA AATCTTGACC CCCTTTACCG TCCTCCCGCT 

51 GCTGGCTTGC GGG CAAACGC CCGTTTCCAA TGCCAACGCC GAACCCGCCG 

101 TCAAAGCCGA GTCCGCCGGC AAATCCGTTG CCGCCTCTTT GAAAGCGCGT 

151 TTGGAAAAAA CCTATTCCGC CCAAGATTTG AAAGTGTTGA GCGTCAGCGA 

201 AACACCGGTC AAAGGCATTT ACGAAGTCGT CGTCAGCGGC AGGCAGATTA 

251 TCTACACCGA TGCCGAAGGC GGCTATATGT TCGTCGGCGA ACTCATCAAC 

301 ATCGACACGC GCAAAAACCT GACCGAAGAA CGCGCCGCCG ATTTGAACAA 

351 AATCGACTTC GCCTCCCTGC CTTTGGACAA AGC CATC AAA GAAGTGCGCG 

401 GCAACGGCAA GCTGAAAGTC GCCGTCTTCT CCGACCCCGA CTGTCCGTTC 

451 TGCAAACGCT TGGAACACGA GTTTGAAAAA ATGACCGACG TGACGGTTTA 

501 CAGCTTTATG ATGCCCATTG CCGGCCTGCA CCCCGATGCC GCGCGCAAGG 

551 CGCAAATCTT ATGGTGTCAG CCCGACCGCG CCAAAGCGTG GACGGATTGG 

601 ATGCGTAAAG GCAAATTCCC GGTCGGCGGC AGCATCTGCG ACAATCCCGT 

651 CGCGGAAACC ACTTCCTTGG GCGAACAATT CGGCTTCAAC GGCACGCCGA 

701 CCCTCGTCTT CCCCAACGGG CGCAGCCAAA GCGGCTACAG CCCGATGCCC 

751 CAACTGGAGG AAATCATCCG CAAAAATCAA TAA 
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This corresponds to the amino acid sequence <SEQ ID 1222; ORF 292.a>: 

a292 .pep 

1 MKTKLIKILT PFTVLPLLA C GQTPVSNANA EPAVKAESAG KSVAASLKAR 

51 IjEKTYSAQDL KVLSVSETPV KGIYEVWSG RQIIYTDAEG GYMFVGELIN 

101 IDTRKNLTEE RAADLNKIDF ASLPLDKAIK EVRGNGKLKV AVFSDPDCPF 

151 CKRLEHEFEK MTDVTVYSFM MPIAGLHPDA ARKAQILWCQ PDRAKAWTDW 

201 MRKGKFPVGG SICDNPVAET TSLGEQFGFN GTPTLVFPNG RSQSGYSPMP 

251 QLEEIIRKNQ * 

m292/a292 100.0% identity in 260 aa overlap 

10 20 30 40 50 60 

m292.pep MKTKLIKILTPFTVLPLLACGQTPVSNANAEPAVKAESAGKSVAASLKARLEKTYSAQDL 
I I II I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I i I II I I I I I I I I I I I I I 1 I I I I I 
a 292 MKTKLIKTLTPFTVLPLLACGQTPVSNANAEPAVKAESAGKSVAASLKARLEKTYSAQDL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m292.pep KVLSVSETPVKGIYEVVVSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 
I I i I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I II I I I M I I ! I I ! I 
a292 KVLSVSETPVKGIYEWVSGRQIIYTDAEGGYMFVGELINIDTRKNLTEERAADLNKIDF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 92 . pep ASLPLDKAIKEVRGNGKLK\'AVFSDPDCPFCKRLEHEFEKM?DVTVYSFMMPIAGLHPDA 

I I I I I I I I I I M I I II I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I II M I 
a292 ASLPLDKAIKEVRGNGKLK\ r AVFSDPDCPFCKRLEHEFEKM7DVTVYSFMMPIAGLHPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

itl292.pep ARKAQILWCQPDRAKAWTDV/MRKGKFPVGGSICDNPVAETTSLGEQFG FN GTPTLVFPNG 

1 1 1 1 i 1 1 1 1 1 1 1 1 i i r 1 1 1 1 1 f 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 ii 1 1 

a292 ARKAQILWCQPDRAKAWTDWMRKGKFPVGGSICDNPVAETTSLGEQFGFMGTPTLVFPNG 
190 200 210 220 230 240 

250 260 
m292 . pep RSQSGYSPMPQLEE I I RKNQX 

II I I I I I I I I I I I Ml I t I II 
a 2 9 2 RS OSG YS PMPQLEE 1 1 RKNQX 

250 260 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1223>: 

g294.seq (partial) 

1 atccgtatta cctgtgcgcc gatgtcgctt ttgtcggcgg cagtctggtc 

51 ggttcgggct gtcagaacat catcgaaccg ctttcctgcg gcgttacgac 

101 gatattcggc ttttcgacct acaatttttc cgaagcctgc cggcacgcct 

151 tgccatcggg tgcggcggtt caagtcgaat cggcggacgc gtggcgtgaa 

201 gccgttgaaa aaaccttatc tggcgagggg ggcggaatgc agatgcaggc 

251 gcccgtggac ggctttatcg cacaacatcg cggagcgggc gcgagaatcg 

301 cccaggcggt gcgggaagcg gtatgcggac atcgggggcg atagtgatac 

351 aatccgtatc cgagttttcc ggttggagca tcgtatgagt atttatgccg 

401 tcgcgcacat catccacctg tattgcgcca ccgcctttgt cggcggcgtg 

451 ttttttgaag tgctggtttt gtccgtcctg catacgggac gggtgtcgcg 

501 cgeggcgcgg cgcgaagtgg aaaaggcaat gtcttaccgc gccgtcaggg 

551 tgatgccgtt tgcggtcgga ctgctgttcg ccaggggaac tctagagtcg 

601 actgcagcag catgccctc... 

This corresponds to the amino acid sequence <SEQ ID 1224; ORF 294.ng>: 

g294.pep (partial) 

1 MRITCAPMSL LSAAV WSVRA VRTSSNRFPA ALRRYSAFRP TIFPKPAGTP 

51 WHRVRRFKSN RRTRGVKPLK KPYLARGAEC RCRRAWTALS HNIAERARES 

101 PRRCGKRYAD IGGDSDTIRI RVFRLEHRMS IYAVAHIIHL Y CATAFVCGV 

151 FFEVLVLS VL HTGRVSREAR REVEKAMSYR AVRVMPFAVG LLFARGT LES 

201 TAAACP 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1225>: 
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m294 .seq 
1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 



ATGCGTATTA 
GATTCGGGTT 
GATATTCGGC 
TGGCATCGGG 
GCCGTTGAAA 
GCGCGTGGAC 
CCC-AGGCGGT 
AATCCGTATC 
TCGCGCACAT 
TTTTTTGAAG 
CGAGGCGCGG 
TGATGCCGTT 
GCAAACCGCT 
TACGATGCTG 
TCGCCATCGC 
TCGAAATACA 
TTTGGCAAAA 



CCTGTGCGCC 
GTCAGAACAT 
TTTTCAACCT 
TGCGGCGGTT 
AAACCTTATC 
GGCTTTATCG 
GCGGGAAGCG 
CGAGTTTTCC 
CGTTCATCTG 
TGCTGGTTTT 
CGCGAAGTGG 
TGTGGTCGGA 
ATCTTTCTAT 
ACGCTGAAAA 
CGTCGTCAAA 
TACACGCCGT 
GCGATGTTTT 



GATGTCGCTT 
CATCGAACCG 
ACAATTTTTC 
CAAGTCGAAT 
GTCCGAGGGG 
CACAACATCG 
GTATGCGGAT 
GTTTGGAGCA 
TAT TGCGCTA 
GTCCGTCCTG 
AAAAGGCAAT 
CTGCTGTTCG 
ATTGGGCGAA 
TCCTGTTGGC 
ATGGCGCGTT 
CGTCTTTACC 
ATATCAGCTG 



TTGTCGGCGG 
CTTTCCTGCG 
CGAAGCCTGC 
CGGCGGATGC 
GGGGGGATGC 
CGGAGCGGGC 
ATCGGGGGCG 
TCGTATGAGT 
TTGCCTTTGT 
CATACGGGAC 
GTCTTACCGC 
CCAGCGGCAT 
CCGTTTGCCA 
GTTCAGCGTA 
CCACACTGAC 
CATATGcTGC 
GTAA 



CAGTCTGGTC 
GCGTTCCGAC 
CGACACGCCT 
GTGGCGGGAA 
AGATGCAGGC 
GCGAGAATCG 
ATAGTGATAC 
ATTTATGCCG 
CGGCGGCGTG 
GGGTGTCGCG 
GCCGTCAGGG 
CGTGATGGCG 
CTTCCTTCGG 
TTGGCGCACT 
GGTCGGTTGG 
TGATTGTCTT 



This corresponds to the amino acid sequence <SEQ ID 1226; ORF 294>: 

xo294.pep 

MRITCAPMSL LSAAV WSIRV VRTSSNRFPA AFRRYSAFQP TIFPKPADTP 
WHRVRRFKSN RRMRGGKPLK KPYRPRGGGC RCRRAWTALS HNIAERARES 
PRBCGKRYAD IGGDSDTIRI RVFRLEHRMS IYAVAHIVHL Y CAIAFVGGV 
FFEVLVLSVL HTGRVSREAR REVEKAMSYR AVRVMPFVVG LLFASGIVMA 



1 
51 
101 
151 
201 
251 



ANRYLSILGE PFATSFGTM L TLKILLAFSV LAHFAI AVVK MARSTLTVGW 
SKYIHAWFT HMLLIVFLAK AMFYISW* 



g294/m294 92.3% identity in 196 aa overlap 

10 20 30 40 50 60 

g294 . pep MRirCAPMSLLSAAVWSVRAVRTSSNRFPAALRRYSAFRPTIFPKPAGTPWHRVRRFKSN 
I | I I I I I I I I I I I I I I I : I : I I I I I I I I I I : I M I I I : I I I I I I I I I I I t M I I I I I I 
m294 MRITCAPMSLLSAAVWSIRVVRTSSNRFPAAFRRYSAFQPTIFPKPADTPWHRVRRFKSN 

10 20 3C 40 50 60 

70 80 9C 100 110 120 

g294 . pep RRTRGVKPLKKPYLARGAECRCRRAWTALSHNIAERARESPRRCGKRYADIGGDSDTIRl 
I I | i I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I 
in294 RRMRGGKPLKKPYRPRGGGCRCRRAWTALSHNIAERARESPRRCGKRYADIGGDSDTIRI 

70 80 9C 100 110 120 

130 140 15C 160 170 180 

g294 . pep RVFRLEHRMS I YAVAHI IHLYCATAFVGGVFFEVLVLSVLHTGRVSREARREVEKAMSYR 

I I I I I I t I I I I I II I I I : ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
m294 RVFRLSHRMSI YAVAHI VHLYCAIAFVGGVFFEVLVLSVLHTGRVSREARREVEKAMSYR 

130 140 15C 160 170 180 



190 200 
g294 . pep AVRVMPFAVGLLFARGTLESTAAACP 
I I I I I I I : i I II I I i 

Itl294 AVRVMPFVVGLLFASGIVMAANRYLSILGEPFATSFGTMLTLKI LLAFSVLAHFAIAWK 

190 200 210 220 230 240 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1227>: 

a294 . seq 

1 ATGCGTATTA CCTGTGCGCC GATGTCGCTT TTGTCGGCGG CAGTCTGGTC 

51 GATTCGGGCT GTCAGAACAT CATCGAACCG CTTTCCTGCG GCGTTCCGAC 

101 GATATTCGGC TTTTCGACCT ACAATTTTTC CGAAGCCTGC CGGCACGCCT 

151 TGGCATCGGG TGCGGCGGTT CAAGTCGAAT CGGCGGACGC GTGGCGGGAA 

201 GCCGTTGAAA AAAACTTATC GTCCGAGGAG GGCGGAATGC AGATGCAGGC 

2 51 GCGCGCGGAC GGCTTTATCG CACAACATCG CGGAGCGGGC GCGAGAATCG 

301 CCGAGGCGGT ACGGGAAGCG GTATGCGGAC ATCGGGGACG ATAGTGATAC 

351 AATCCGTATC CGAGTTTTCC GGTTGGAGTA CCGTATGAGT ATTTATGCCG 

401 TCGCGCACAT CGTCCACCTG TATTGCGCCA TCGCCTTTGT CGGCGGCGTG 

4 51 TTTTTTGAAG TGCTGGTTTT GTCCGTCCTG CATACGGGAC GGGTGTCGTG 

501 CGAGGCGCGG CGCGAAGTGG AAAAGGCAAT GTCTTACCGC GCCGTCAGGG 

551 TGATGCCGTT TGTGGTCGGA CTGCTGTTCG CCAGCGGCAT CGTGATGGCG 

601 GCAAACCGCT ATCTTTCTAT ATTGGGCGAA CCGTTTGCCA CTTCCTTCGG 

651 TACGATGCTG ACGCTGAAAA TCCTGTTGGC GTTCAGCGTG TTGGCGCACT 
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701 TCGCCATCGC CGTCGTCAAA ATGGCGCGTT CCACACTGAC CGTCGGCTGG 

751 TCGAAATACA TACACACCGT CGTCTTTACC CATATGCTGC TGATTGTCTT 

801 TTTGGCAAAA GCGATGTTTT ATATCAGCTG GTAA 

This corresponds to the amino acid sequence <SEQ ID 1228; ORF 294. a>: 

a294 .pep 

1 MRITCAPMSL LSAA VWSIRA VRTSSNRFPA AFRRYSAFRP TIFPKPAGTP 

51 WHRVRRFKSN RRTRGGKPLK KTYRPRRAEC RCRRARTALS HNIAERARES 

101 PRRYGKRYAD IGDDSDTIRT RVFRLEYRMS IYAVAH IVHL YC AIAFVGGV 

151 FFEVLVLSVL HTGRVSCEAR REVEKAMSYR AVR VMPFVVG LLFASGIVMA 

'201 ANRYLSILGE PFATSFGTML T LKILLAFSV LAHFAIAW K MARSTLTVGW 

251 SKYIHT WFT HMLLIVFLAK AMF YISW* 



m294/a294 94.9% identity in 277 aa overlap 

10 20 30 40 50 60 

m294 .pep MRITCAPMSLLSAAVWSIRVVRTSSNRFPAAFRRYSAFQPTIFPKPADTPWHRVRRFKSN 
I I I I i I I I I I I I I I I I I I I : I I I i 1 I I I i I I I I I I I I I t I I I I I I I I I I I I ! I I I 1 I I I 
a294 MRITCAPMSLLSAAVWS IRAVRTSSNRFPAAFRRYSAFRPTI FPKPAGTPWHRVRRFKSN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 9 4 . pep RRMRGGKPLKKPYRPRGGGCRCRRAWTALSHNIAERARESPRRCGKRYADIGGDSUTIRI 
I I MINIM I I I I : III III I I I I I I I I I I I I I I I I I 111 I I I t I I I ! I II I 
a294 RRTRGGKPLKKTYRPRRAECRCRRARTALSHNIAERARESPRRYGKRYADIGDDSDTIRI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m294 .pep RVFRLEHRMS IYAVAH I VHLYCAIAFVGGVFFEVLVLSVLHTGRVSREARREVEKAMSYR 

I I II I I : II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a2 94 RVFRLEYRMS I YAV AH I VHLYCAIAFVGGVFFEVLVLSVLHTGRVSCEAR REVEKAMSYR 

130 140 150 160 170 180 

190 20Q 210 220 230 240 

m294 .pep AVRVMP FWGLLFASG I VMAANRYLS ILGEPFATS FGTMLTLK I LLAFSVLAHFAIAWK 

I I I I M I I I II I I 1 I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I II I II I I I II I I II 
a 294 AVRVMP FWGLLFASG I VMAANRYLS ILGEPFATS FGTMLTLK I LLAFSVLAHFAIAWK 

190 200 210 220 230 240 

250 260 270 

m2 94 .pep MARSTLTVGWSKYIHAWFTHMLLIVFLAKAMFYISWX 
I I I I I I I I I I I I II I : II I I I I I I I I I I I I I I I I I I 1 I 
a294 MARSTLTVGW SKY I HTWFTHMLL I VFLAKAMFYISWX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1229>: 

g295.seq 

1 atgctcggga tggcgcggca cgacggccag cagggcatcg ccgcgatatt 

51 gttgccacgc cgccagcagt ttttccgcct cgtcttcgcc ccgataaacg 

101 cgcgtgctgc cgcacacggc aaccggccgg cctccgatgc gtttttcaaa 

151 ctgccccgcc agcgttttca tgtcttcaga cggcatcagg tcgtatttgg 

201 tattgccgca cacctgcacg gatgccgcgc ccaatttcgc caaccgcgcc 

251 gcatccgcct ccgtctgcgc cagacagccc gtcagcgaag cggctgcggg 

301 acggatcagg cggcggactt tcagataacc gttcagcgat ttttccgaca 

351 gccgcgcatt cgccaaaaac agcggcacac ccgctcgccg gcattccttc 

401 atcagattgg gccagatttc ggtttccatc aaaatgccga acatcgggcg 

451 gtgttcgcgc aaaaactgcc gtacccacgt ttttttgtca tacggaagat 

501 agcggcattg cgcatcggga aacagaactt gcgcggtttc ccgtcccgtc 

551 ggggtcatct gcgtcatcag cagcggcgca tcgggaaaac gccgccgcaa 

601 ctcgcgtatc aagggctggg cggcacgcgt ttctccgacc gaaacggcgt 

651 gtatccaaac cgcgccggta acgggattcg gatgcggctt gccgaaacgc 

701 tcgtccctat gcgcccggta tgccggggca cttccggagc gtttgtccaa 

751 ataacgccgt atccatatcg gcgcaagcag ccacaataca tcataaagcc 

801 atcggaacat ctttctattt cctgcaaaac aaatgccgtc cgaacggttc 

851 ggacggcatt tcggcaacgg aatcaaatat cgtag 



This corresponds to the amino acid sequence <SEQ ID 1230; ORF 295.ng>: 
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g295.pep 

1 MLGMARHDGQ QGIAAILLPR RQQFFRLVFA PINARAAAHG NRPASDAFFK 

51 LPRQRFHVFR RHQVVFGIAA HLHGCRAQFR QPRR1 RLRLR QTARQRSGCG 

101 TDQAADFQIT VQRFFRQPRI RQKQRHTRSP AFLHQIGPDF GFHQNAEHRA 

151 VFAQKLPYPR FFVIRKIAAL RIGKQNLRGF PSRRGHLRHQ QRRIGKTPPQ 

201 LAYQGLGGTR FSDRNGVYPN RAGNGTRMRL AETLVPMRPV CRGTSGAFVQ 

251 ITPYPYRRKQ PQYIIKPLEH LSISCKTNAV RTVRTAFRQR NQIS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1231: 

m295. aoq 

1 ATGCTCGGGA TGGCGCGGCA CGACGACCAG CAGCGCATCG CCGCGATATT 

51 GTTGCCACGC CGCCAGCAGT TTTTCCGCCT CGTCTTCACC CCGATAAACG 

101 CGCGTGCTCC CGCACACGGC AACCGGCCGG CCTCCGATGC GTTTTTCAAA 

151 CTGCCCCGCC AGCGTTTTCA TCTGTTCCGA CGGTATGATG TCGTATTTGG 

201 TATTGCCGCA CACCTGCACG GATGCCGCGC CCAATTTCGC CAACCGCGCC 

251 GCATCCGCCT CTGTCTGCGC CAGACACCCC GTCAGCGAAG CGGCGGCAGG 

301 ACGGATCAGG CGGCGGACTT TCAGATAACC GTTCAACGAT TTTTCCGACA 

351 GCCGCGCATT CGCCAAAAAC AGCGGCACAC CCGCGCGCCG GCATTCCCTC 

401 ATCAGGTTGG GCCAGATTTC GGTTTCCATC AAAATGCCGA ACATCGGGCG 

451 GTGTTCGCGC AAAAACTGCC GT AC CC AC GT TTTTTTGTCA TACGGAAGAT 

501 AGCGGCATTG CGCATCGGGA AACAGAACTT GCGCGGTTTC CCGCCCCGTC 

551 GGGGTCATCT GCGTCATCAG CAGCGGCGCA TCGGGAAAAC GCCGCCGCAA 

601 CTCGCGTATC AAGGACTGGG CGGCACGCGT TTCTCCGACC GAAACGGCGT 

651 GTATCCAAAC CGCGCCGGTA ACGGGATTCG GATACGGCTT GCCGAAACGC 

701 TCGTCCCGAT GCGCCCGATA TGCCGGGGCA CTTCCGGAGC GTTTGTCCAA 

751 ATAACGCCGT ATCCATATCG GCGCAAGCAG CCACAATACA TCATAAAGCC 

801 ATTGGAACAT CTTTCTATTT CCTGCAAAAC AAATGCCGTC TGAACGGTTC 

851 AGACGGCATT TCGGCAACGG AATCAAATAT CGTAG 



This corresponds to the amino acid sequence <SEQ ED 1232; ORF 295>: 

m2 9 5 . pep 

1 MLGMARHDDQ QRIAAILLPR RQQFFRLVFT PINARAAAHG NRPASDAFFK 

51 LPRQRFHLFR RYDVVFGIAA HLHGCRAQFR QPRRI RLCLR QTPRQRSGGR 

101 TDQAADFQIT VQRFFRQPRI RQKQRHTRAP AFPHQVGPDF GFHQNAEHRA 

151 VFAQKLPYPR FFVIRKIAAL RIGKQNLRGF PPRRGHXRHQ QRRIGKTPPQ 

201 LAYQGLGGTR FSDRNGVYPN RAGNGIRIRL AETLVPMRPI CRGTSGAFVQ 

251 ITPYPYRRKQ PQYIIKPLEH LSISCKTNAV XTVQTAFRQR NQIS* 



m295/g295 93.9% identity in 294 aa overlap 

10 20 30 40 50 60 

m295 . pep MLGMARHDDQQRIAAILLPRRQQFFRLVFTPINARAAAHGNRPASDAFFKLPRQRFHLFR 
MUM II || M IMMMi MMIMMMMM MMMMMMMMMMMI 
g295 mLGMARHDGQQGIAAILLPRRQQFFRLVFAPINARAAAHGNRPASDAFFKLPRQRFHVFR 

10 20 30 40 50 60 



70 80 90 100 110 120 

ra295 . pep RYDVVFGIAAHLHGCRAQFRQPRRIRLCLRQTPRQRSGGRTDQAADFQITVQRFFRQPRI 
I : : I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I) I II I I I I I I I I I I I I I I I I I 
g295 RHQVVFGIAAHLHGCRAQFRQPRRIRLRLRQTARQRSGCGTDQAADFQITVQRFFRQPRI 
70 80 90 100 110 120 



130 140 150 160 170 180 

m295 . pep RQKQPJtTRAPAFPHQVGPDFGFHQNAERRAVFAQKLPYPRFFVIRKIAALRIGKQNLRGF 
I I t I I I I I : I I I I I - I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I i I I I I I I 
g295 RQKQRHTRSPAFLHQIGPDFGFHQNAEHRAVFAQKLPYPRFFVIRKIAALRIGKQNLRGF 

130 140 150 160 170 180 



190 200 210 220 230 240 

m295 . pep PPRRGHLRHQQRRIGKTPPQLAYQGLGGTRFSDRNGVYPNRAGNGIRIRLAETLVPMRPI 
I I ) I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I: I I I I I I I I I I I : 
g295 PSRRGHLRHQQRRIGKTPPQLAYQGLGGTRFSDRNGVYPNRAGNGIRMRLAETLVPMRPV 

190 200 210 220 230 240 



250 260 270 280 290 

m295 . peo CRGTSGAFVQITPYPYRRKQPQYIIKPLEHLSISCKTNAVXTVQTAFRQRNQISX 

I I I I I I II I I I I I I II I I I I I I I I I I I I 1 1 I I I I I I I I I I I I : I I I I I I I I I I 
g295 CRGTSGAFVQITPYPYRRKQPQYII KPLEHLS1SCKTNAVRTVRTAFRQRNQIS 

250 260 270 280 290 
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The following partial DNA sequence was identified in N, meningitidis <SEQ ID 1233>: 
a295. seq 

1 ATGCTCGGGA TGGCGCGGCA CGACGACCAG CAGGGCATCG CCGCGATATT 

51 GTTGCCACGC CGCCAGCAGT TTTTCCGCCT CGTCTTCACC CCGA7AAACG 

101 CGCGTGCTGC CGCACACGGC AACCTGCCGG TCTCCGATGC GTTTTTCAAA 

151 CTGCCCCGCC AGCGTTTTCA TCTGTTCCGA CGGCATCAGG TCGTATTTGG 

201 TATTGCCGCA CACCTGCACG GATGCCGCGC CCAATTTCGC CAACCGCGCC 

251 GCATCCGCCT CCGTCTGTGC CAGACAGCCC GTCAGCGAAG CGGCGGCAGG 

301 ACGGATCAGG CTGCGGACTT TCAGATAACC GTTTAGCGAT TTTTCCGACA 

351 GCCGCGCATT CGCCAAAAAC AGCGGCACAC CCGTGCGCCG GCAT?CCTTC 

4 01 ATCAGATTGG GCCAGATTTC GGTTTCCATC AAAATGCCGA ACATCGGGCG 

4 51 GTGTTCGCGC AAAAACTGCC GTACCCACGT TTTTTTGTCA TACGGAAGAT 

501 AGCGGCATTG TGCATCAGGA AACAGAACTT GCGCGGTTTC CCGTCCCGTC 

551 GGGGTCATCT GCGTCATCAG CAGCGGCGCA TCGGGAAAAC GCTGCCGCAA 

601 CTCGCGTATC AAAGGTTGGG CGGCACGCGT TTCCCCGACC GAAACGGCGT 

651 GTATCCAAAC CGCGCCGGTA ACGGGATTCG GATACGGCTT GCCGAAACGC 

701 TCGCCCCGAT GCGCCCGATA TGCAGGGGCA CTTCCGGAGC GTTTGTCCAA 

751 ATAACGCCGT ATCCATATCG GCGCAAGCAG CCACAATACA TCATAAAGCC 

801 ATTGGAACAT CTTTCTATTT CCTGCAAAAC AAATGCCGTC CGAACGGTTC 

851 GGACGGCATT TCGGCAACGG AATCAAATAT CGTAG 

This corresponds to the amino acid sequence <SEQ ID 1234; ORF 295 .a>: 

£295. pep 

1 MLGMARHDDQ QGIAAILLPR RQQFFRLVFT PINARAAAHG NLPVSDAFFK 

51 LPRQRFHLFR RHOVVFGIAA HLHGCRAQFR QPRRIRLRLC QTARQRSGGR 

101 TDQAADFQIT V*RFFRQPRI RQKQRHTRAP AFLHQIGPDF GFHQNAEHRA 

151 VFAQKLPYPR FFVIRKIAAL CIRKQNLRGF PSRRGHLRHQ QRRIGKTLPQ 

201 LAYQRLGGTR FPDRNGVYPN RAGNGIRIRL AETLAPMRPI CRGTSGAFVQ 

251 ITPYPYRRKQ PQYIIKPLEH LSISCKTNAV RTVRTAFRQR NQIS* 

m295/a295 93 .2% identity in 294 aa overlap 

10 20 30 40 50 CO 

m295 . pep mlgmarhddqqriaaillprrqqffrlvftpinaraaahgnrpasdaffklprqrfhlfr 
I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I : I I I M I I I I I I I I I I I 

a295 mlgmarhddqqgiaaillprrqqffrlvftpinaraaahgnlpvsdaffklprqrfhlfr 

10 20 30 40 50 60 

70 60 90 100 110 120 

m2 95 . pep RYDWFGI aahlhgcraqfrqprrirlclrqtprqrsggrtdqaadfqitvqrffrqpri 
I : : I I I I I I I I I I I II I I I I I I I I I I I I II I I I i I I I 11 I I I I I I I I I I I I I I I I I 

a 295 rhqwfgiaahlhgcraqfrqprrirlrlcqtarqrsggrtdqaadfqitvxrffrqpri 

70 80 90 100 110 120 

130 140 150 160 170 180 

m295 . pep RQKQRHTRAPAFPHQVGPDFGFHQNAEHRAVFAQKLPYPRFFVIRKIAALRIGKQNLRGF 
I i I I ! I I I I I I I I I : I I I I I I I I I M I I II I I I I I I I I I I I II I I I I I I 1 I I I I I I I 
a295 RQKQRHT RAP AF LHQ I G P D FG FH QN AEHRAV FAQK L P Y PRF FV I RK I AALC I RKQN LRG F 

130 140 150 160 170 180 

190 200 210 220 230 240 

m295 . pep PPRRGHLRHQQRRIGKTPPQLAYQGLGGTRFSDRNGVYPNRAGNGIRIRLAETLVPMRPI 
I I I I I I I I I I ! I II I I I I II I I I I I I I I I I I I I I I I . I I I I I f I I I II I I : I M I I 
a2 9 5 PSRRGHLRHQQRRIGKTLPQLAYQRLGGTRFPDRNGVYPNRAGNGIRIRLAETLAPMRPI 
190 200 210 220 230 240 

250 260 270 200 290 

m295 . pep CRGTSGAFVQITPYPYRRKQPQYI IKPLEHLSI SCKTNAVXTVQTAFRQRNQISX 

I I I I 1 I I I I II I I I II I I I I I I I I I I I I II I I I I I I I I I I 11:11111111111 
a295 C RGT SG A FVQ I T P Y P Y RRKQP Q Y 1 1 K PLE H LS I SC KTNAVRTVRT A FRQRNQ I S X 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1235>: 

g297 .aoq 

1 ATGGCTGTCT TCCCACTTTC GGCAAAACAT CGGAAATACG CGCTGCGCGC 
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51 GCTTGCCGTT TCGATTATTC TGGTGtcgGC GGCATACATT GCttcgacag 

101 aggggaccga gcgcgtcaga ccgcAGCGCG TggaacaaAA ACTGCCGCCG 

151 CTGTCtTGGg gcggcaacgg CGTtcagacg gcaTATTGGG TGCAGGAGGC 

201 GGTGCagccg ggggactcgC TGGCGGACGT GCTGGCGCGT TCGGGTATGG 

251 CGCGGGacga gattgCCcga ATcacGGAAA aataTggcgG CGAAGCCGAT 

301 TTGCGgcatt tGCGTGCCGA CCAGTCGGTT CATGTTTTGG TCGGCGGCGA 

351 CGGCAGTGCG CGCGAAGTGC AGTTTTttaC CGACGAAGAC GGCGAGCGCA 

401 aTctGGTCGC TTTGGAAAAA AAAGGCGGCA TATGGCGGCG GTCGGCTTCT 

4 51 GATGCGGATA TGAAGGTTTT GCCGACACTG CGTTCGGTCG TGGTCAAAAC 

501 GTCGGCGCGC GGTTCGCTGG CGCGGGCGGA AGTGCCCGTC GAAATCCGCG 

551 AATCCTTAAG CGGGATTTTT GCCGGCCGCT TCAGCCTTGA CGGTTTGAAG 

601 GAAGGCGATG CCGTGCGCCT GCTTTACGAC AGCCTGTATT TCCACGGGCA 

651 GCAGGTGGCG GCGGGCGATA TTTTGGCGGC GGAAGTTGTC AAGGGCGGCA 

701 CAACCCATCA GGCGTTCTAT TACCGTTCGG ACAAGGAAGG CGGAGGGGGC 

751 CGCAATTATT ACGATGAAGA CGGCAGGGTG TTGCAGGAAA AAGGCGGCTT 

801 CAACATCgaG CCGCTGGTCT ATACGCGCAT TTCTTCGCCG TTCGGCTACC 

851 GTATGCACCC CATCCTGCAC ACATGGCGGC TGCACACGGG CATCGATTAT 

901 GCCGCACCGC AGGGAACGCC GGTCAGGGCT TCCGCCGACG GCGTGATTAC 

951 CTTTAAAGGC CGGAAGGGCG GATACGGCAA CGCGGTGATG ATACGCCACG 

1001 CCAACGGTGT GGAAACGCTG TACGCGCACT TGAGCGCGTT TTCGCAGGCA 

1051 CAAGGCAATG TGCGCGGCGG CGAGGTCATC GGTTTTGTCG GTTCGACAGG 

1101 GCGTTCGACC GGGCCGCACC TGCATTACGA GGCGCGCATC AACGGGCAGC 

1151 CCGTCAATCC TGTTTCGGTC GCATTGCCGA CACCCGAATT GACGCAGGCG 

1201 GACAAGGCGG CGTTTGCCGC GCAGAAACAG AAGGCGGACG CGCTGCTTGC 

1251 GCGCTTGCGC GGCATACCGG TTACCGTGTC GCAATCGGAT TGA 

This corresponds to the amino acid sequence <SEQ ID 1236; ORF 297.ng>: 

g297 .pep 

1 MAVFPLSAKH RKYALRALAV SIILVSAA YI ASTEGTERVR PQRVEQKLPP 

51 LSWGGNGVQT AYWVQEAVQP GDSLADVLAR SGMARDEIAR ITEKYGGEAD 

101 LRHLRADQSV HVLVGGDGSA REVQFFTDED GERNLVALEK KGGIWRRSAS 

151 DADMKVLPTL RSWVKTSAR GSLARAEVPV EIRESLSG1 F AGRFSLDGLK 

201 EGDAVRLLYD SLYFHGQQVA AGDILAAEVV KGGTTHQAFY YRSDKEGGGG 

251 GNYYDEDGRV LQEKGGFNIE PLVYTRISSP FGYRMHPILH TWRLHTGIDY 

301 AAPQGTPVRA SADGVITFKG RKGGYGNAVM IRHANGVETL YAHLSAFSQA 

351 QGNVRGGEVI GFVGSTGRST GPHLHYEARI NGQPVNPVSV ALPTPELTQA 

401 DKAAFAAQKQ KADALLARLR GIPVTVSQSD * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 1237>: 

m297.seq 

1 ATGGCTGTCT TCCCACTTTC GGCAAAACAT CGGAAATACG CGCTGCGTGC 

51 GCTTGCCGTT TCGATTATTT TGGTGTCGGC GGCATACATT GCTTCGACAG 

101 AGAGGACGGA GCGCGTCAGA CCGCAGCGCG TGGAACAAAA TCTGCCGCCG 

151 CTGTCTTGGG GCGGCAGCGG CGTTCAGACG GCATATTGGG TGCAGGAGGC 

201 GGTGCAGCCG GGCGACTCGC TGGCGGACGT GCTGGCGCGT TCGGGTATGG 

2 51 CGCGGGACGA GATTGCCCGA ATCACGGAAA AATATGGCGG CGAAGCCGAT 

301 TTGCGGCATT TGCGTGCCGA CCAGTCGGTT CATGTTTTGG TCGGCGGCGA 

351 CGGCGGCGCG CGCGAAGTGC AGTTTTTTAC CGACGAAGAC GGCGAGCGCA 

401 ATCTGGTCGC TTTGGAAAAG AAAGGCGGCA TATGGCGGCG GTCGGCTTCT 

451 GAGGCGGATA TGAAGGTTTT GCCGACGCTG CGTTCGGTCG TGGTCAAAAC 

501 GTCGGCGCGC GGTTCGCTGG CGCGGGCGGA AGTGCCCGTC GAAATCCGCG 

551 AATCCTTAAG CGGGATTTTC GCCGGCCGCT TCAGCCTTGA CGGTTTGAAG 

601 GAAGGCGATG CCGTGCGCCT GATGTACGAC AGCCTGTATT TCCACGGGCA 

651 GCAGGTGGCG GCGGGCGATA TTTTGGCGGC TGAAGTCGTT AAGGGCGGCA 

701 CAAGGCATCA GGCGTTCTAT TACCGTTCGG ACAAGGAAGG CGGAGGGGGC 

751 GGCAATTATT ATGATGAAGA CGGCAAGGTG TTGCAGGAAA AAGGCGGCTT 

801 CAACATCGAG CCGCTGGTCT ATACGCGCAT TTCTTCGCCG TTCGGCTACC 

851 GTATGCACCC CATCCTGCAC ACATGGCGGC TGCACACGGG CATCGATTAT 

901 GCCGCACCGC AGGGAACGCC GGTCAGGGCT TCCGCCGACG GCGTGATTAC 

951 CTTTAAAGGC CGGAAGGGCG GATACGGCAA CGCGGTGATG ATACGCCACG 

1001 CCAACGGTGT GGAAACGCTG TACGCGCACT TGAGCGCGTT TTCGCAGGCG 

1051 GAAGGCAATG TGCGCGGCGG CGAGGTCATC GGTTTTGTCG GTTCGACCGG 

1101 GCGTTCGACC GGGCCGCACC TGCATTACGA GGCGCGCATC AACGGGCAGC 

1151 CCGTCAATCC TGTTTCGGTC GCATTGCCGA CACCGGAATT GACGCAGGCG 

1201 GACAAGGCGG CGTTTGCCGC GCAGAAACAG AAGGCGGACG CGCTGCTTGC 

1251 GCGCTTGCGC GGCATACCGG TTACCGTGTC GCAATCGGAT TGA 

This corresponds to the amino acid sequence <SEQ ID 1238; ORF 297>: 

m297 .pop 

1 MAVFPLSAKH RKYALRALAV SIILVSAA YI ASTERTERVR PQRVEQNLPP 

51 LSWGGSGVQT AYWVQEAVQP GDSLADVLAR SGMARDEIAR ITEKYGGEAD 

101 LRHLRADQSV HVLVGGDGGA REVQFFTDED GERNLVALEK KGGIWRRSAS 

151 EADMKVLPTL RSVWKTSAR GSLARAEVPV EIRESLSGIF AGRFSLDGLK 
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201 EGDAVRLMYD SLYFHGQQVA AGDILAAEVV KGGTRHQAFY YRSDKEGGGG 

251 GNYYDEDGKV LQERGGFNIE PLVYTRISSP FGYRMHPILH TWRLHTGIDY 

301 AAPQGTPVRA SADGVITFKG RKGGYGNAVM IRHANGVETL VAHLSAFSQA 

351 EGNVRGGEVI GFVGSTGRST GPHLHYEARI NGQPVNPVSV ALPTPELTQA 

401 DKAAFAAQKQ KADALLARLR G3PVTVSQSD * 

m297/g297 97.9% identity in 430 aa overlap 

10 20 30 40 50 60 

m297.pep MAVFPLSAKHRKYALRALAVSIILVSAAYIASTERTERVRPQRVEQNLPPLSWGGSGVQr 
I I I I I I 1 1 i I I 1 I I i I I I I I I I I I I I I I I I i I 1 1 I I I I I I I II 1 1 M I I I I 1 1 I Ml 1 1 
g297 MAVFPLSAKHRKYALRALAVSIILVSAAYIASTEGTERVRPQRVEQKLPPLSWGGNGVQT 

10 20 30 4 0 50 60 

70 80 90 100 110 120 

m297.pep AYWVQEAVQPGDSLADVLARSGMARDEIARITEKYGGEADLRHLRADQSVHVLVGGDGGA 
I I I I I I I I I I I ! I ! I I I i I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I ! I I I I I I I : I 
g297 A Y W V QEA V Q PG DS L A D VLARS GMARDE I A RI TEKYGGEA DLRH LRADQS VH VL VGGDGS A 

70 80 90 100 110 120 

130 140 150 160 170 180 

m297 . pep REVQFFTDEDGERNLVALEKKGGIHRRSASEADMKVLPTLRSVWKTSARGSLARAEVPV 

I | I I II I I I I I I I I I I I 1 I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g297 REVQFFTDEDGERNLVALEKKGGIWRRSASDADMKVLPTLRSVWKTSARGSLARAEVPV 

130 140 150 160 no 180 

190 200 210 220 230 240 

m297.pep EIRESLSGIFAGRFSLDGLKEGDAVRLMYDSLYFHGQQVAAGDILAAEVVKGGTRHQAFY 
) I I I I I I I I I I I I I I t I I I I I I I I I I I : ! I 1 I I I I I ! I I I I I I I I I I I I I I I I I I I I I I 
g297 EIRESLSGIFAGRFSLDGLKEGDAVRLLYDSLYFHGQQVAAGDILAAEVVKGGTTHQAFY 

190 200 210 220 230 240 

250 260 270 200 290 300 

m297.pep YRSDKEGGGGGNYYDEDGKVLQEKGGFNIEPLVYTRISSPFGYRMHPILHTWRLHTGIDY 

II MINIMI I! I||f|:| Mill I I I I I I I I i 1 1 I I I I i I I I I I I i I I I I I I I I ill 
g2 97 YRSDKEGGGGGNY Y DEDGRVLQEKGGFNI EPLVYTRI SS PFGYRMHPI LHTWRLHTGI DY 

250 260 270 280 290 300 

310 320 330 340 350 360 

m297.pep aapqgtpvrasadgvitfkgrkggygnavmirhangvetlyahlsafsqaegnvrggevi 

I I ! I I 1 I I 1 i II I i I I I i I I I M I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I 

g297 aapqgtpvrasadgvitfkgrkggygnavmirhangvetlyahlsafsqaqgnvrggevi 

310 320 330 340 350 360 

370 380 390 400 410 420 

m297 . pep gfvgstgrstgphlhyearingqpvnpvsvalptpeltqadkaafaaqkqkadallarlr 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II I II 1 1 1 1 

g297 gfvgstgrstgphlhyearingqpvnpvsvalptpeltqadkaafaaqkqkadallarlr 

370 380 390 400 410 420 

430 

m297.pep GIPVTVSQSDX 
I I I I II I I I I I 
g297 GIPVTVSQSDX 
430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1239>: 
a297.seq 

1 ATGGCTGTCT TCCCACTTTC GGCAAAACAT CGGAAATACG CGCTGCGCGC 

51 CCTTGCCCTT TCGATTATTT TGGTGTCGGC GGCATACATT GCTTCGACAG 

101 AGAGGACGGA GCGCGTCAGA CCGCAGCGCG TGGAACAAAA ACTGCCGCCG 

151 CTGTCTTGGG GCGGCAGCGG TGTTCAGACG GCATATTGGG TGCAGGAGGC 

201 GGTGCAGCCA GGCGACTCGC TGGCGGACGT GCTGGCGCGT TCGGGTATGG 

251 CGCGGGACGA AATTGCCCGA ATAACGGAAA AATATGGCGG CGAAGCCGAT 

301 TTGCGGCATT TGCGTGCCGA CCAGTCGGTT CATGTTTTGG TCGGCGGCGA 

351 CGGCGGCGCG CGCGAAGTGC AGTTTTTTAC CGACGAAGAC GGCGAGCGCA 

401 ATCTGGTCGC TTTGGAAAAA AAAGGCGGCA TATGGCGGCG GTCGGCTTCT 

451 GAGGCGGATA TGAAGGTTTT GCCGACGCTG CGTTCGGTCG TGGTCAAAAC 

501 GTCGGCGCGC GGTTCGCTGG CGCGGGCGGA AGTGCCCGTC GAAATTCGCG 
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551 AATCCTTAAG CGGGATTTTC GCCGGCCGCT TCAGCCTTGA TGGTTTGAAG 

601 GAAGGCGATG CCGTGCGCCT GATTTACGAC AGCCTGTATT TCCACGGGCA 

651 GCAGGTGGCG GCGGGCGATA TTCTGGCGGC GGAAGTCGTT AAGGGCGGCA 

701 CAAGGCATCA GGCGTTCTAT TACCGTTCGG ACAAGGAAGG AGGAGGGGGC 

751 GGCAATTATT ACGATGAAGA CGGCAGGGTG TTGCAGGAAA AAGGCGGCTT 

801 CAACATCGAG CCACTGGTCT ATACGCGCAT TTCTTCGCCG ?TCGGCTACC 

851 GTATGCACCC CATCCTGCAC ACTTGGCGGC TGCACACGGG CATCGATTAT 

901 GCCGCACCGC AGGGAACGCC GGTCAGGGCT TCCGCCGACG GCGTGATTAC 

951 CTTTAAAGGC CGGAAGGGTG GCTACGGCAA CGCGGTGATG ATACGCCACG 

1001 CCAACGGTGT GGAAACGCTG TATGCGCACT TGAGCGCGTT TTCTCAGGCA 

1051 GAAGGCAATG TGCGCGGCGG CGAGGTCATC GGTTTTGTCG GTTCGACCGG 

1101 GCGTTCGACG GGGCCGCACC TGCATTACGA GGCGCGCATC AATGGGCAGC 

1151 CCGTCAATCC TGTTTCGGTC GCATTGCCGA CACCCGAATT GACGCAGGCG 

1201 GACAAGGCGG CGTTTGCCGC GCAGAAACAG AAGGCGGACG CGCTGCTTGC 

1251 GCGCTTGCGC GGCATACCGG TTACCGTGTC GCAATCGGAT TGA 



This corresponds to the amino acid sequence <SEQ ID 1240; ORF 297.a>: 

a297 .pep 

1 MAVFPLSAKH RKYAIRALAV SIILVSA AYI ASTERTERVR PQRVEQKLPF 

51 LSWGGSGVOT AYWVQEAVQP GDSLADVLAR SGMARDE IAR ITEKYGGEAD 

101 LRHLRADQSV HVLVGGCGGA REVQFFTDED GERNLVALEK KGGIWRRSAS 

151 EADMKVLPTL RSVWKTSAR GSLARAEVPV EIRESLSGIF AGRFSLDGLK 

201 EGDAVRLIYD SLYFHGQQVA AGDILAAEW KGGTRHQAFY YRSDKEGGGG 

251 GNYYDEDGRV LQEKGGFNIE PLVYTRISSP FGYRMHPILH TWRLHTGIDY 

301 AAPQGTPVRA SADGVITFKG RKGGYGNAVM IRHANGVETL YAHLSAFSQA 

351 EGNVRGGEVI GFVGSTGRST GPHLHYEARI NGQPVNPVSV ALPTPSLTQA 

401 DKAAFAAQKQ KADALLARLR GIPVTVSQSD * 

m297/a297 99.3% identity in 430 aa overlap 

10 20 30 40 50 60 

m297 . pep MAVFPLSAKHRKYALRALAVSIILVSAAYIASTERTERVRPQRVEQNLPPLSWGGSGVQT 
I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I II I I I I: I II I I I I I I I I I I 
a297 MAVFPLSAKHRKYALRALAVSIILVSAAYIASTERTERVRPQRVEQKLPPLSWGGSGVQT 

10 20 30 40 50 60 



70 80 90 100 110 120 

m297.pep AYWVQEAVQPGDSLADVLARSGMARDE IARITEKYGGEADLRHLRADQSVHVLVGGDGGA 

I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II II I I I 
a 2 9 7 • A YW VQE AVQPG DS LADVLAR S GMARDE IARITEKYGGEADLRHLRADQSVHVLVGG DGGA 

70 80 90 100 110 120 



130 140 150 160 170 180 

m2 97 . pep REVQFFT DE DGERN LVALEKKGG I WRRS AS E ADMKVL PT LRS V WKT S ARGS LARAEVPV 

I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a297 R EV Q F FT DE DGERN LVALEKKGG I WRR S AS EADMKV L PT LR S WVKT S ARGS LARAEV P V 

130 140 150 160 170 180 



190 200 210 220 230 240 

m2 97 . pep EXRESLSGIFAGRFSLDGLKEGDAVRLMYDSLYFHGQQVAAGDILAAEVVKGGTRHQAFY 
I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I 
a297 EIRESLSGIFAGRFSLDGLKEGDAVRLIYDSLYFHGQQVAAGDILAAEWKGGTRHQAFY 

190 200 210 220 230 240 



250 260 270 280 290 300 

m297.pep YRS DKEGGGGGNYYDEDGK\ r LQEKGGFNIE PLVYTRI SS PFGYRMHPILHTWRLIITGIDY 

II I I I I I I I II I I I I I I I : I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
a297 YRSDKEGGGGGNYYCEDGRVLQEKGGFNIEPLVYTRISSPFGYRMHPILHTWRLHTGIDY 

250 260 270 280 290 300 



310 320 330 340 350 360 

m2 97 . pep AAPQGTPVRASADGVITFKGRKGGYGNAVMTRHANGVETLYAHLSAFSQAEGNVRGGEVI 
I I I I I I I I I 1 1 I I I 1 1 M I I 1 1 1 1 I I I I I I I I I I M I II 1 1 I 1 1 I I I I 1 1 I 1 1 I 1 1 1 1 1 1 
a297 AAPQGTPVRASADGVITFKGRKGGYGNAVMIRHANGVETLYAHLSAFSQAEGNVRGGEVI 

310 320 330 340 350 360 
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370 380 390 400 410 420 

m2 97 . pep GFVGSTGRSTGPHLHYEARINGQPVNPVSVALPTPELTQADKAAFAAQKQKADAIiLARLR 
I I I I | I I I II I I I I I I I I I I I I I I I I I I I I I I I ! I I I I ! I I I I I I I I I I I I I I I I I I i I I 
a 297 GFVGSTGRSTGPHLHYEARINGQPVNPVSVALPTPELTQADKAAFAAQKQKADALLARLR 

370 380 390 400 410 420 

430 

m297.pep GIPVTVSQSDX 
I I I I I I I I I I I 
a297 GIPVTVSQSDX 
430 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1241>: 

g298.seq 

1 ATGAAAAACT TTCTTTCCCT TTTCGCCTCC. ATACTGATGT CTGCCCTGAT 

51 TGCCGTGTGG TTCAGC CAAA ACCCCATCAA CGCCTACTGG CAGCAGACCT 

101 ACCACCGCAA CAGCCCGCTC GAACCGCTTG CCGCCTACGG ATGGTGGCGG 

151 AGCGGAGCGG CGTTGCAAGA AAACGCCTAC GCCCTTTCAG ACGGCATCAA 

201 AACCTTCCTG TCCGGCGAAA cgccccccac ggCTCAAGAC GGCGGTTCGG 

251 CAGATATGCC GCCTGAAGCC GCCGCATCCG AAGCCGCCCC GCCGGCCGGC 

301 GGAACAGAAT GGAAACAAGG CACCGAAGCC GCCGCCGTCC GCAGCGGCGA 

351 CAAAGTCTTT TTCGCCGGAG ATTCGCTGAT GCAGGGCGTT GCGCCTTTCG 

401 TGCAAAAAAG CCTGAAACAG CAATACGGCA TCGAATCCGC CAACCTCAGC 

451 AAACAAAGCA CGGGGCTTTC CTATCCCTCA TTCTTCGACT GGCCGAAAAC 

501 GATTGAAGAA ACCTTGAAAA AACATCCCGA AATCAGCGTA CTCGCCGTCT 

551 TCCTCGGCCC GAACGACCCG TGGGATTTCC CCGTCGGCAA ACGCTACCTC 

601 AAATTCGCTT CCGACGAATG GGCGCAAGAA TACCTGAAAC GCGTCGACCG 

651 CATCCTTGAA GCCGCACACA CGCACCGCGT CCAAGTCGTC TGGCTCGGCA 

701 TCCCCTACAT GAAAAAAGTC AAGCTCGACG GTCAGATGCG CTACCTCGAC 

751 AAACTGCTTT CGGAACACTT GAAAGGCAAA ATCATCCTGA TTCCCACCGC 

801 GCAAACACTG AGCGGCGGGA AAGgccGCTA CACCGATTCC GTCAACGTCA 

851 ACGGCAAACC CGTCCGCTAC CGCAGTAAGG ACGGCATACA CTTTACCGCC 

901 GAAGGACAAA AACTGCTGGC GGAAAAAATA ATGGAAAAAA TCGTTTTTGA 

951 ACCGAGTACG CAACCATCAA GTACACAGCC ATGA 

This corresponds to the amino acid sequence <SEQ ID 1242; ORF 298.ng>: 

g298.pep 

1 MKNFLSLFAS ILMSA LIAVW FSQNPINAYW QQTYHRNSPL EPLAAYGWWR 

51 SGAALQENAY ALSDGI KTFL SGETPPTAQD GGSADMPPEA AASEAAPPAG 

101 GTEWKQGTEA AAVRSGDKVF FAGDSLMQGV APFVQKSLKQ QYGIESANLS 

151 KQSTGLSYPS FFDWPKTIEE TLKKHPEISV LAVFLGPNDP WDFPVGKRYL 

201 KFASDEWAQE YLKRVDRILE AAHTHRVQW WLGIPYMKKV KLDGQMRYLD 

251 KIjIiSEHLKGK IILIPTAQTL SGGKGRYTDS VNVNGKPVRY RSKDGIHFTA 

301 EGQKLLAEKI MEKIVFEPST QPSSTQP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1243>: 

m298.seq 

1 ATGAAAAACT TTCTTTCCCT TTTCTCCTCC ATACTGATGT CTGCCCTGAT 

51 TGCCGTGTGG TTCAGCCAAA ACCCCATCAA CGCCTACTGG CAGCAGACCT 

101 ACCACCGCAA CAGCCCGCTC GAACCGCTTG CCGCCTACGG ATGGTGGCGG 

151 AGCGGTGCGG CGTTGCAAGA AAACGCCTAC GCCCTTTCAG ACGGCATCAA 

201 AGCCTTCCTG TCCGGCGAAA CGCCGCCGAC GGCTCAAGAC GGCGGTTCGG 

251 CAGATATGCC GTCTGAAGCC GCCGCATCCG AAGCCGTCCC TCAAACCGGT 

301 GAAACAGAAT GGAAACAAGA CACCGAAGCC GCCGCCGTCC GCAGCGGCGA 

351 CAAAGTCTTT TTTGTCGGCG ACTCGCTGAT GCAGGGCGTT GCCCCCTTCG 

401 TGCAAAAAAG CCTGAAACAG CAATACGGCA TCGAATCCGT CAACCTCAGC 

451 AAACAAAGCA CGGGGCTGTC CTACCCCTCA TTCTTCGACT GGCCGAAAAC 

501 GATTGAAGAA ACCCTGCAAA AACATCCCGA AATCAGCGTA CTCGCCGTCT 

551 TCCTCGGACC GAACGACCCG TGGGATTTCC CCGTCGGCAA ACTCTATCTC 

601 AAATTCGCTT CCGACGAATG GGCGCAAGAA TACCTGAAAC GTGTCGACCG 

651 CATCCTTGAA GCCGCACACA CGCACCGCGT CCAAGTCGTC TGGCTCGGCA 

701 TCCCCTACAT GAAAAAAGCC AAGCTCGACG GACAGATGCG CTACCTAGAC 

751 AAACTGCTTT CGGAACATTT GAAAGGCAAA ATCATCCTGA TTCCCACCAC 

801 GCACACCCTG AGCGGCGGGA AAGACCGCTA CACCGACTCC GTCAACGTCA 

851 ACGGCAAACC CGTCCGCTAC CGCAGCAAGG ACGGCATACA CTTTACCGCC 

901 GAAGGACAAA AACTGCTGGC GGCAAAAATA ATGGAAAAAA TCGTTTTTGA 

951 ACCAAGTACG CAACCATCAA GTACACAGCC ATGA 
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This corresponds to the amino acid sequence <SEQ ID 1244; ORF 298>: 

m298.pep 

1 MKNFLSLFSS ILMSA LIAVW FSQNPINAYW QQTYHRNSPL EPLAAYGWWR 

51 SGAALQENAY ALSDGIKAFL SGETPPTAQD GGSADMPSEA AASEAVPQTG 

101 ETBWKQDTEA AAVRSGDKVF FVGDSLMQGV APFVQKSLKQ QYGIESVNLS 

151 KQSTGLSYPS FFDWPKTIEE TLQKHPEISV LAVFLGPNDP WDFPVGKLYL 

201 KFASDBWAQE YLKRVDRILE AAHTHRVQW WLGIPYMKKA KLDGQMRYLD 

251 KLLSEHLKGK IILIPTTHTL SGGKDRYTDS VNVNGKPVRY RSKDGIHFTA 

301 EGQKLLAAKI MEKIVFEPST QPSSTQP* 

m298/g298 94.8% identity in 327 aa overlap 

10 20 30 40 50 60 

m2 98 . pep MKNFLSLFSSILMS ALI AVWFSQNPINAYNQQTYHRNSPLEPLAAYGWWRSGAALQENAY 

MINI INI II I 1111 II II Nil!! II II Hill I Mill I II II Mill Mil! M 
g 2 9 8 MKNFLSLFASILMSALI AVWFSQNPINAYWQQTYHRNSPLE P LAAYGWWR SGAALQENAY 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 98 pep ALSDGIKAFLSGETPPTAQDGGSADMPSEAAASEAVPQTGETEWKQDTEAAAVRSGDKVF 

MINIMUM MIMI IIIMII !lllll|:| I III!) Illllllllllll 
g2 98 ALSDG I KTFLSGET P PTAQDGG S ADMP PE AAAS EAAP PAGGTE WKQGT EAAAVR S GDK VF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 9 8 . pep FVGDSLMQGVAPFVQKSLKQQYGIKSVNLSKQSTGLSYPSFFDWPKTI EETLQKHPEI SV 

hiiiiiiiiiiimiiiiiiMiMiimiiniiiiiiiiiiiiiihiiiiii! 

g298 FAGDSLWQGVAPFVQKSLKQQYGIESAHLSKQSTGLSYPSFFDWPKTIEETLKKHPEISV 
130 140 150 160 170 180 

190 200 210 220 230 240 

m2 98 . pep lAWl^PNDPWDFPVGKLYLKFASDEWAQEYLKRTORILEAAHTHRVQVVWLGIPYMKKA 

llllllimiMMII M IMMIIMNIIMMMMMMIII IMINMM M 

g2 9 8 IiAVF LG PND PWDF P VG KR Y LKFASDEWAQEYLKRVDR ILEAAHTHRVQWWLG I PYMKJCV 

190 200 210 220 230 240 

250 260 270 280 290 300 

TO298 .pep KLDGQMRYLDKLLS EHLKG KI I L I PTTHTLSGGKDRYTDSVNVNG K PVR YRS KDG I HFT A 

illllllti illllilllli llllll-lltlll lllllllllllllllllllllllll 
g2 98 KLDGQhKYLDKLLSEHLKGKIILTPTAQ/TIiSGGKGRYTDSVNVNGKPVRYRSKDGlHFTA 
250 260 270 280 290 300 

310 320 
m298.pep EGQKLLAAKIMEKIVFEPSTQPSSTQPX 

1:111 lllllllllillllllltll 

g298 EGQKLLAEKIMEKIVFEPSTQPSSTQPX 
310 320 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1245>: 

a298. seq 

1 ATGAAAAACT TTCTTTCCCT TTTCGCCTCC ATACTGATGT CTGCCCTGAT 

51 TGCCGTGTGG TTCAGCCAAA ACCCCATCAA CGCCTACTGG CAGCAGACCT 

101 ACCACCGCAA CAGCCCGCTC GAACCGCTTG CCGCCTACGG ATGGTGGCGG 

151 AGCGGTGCGG CATTGCAAGA AAACGCCTAC GCCCTTTCAG ACGGCATCAA 

201 AGCCTTCCTG TCCGGCGAAA CGCCGCCGAC GGCTCAAGAC GGCGGTTCGG 

251 CAGATATGCC GTCTGAAGCC GCCGCACCCG AAACCGCCCC TCAAACTGGC 

301 GAAACAGAAT GGAAACAAAA CACCGAAGCC GCCGCCGTCC GAACAGGGGA 

351 CAAAGTCTTT TTCGCCGGCG ACTCGCTGAT GCAGGGCGTT GCACCCTTCG 

401 TGCAAAAAAG CCTGAAACAG CAATACGGCA TCGAATCCGT CAACCTCAGC 

451 AAACAAAGCA CGGGGCTGTC CTACCCCTCA TTCTTCGACT GGCCGAAAAC 

501 GATTGAAGAA ACCCTGAAAA AACATCCCGA AATCAGCGTG CTCGCCGTCT 

551 TCCTCGGTCC GAACG AC CCG 7GGGATTTCC CCGTTGGCAA ACGCTACCTC 

601 AAATTCGCTT CCGACGAATG GGCGCAAGAA TACCTGAAAC GCGTCGACCG 

651 CATCCTTGAA GCCGCACACA CGCACTACGT CCAAGTCGTC TGGCTCGGCA 

701 TCCCCTACAT GAAAAAAGCC AAGCTCGACG GACAGATGCG CTACCTAGAC 
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751 AAACTGCTTT CGGAATATTT GAAAGGCAAA ATCATCCTGA TTCCCACCGC 

801 GCACACCCTG AGCGGCGGGA AAGACCGCTA CACCGACTCC GTCAACGTCA 

851 ACGGCAAACC CGTCCGCTAC CGCAGCAAGG ACGGCATACA CTTTACCGCC 

901 GAAGGACAAA AACTGCTGGC GGCAAAAATA ATGGAAAAAA TCGTTTTTGA 

951 ACCAAGTACG CAACCATCAA GTACACAGCC ATGA 

This corresponds to the amino acid sequence <SEQ ID 1246; ORF 298.a>: 

a298.pep 

1 MKNFLSLFAS ILMS ALIAVW FSQNPINAYW QQTYHRNSPL EPLAAYGWWR 

51 SGAALQENAY ALSDGIKAFL SGETPPTAQD GGSADMPSEA AAPETAPQTG 

101 ETEWKQNTEA AAVRTGDKVF FAGDSLMQGV APFVQKSLKQ QYGIESVNLS 

151 KQSTGLSYPS FFDWPKTIEE TLKKHPEISV LAVFLGPNDP WDFFVGKRYL 

201 KFASDEWAQE YLKRVDRILE AAHTHYVQVV WLGIPYMKKA KLDGQMRYLD 

251 KLLSEYLKGK IILIPTAHTL SGGKDRYTDS VNVNGKPVRY RSKDGIHFTA 

301 EGQKLI.AAKI MEKIVFEPST QPSSTQP* 

m298/a298 96.3% identity in 327 aa overlap 

10 20 30 40 50 60 

m298 . pep MKNFLSLFSSILMSALIAVt^FSQNPINAYWQQTYHRNSPLEPLAAYGWWRSGAALQENAY 
I | | I I I I I : I I I I I I I I I I I I I ! I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I 
3298 MKNFLSLFAS I LMSAL I ATOFSQNP IN AYWQQTYHRNSPLEPLAAYGWWRSGAALQENAY 

10 20 30 40 50 60 

70 80 90 100 110 120 

m298 . pep ALSDGIKAFLSGETPPTAQDGGSADMPSEAAASEAVPQTGETEWKQDTEAAAVRSGDKVF 
I I I I I I 1 I I I I I I I I I I II I I I I I I I I I I I t I I I I I I I I I I I I : I I I I I I I : I I I i I 
a 298 ALSDGIKAFLSGETPPTAQDGGSADMPSEAAAPETAPQTGETEWKQNTEAAAVRTGDKVF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m298.pep FVGDSLMQGVAPFVQKSLKQQYGIESVNLSKQSTGLSYPSFFDWPKTIEETLQKHPEISV 
[ : M I || M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! II I I 1 I I : I I I I I I I 
a 298 FAGDSLMQGVAPFVQKSLKQQYGIESVNLSKQSTGLSYPSFFDWPKTIEETLKKHPEISV 

130 140 150 160 170 180 

190 200 . 210 220 230 240 

m2 98 . pep LAVFLGPNDPWDFPVGKLYLKFASDEWAQEYLKRVDRILEAAHTHRVQWWLGI PYMKKA 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I i I I I I I 
a2 98 LAVFLGPNDPWDFPVGKRYLKFASDEWAQEYLKRVDRILEAAHTHYVQWWLGIPYNKKA 
190 200 210 220 230 240 

250 260 270 280 290 300 

m298 . pep KLDGQMRYLDKLLSEHLKGKIILIPTTHTLSGGKDRYTDSVNVNGKPVRYRSKDGIHFTA 
i I I I I I I I I I It I I I: I I I I I M I I I : I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I i I 
a 2 9 8 KLDGQMRYLDKLLSE YLKGK 1 1 L I PTAHTLSGGK DR YT DS VN VNGKP VRYRSKDG I-HFTA 

250 260 270 280 290 300 

310 320 
m2 98 . pep EGQKLLAAKIMEKIVFEPSTOPSSTQPX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a2 98 EGQKLLAAKIMEKIVFEPSTQPSSTQPX 

310 320 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1247>: 

g299.seq 

1 ATGAACCCCA AACACTTCAT CGCATTTTCC GCCCTGTTCG CCGCCACGCA 

51 GGCAGAAGCC CTGCCCGTCG CCTCCGTCAG CCCCGACACC GTTACCGTTT 

101 CCCCGTCCGC CCCCTACACC GATACAAACG GGCTGCTGAC CGACTACGGC 

151 AACGCCGCCG CCTCGCCTTG GATGAAAAAA CTCCGATCCG TCGCACAAGG 

201 CAGCGGCGAG GCCTTCCGCA TCCTGCAAAT CGGCGACTCG CATACCGCCG 

251 GCGACTTCTT TACCGACGCC CTGCGCAAAC GCCTGCAAAA AACATGGGGC 

301 GACGGCGGCA TAGGCTGGGT TTACCCCGCC AACGTCAAAG GGCAGCGCAT 

351 GGCGGCCGTC CGTCACAGCG GCAACTGGCA AAGCTTCACC AGCAGGAACA 

4 01 ATACCGGAGA TTTCCCGCTC GGCGGCATCC TCGCCCAAAC CGGCAGCGGC 

4 51 GGCGGCATGA CCCTGACCGC GTCTGACGGC AAAACCGGCA AACAGCGCGT 
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501 TTCCCTGTTT GCCAAACCGC TGCTCGCCGA ACAAACCCTG ACCGTCAACC 

551 GCAACACCGT CTCCGCCAAC GGCGGCGGCT GGCAGGTACT GGATACGGGC 

601 GCGGCACTGC CCCTGGCCAT ACAGACCGAA ATGCCGTGGG ACATCGGCTT 

651 CATCAACATC GAAAATCCCG CCGGCGGCAT TACCGTTTCC GCGATGGGCA 

701 TCAACGGCG C ACAATTGACC CAGTGGTCGA AATGGCGTGC CGACCGTATG 

751 AAOGACCTTG CCCAAACCGG CGCCGATTTG GTTATCCTTT CCTACGGCAC 

801 CAACGAAGCC TTCAACAACA ACATCGACAT TGCCGATACC GAACAAAAAT 

851 GGCTGGATAC CGTCCGCCAA ATCCGCGACA GCCTGCCCGC CGCCGGCATC 

901 CTCATCATCG GCGCGCCCGA ATCCCTGAAA AACACGCTCG GCGTATGCGG 

951 CACGCGCCCC GTCCTCCTGA CCGAAGTCCA ACAGATGCAG CGGCGCGTCG 

1001 CCCGTCAGGG GCAGACGATG TTTTGGTCTT GGCAAAACGC AATGGGCGGC 

1051 AT ATGCAG CA TGAAAAACTG GCTCAACCAA GGATGGGCCG CCAAAGACGG 

1101 CGTACACTTC TCCGCCCAAG GCTACCGGCG CGCGGCGGAA ATGCTTGCCG 

1151 ACAG CCTCGA AGAACTCGTC CGCGCCGCCG CAATCAGGCA ATAA 



This corresponds to the amino acid sequence <SEQ ID 1248; ORF 299.ng>: 

g299.pep 

MNPKHFIAFS 



1 
51 
101 
151 
201 
251 
301 
351 



NAAASPWMKK 
DGGIGWVYPA 
GGMTLTASDG 
AALPLAIQTE 
NDLAQTGADL 
LIIGAPESLK 
ICSMKNWLNQ 



ALFAATQAEA 
LRSVAQGSGB 
NVKGQRMAAV 
KTGKQRVSLF 
NPWDIGFINI 
VILSYGTNEA 
NTLGVCGTRP 
GWAAKDGVHF 



LPVASVSPDT 
AFRILQIGDS 
RHSGNWQSFT 
AKPLLAEQTL 
ENPAGGITVS 
FNNNIDIADT 
VLLTEVQQMQ 
SAQGYRRAAE 



VTVSPSAPYT 
HTAGDFFTDA 
SRNNTGDFPL 
TVNGNTVSAN 
AKGINGAQLT 
EQKWLDTVRQ 
RRVARQGQTM 
MLADSLEELV 



DTNGLLT DYG 
LRKRLQKTWG 
GGILAQTGSG 
GGGWQVLDTG 
QWSKWRADRM 
IRDSLPAAGI 
FWSWQNAMGG 
RAAAIRQ* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1249>: 

m299.seq 

1 ATGAACCCCA AACACCTCAT CGCATTTTCC GCCCTATTCG CCGCCACGCA 

51 GGCAGAAGCC CTACCTGTCG CCTCCGTCAG CCTCGACACC GTTACCGTTT 

101 CCCCGTCCGC CCCCTACACC GATACAAACG GGCTGCTGAC CGACTACGSC 

151 AACGCCTCCG CCTCGCCTTG GATGAAAAAA CTCCAATCCG TCGCACAAGG 

201 CAGCGGCGAG ACCTTCCGTA TCCTGCAAAT CGGCGACTCG CATACCGCCG 

251 GCGACTTCTT TACCGACAGC CTGCGCAAAC GCCTGCAAAA AACTTGGGGC 

301 GACGGCGGCA TAGGCTGGGT TTACCCCGCC AACGTCAAAG GGCAGCGCAT 

351 GGCGGCCGTC CGGCACAACG GTAACTGGCA AAGCCTCACC AGCAGGAACA 

401 ACACCGGAGA CTTCCCGCTC GGCGGCATCC TCGCCCACAC CGGCAGCGGC 

451 GGCAGCATGA CCCTGACCGC ATCGGACGGC ATAGCAAGCA AGCAGCGCGT 

501 TTCCCTGTTT GCCAAACCCC TGCTTGCCGA ACAAACCCTG ACCGTCAACG 

551 GCAACACCGT CTCCGCCAAC GGCGGCGGCT GGCAGGTACT GGATACGGGC 

601 GCGGCACTGC CCCTGACCAT ACAGACCGAA ATGCCGTGGG ACATCGGCTT 

651 CATCAACATC GAAAATCCCG CCGGCGGCAT TACCGTTTCC GCGATGGGCA 

701 TCAACGGCG C ACAATTAACC CAGTGGTCGA AATGGCGTGC CGACCGTATG 

751 AACGACCTCG CCCAAACCGG CGCCGATTTG GTTATCCTTT CCTACGGCAC 

801 CAACGAAGCT TTCAACAACA ACATCGACAT TGCCGACACC GAACAAAAAT 

851 GGCTGGATAC CGTCCGCCAA ATCCGCGACA GCCTGCCTGC CGCCGGCATC 

901 CTCATCATCG GCGCACCCGA ATCCCTGAAA AACACGCTCG GCGTATGCGG 

951 CACACGCCCC GTCCGCCTGA CCGAAGTCCA ACAGATGCAG CGGCGCGTCG 

1001 CCCGTCAGGG GCAGACGATG TTCTGGTCTT GGCAAAACGC CATGGGCGGC 

1051 AT ATGCAG C A TGAAAAACTG GCTCAACCAA GGATGGGCCG CCAAAGACGG 

1101 CGTACACTTC TCCGCCAAAG GCTACCGGCG CGCGGCGGAA ATGCTCGCCG 

1151 ACAGCCTCGA AGAACTCGTC CGCTCCGCTG CAATCAGGCA ATAA 

This corresponds to the amino acid sequence <SEQ ID 1250; ORF 299>: 

m299.pep 



1 MNPKHLIAF5 ALFAATQAEA 



51 NASASPWMKK 

101 DGGIGWVYPA 

151 GSMTLTASDG 

201 AALPLTIHTE 

251 NDLAQTGADL 

301 LIIGAPESLK 

351 ICSMKNWLNQ 



LQSVAQGSGE 
NVKGQRMAAV 
IASKQRVSLF 
MPWDIGFINI 
VILSYGTNEA 
NTLGVCGTRP 
GWAAKDGVHF 



LPVASVSLDT 
TFRILQIGDS 
RHNGNWQSLT 
AKPLLAEQTL 
ENPAGGITVS 
FNNNIDIADT 
VRLTEVQQMQ 
SAKGYRRAAB 



VTVSPSAPYT 
HTAGDFFTDS 
SRNNTGDFPL 
TVNGNTVSAN 
AMGINGAQLT 
EQKWLDTVRQ 
RRVARQGQTM 
MLADSLEELV 



DTNGLLTDYG 
LRKRLQKTWG 
GGILAHTGSG 
GGGWQVLDTG 
QWSKWRADRM 
IRDSLPAAGI 
FWSWQNAMGG 
RSAAIRQ* 



m299/g299 95.5% identity in 397 aa overlap 



m299.pep 



10 20 30 40 50 60 

MNPKHLIAFSALFAATQAEALPVASVSLDTVTVSPSAPYTDTNGLLTDYGNASASPWMKK 

HIIMllllllllllllllllllll iltllllllltlMlllli!lil|:||||ii( 
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MNPKHFI AFSALFAATQAB ALPVASVS PDTVTVS PSAPYTDTNGLLTDYGNAAAS PWMKK 
10 20 30 40 50 60 



70 80 90 100 110 120 

m299 . pep LQSVAQGSGETFRI LQIGDSHTAGDFFTDSLRKRLQKTWGDGG IGWVYPANVKGQRMAAV 

hillllllhlillillllliMllllhlMlilllllllllllllllliilllllli 

g299 LRSVAQGSGEAFRILQIGDSHTAGDFFTDALRKRLQKTWGDGG IGWVYPANVKGQRMAAV 

70 BO 90 100 110 120 

130 140 150 160 170 180 

m299 . pep RHNGNWQSLTSRNNTGDFPLGGILAHTGSGGSMTLTASDGIASKQRVSLFAKPIXAEQTL 
iMIIihllimilllillilhllllhllilllll « • I I I I I I I I || I | | | | | | 
g2 9 9 RHSGNWQS FTSRNNTGDFPLGG I LAQTCSCGGMTLTASDGKTG KQRVSLFAKPLLAEQTL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 9 9 . pep TVNGNTVSANGGGWQVLDTGAALPLTIHTEMPWDIGFI NIENPAGGITVSAMG INGAQLT 

lllllllllllllllllllllillhhillllllMIMIIIIIIIIIIIIIIIMIII 

g2 9 9 TVNGNTVSANGGGWQVLDTGAALPLAIQTEMPWDIGFINI ENP AGGITVSAMG INGAQLT 

190 200 210 220 230 240 

250 260 270 280 290 300 

m2 9 9 . pep QWS KWRADRMNDLAQTGADLVI LS YGTKEAFNNNIDIADTEQKWLDTVRQI RDSLPAAG I 

llinillilllllllllilllililllllllllilllMIIIIKIIIIIiillillli 
g299 QWSKWRADRMNDI^C^GADLVILSYGTNEAFNNN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m2 9 9 . pep L 1 1 GAPESLKNTLG VCGTRPVRLTEVQQMQRRVARQGQTMFWS WQNAMGGI CSMKNWLNQ 

llllllllllllllltlllfl llllllltlllltlllllllllliiilllMlllllli 
g2 9 9 LIIGAPESLKNTLGVCGTRPVLLTEVQQMQRRVARQGQTMFWSWONAMGGICSMKNWLNQ 
310 320 330 340 350 360 



370 380 390 

ra2 9 9 . pep GWAAKDGVHFSAKGYRRAAEMLADSLEBLVRSAAIRQX 

MM II Mill h! HUM IN MMMIM lllll 

g299 G WAAKDGVH PS AQG YRRAAEMLADSLEELVRAAA I RQX 

370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1251>: 

a299. seq 

1 ATGAACCCCA AACACCTCAT CGCATTTTCC GCCCTATTCG CCGCCACGCA 

51 GGCAGAAGCC CTACCTGTCG CCTCAGTCAG CCTCGACACC GTTACCGTTT 

101 CCCCGTCCGC CCCCTACACC GATACAAACG GGCTGCTGAC CGACTACGGC 

151 AACGCCTCCG CCTCGCCTTG GATGAAAAAA CTCCAATCCG TCGCACAAGG 

201 CAGCGGCGAG ACCTTCCGTA TCCTGCAAAT CGGCGACTCG CATACCGCCG 

251 GCGACTTCTT TACCGACAGC CTGCGCAAAC GCCTACAAAA AACTTGGGGC 

301 GACGGCGGCA TAGGCTGGGT TTACCCCGCC AACGTCAAAG GGCAGCGCAT 

351 GGCGGCCGTC CGGCACAACG GTAACTGGCA AAGCCTCACC AGCAGGAACA 

4 01 ACACCGGAGA CTTCCCGCTC GGCGGCATCC TCGCCCACAC CGGCAGCGGC 

451 GGCAGCATGA CCCTGACCGC ATCGGACGGC ATAGCAAGCA AGCAGCGCGT 

501 TTCCCTGTTT GCCAAACCCC TGCTTGCCGA ACAAACCCTG ACCGTCAACG 

551 GCAACACCGT CTCCGCCAAC GGCGGCGGCT GGCAGGTACT GGATACGGGC 

601 GCGGCACTGC CCCTGACCAT ACACACCGAA ATGCCGTGGG ACATCGGCTT 

651 CATCAACATC GAAAATCCCG CCGGCGGCAT TACCGTTTCC GCGATGGGCA 

701 TCAACGGCGC ACAATTAACC CAGTGGTCGA AATGGCGTGC CGACCGTATG 

751 AACGACCTTG CCCAAACCGG CGCCGATCTA GTCATCCTTG CCTACGGTAC 

801 CAACGAAGCC TTCGGCGACA ACATCGACAT TGCCGATACC GAACAGAAAT 

851 GGCTGGATAC CGTCCGCCAA ATCCGCGACA GCCTACCrGC CGCCGGCATC 

901 CT CAT CAT CG GCGCGCCCGA ATCCCTGAAA AACACGCTCG GCGTATGCGG 

951 CACACGCCCC GTCCGCCTGA CCGAAGTCCA ACAGATG"AG CGGCGCATCG 

1001 CCCGTCAGGG GCAGACGATG TTCTGGTCTT GGCAAAACGC GATGGGCGGC 

1051 GTTTGCAGCA TGAAAAACTG GCTCAACCAC GGATGGGCCG CCAAAGACGG 

1101 CGTACACTTT TCCGCCAAAG GCTACCAACG GTCGGCGGAA ATGCTCGCCG 

1151 ACAGCCTCGA AGAACTCGTC CGCTCCGCTG CAATCAGGCA ATAA 
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This corresponds to the amino acid sequence <SEQ ID 1252; ORF 299.a>: 

a299.pep 

1 MNPKHLIAFS ALFAATQAEA LPVASVSLDT VTVSPSAPYT DTNGLLTDYG 

51 NASASPWMKK LQSVAQGSGE TFRILQIGDS HTAGDFFTDS LRKRLQKTWG 

101 DGGIGWVYPA NVKGQRMAAV RHNGNWQSLT SRNNTGDFPL GGILAHTGSG 

151 GSMTLTASDG IASKQRVSLF AKPLLAEQTL TVNGNTVSAN GGGWQVLDTG 

201 AALPLTIHTE MPWDIGFINI ENPAGGITVS AMGINGAQLT QWSKWRADRM 

251 NDLAQTGADL VILAYGTNEA FGDNIDIADT EQKWLDTVRQ IRDSLPAAGI 

301 LIIGAPESLK NTLGVCGTRP VRLTEVQQMQ RRIARQGQTM FWSWQNAMGG 

351 VCSMKNWLNH GWAAKDGVHF SAKGYQRSAE MLADSLEELV RSAAIRQ* 

m299/a299 98.0% identity in 397 aa overlap 

10 20 30 40 50 60 

m29 9 . pep MNPKHLIAFSALFAATQAEALPVASVSLDTVTVSPSAPYTDTNGLLTDYGNASASPWMKK 
I I I I I I M I M I I I I I I I I I I I I I I I I I t 1 II I i I I II I I I I I I I I I I I I I ! | | I | | | | | 
a299 MNPKHLIAFSALFAATQAEALPVASVSLDTVTVSPSAPYTDTNGLLTDYGNASASPWMKK 

10 20 30 40 50 . 60 

70 80 90 100 110 120 

m299 . pep LQSVAQGSGETFRILQIGDSHTAGDFFTDSLRKRLQKTWGDGGIGWVYPANVKGQRMAAV 
I I I I I I I I II I I I I II I II I I II I I I I I I I I I I I I I I I I I I I I I II I II I I I I I | M | | 
a299 LQS VAQG SGETFRILQIGDS HTAG DFFTDS LRKRLQKTWG DGG I G W V Y PAN VKGQRMAAV 

70 80 90 100 110 120 

130 140 150 160 . 170 180 

m299 . pep RHNGNWQSLTSRNNTGDFPLGGILAHTGSGGSMTLTASDGIASKQRVSLFAKPLLAEQTL 
I I M I I I I I I I I I I I 1 1 I I I M I II I I I I I I I 1 1 I 1 1 I I I I 1 1 I 1 1 I I I I I I I I 1 1 1 I I I 
a299 RHNGNWQSLTSRNNTGDFPLGGILAHTGSGGSMTLTASDGIASKQRVSLFAKPLLAEQTL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m299 . pep TVNGNTVSANGGGWQVLDTGAALPLTIHTEMPWDIGF3NIENPAGGITVSAMGINGAQLT 
I I I II I I I I I I I I I 1 I I I 1 I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
a2 9 9 TVNGNTV SAN GGGWQVLDTGAALPLT I HTEMPWDIGFINIENPAGG IT VS AMGINGAQLT 

190 200 210 220 230 240 

250 260 270 280 290 300 

m2 99.pep QWSKWRADRMNDLAQTGADLVILSYGTNEAFNNNIDIADTEQKWLDTVRQIRDSLPAAGI 
I I I I I I I I I 1 I i I I I I I I I I I II : I I I I II i ;: I I I I I I I I I I I I I i I I I I I I I I M I I I 
a2 99 QWSKWRADRMNDLAQTGADLVILAYGTWEAFGDNIDIADTEQKWLDTVRQIRDSLPAAGI 

250 260 270 280 290 300 

310 320 330 340 350 360 

m2 9 9 . pep LI IGAPESLKNTLGVCGTRPVRLTEVQQMQRRVARQGQTMFWSWQNAMGGICSMKNWLNQ 

I I I! I I I I I I I I M I I I 11 I II I I I I I I I I I I: I I I I I I I M I I I I I I I I: t I I I I I I I : 
a2 99 LIIGAPESLKNTLGVCGTRPVRLTEVQQMQRRIARQGQTMFWSWQNAMGGVCSMKNWLNH 

310 320 330 340 350 360 

370 380 390 

m299 . pep GWAAKDGVH FS AKGYRRAAEMLADS LEELVRS AAI RQX 

MM IIMMI!lll:|:Mlllllimillll III I 
a2 99 GWAAKDGVH FSAKGYQRSAEMLADS LEELVRS AAI RQX 

370 380 390 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 1253>: 

g302.seq 9 

1 ATGCACTCAA TATATTTTTT TAAGGAGAAG CAGATGAGTC AAACCGACGC 

51 GCGTCGTAGC GGACGATTTT TACGCACAGT CGAATGGCTG GGCAATATGT 

101 TGCCGCACCC GGTTACGCTT TTTATTATTT TCATTGTGTT ATTGCTGATT 

151 GCCTCTGCCG TCGGTGCGTA TTTCGGACTA TCCGTCCCCG ATCCGCGTCC 

201 TGTTGGGGCG AAAGGACGTG CC3ATGACGG TTTGATTCAC GTTGTCAGCC 

251 TGCTCGATGC CGACGGTTTG ATCAAAATCC TGACGCATAC CGTTAAAAAT 

301 TTCACCGGTT TCGCGCCGTT GG3AACGGTG TTGGTTTCTT TATTGGGCGT 
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351 GGGGATTGCG GAAAAATCGG GCTTGATTTC CGCATTAATG CGCTTATTGC 

4 CI TCACAAAATC CCCACGCAAA CTCACTACTT TTATGGTTGT TTTTACAGGG 

4 51 ATTTTAT CCA ATACGGCTTC TGAATTGGGC TATGTCGTCC TAATCCCTTT 

501 GTCCGCCGTC ATCTTTCATT CGCTCGGCCG CCATCCGCTT GCCGGTTTGG 

551 CTGCGGCTTT CGCCGGCGTT TCGGGCGGTT ATTCGGCCAA TCTGTTCTTA 

601 GGCACAATCG ATCCGCTCTT GGCAGGCATC ACCCAACAGG C3GCGCAAAT 

651 CATCCATCCC GACTACGTCG TAGGCCCTGA AGCCAACTGG TTTTTTATGG 

701 CAGCCAGTAC GTTTGTGATT GCTTTGATTG GTTATTTTGT TACTGAAAAA 

751 ATCGTCGAAC CGCAATTGGG CCCTTATCAA TCAGATTTGT CACAAGAAGA 

801 AAAAGACATT CGGCATTCCA ATGAAATCAC GCCTTTGGAA TATAAAGGAT 

851 TAATTTGGGC AGGCGTGGTG TTTGTTGCCT TATCCGCCCT ATTGGCTTGG 

901 AGCATCGTCC CTGCCGACGG TATTTTGCGT CATCCTGAAA CAGGATTGGT 

951 TGCCGGTTCG CCGTTTTTAA AATCGATTGT TGTTTTTATT TTCTTGTTGT 

1001 TTGCGCTGCC GGGCATTGTT TATGGCCGGA TAACCCGAAG TTTGCGCGGC 

1051 GAACGGGAAG TCGTTAATGC GATGGCCGAA TCGATGAGTA CTTTGGGACT 

1101 TTATTTGGTC ATCATCTTTT TTGCCGCACA GTTTGTCGCA TTTTTTAATT 

1151 GGACGAATAT TGGGCAATAT ATTGCCGTTA AAGGGGCGGT GTTCTTAAAA 

1201 GAAGTCGGCT TGGGCGGCAG TGTGTTGTTT ATCGGTTTTA TTTTAATTTG 

1251 TGCTTTTATC AATCTGATGA TAGGCTCCGC CTCCGCGCAA TGGGCGGTAA 

1301 CTGCGCCGAT TTTCGTCCCT ATGCTGATGT TGGCCGGCTA CGCGCCCCAA 

1351 GTCATTCAAG CCGCTTACCG CAT CGGTGAT TCCGTTACCA ATATTATTAC 

1401 GCCGATGATG AGTTATTTCG GGCTGATTAT GGCGACGGTA ATCAAATACA 

1451 AAAAAGATGC GGGCGTAGGC ACGCTGATTT CTATGATGTT GCCGTATTCC 

1501 GCTTTCTTCT TAATTGCATG GATCGCCTTA TTCTGCATTT GGGTATTTGT 

1551 TTTGGGTCTG CCCGTCGGTC CCGGCACACC CACATTCTAT CCGGTGCCTT 

1601 AA 

This corresponds to the amino acid sequence <SEQ ID 1254; ORF 302.ng>: 

g302 -pep 

1 MHSIYFFKEK QMSQTDARRS GRFLRTVEWL GNMLPHP VTL FIIFIVLLLI 

51 ASAV GAYFGL SVPDPRPVGA KGRADDG LIH WSLLDADGL IKIL THTVKN 

101 FT GFAPLGTV LVSLLGVGIA EKSGLISALM RLLLTKSPRK LTTFMWFTG 

151 ILSNTAS ELG YWLIPLSAV I FHSLG RKPL AGLAAAFAGV SGGYSANLFL 

201 GTIDPLLAGI TQQAAQI IHP DYWGPEANW FFMAASTFVI ALIGYFV TEK 

251 IVEPQLGPYQ SDLSQEEKDI RHSNEITPLE YKGLI WAGW FVALSALLAW 

3 01 SIVPADGILR HPETGLVAGS PFLKS IWFI FLLFALPGIV YGR ITR5LRG 

351 EREWNAMAE SMST LGLYLV I I FFAAQFVA F FNWTNIGQY IAVKGAVFLK 

401 EVGLGGSVLF IGFILICAFI NLMI GSASAQ WAVTAPIFVP MLMLAGYA PQ 

451 VIQAAYRIGD 5VTN IITPMM SYFGLIMATV I KYKKDAGVG TLISMMLPYS 

501 AFFLIAWIAL FCIWVFVL GL PVGPGTPTFY PVP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1255>: 

m302 . seq 

1 ATGCACTCAA TATATTTTTT TAAGGAGAAG CAGATGAGTC AAACCGATAC 

51 GCAACGGGAC GGACGATTTT TACGCACAGT CGAATGGCTG GGCAATATGT 

101 TGCCGCATCC GGTTACGCTT TTTATTATTT TCATTGTGTT ATTGCTGATT 

151 GCCTCTGCCG TCGGTGCGTA TTTCGGACTA TCCGTCCCCG ATCCGCGCCC 

201 TGTTGGTGCG AAAGGACGTG CCGATGACGG TTTGATTTAC ATTGTCAGCC 

251 TGCTCAATGC CGACGGTTTT ATCAAAATCC TGACG CATAC CGTTAAAAAT 

301 TTCACCGGTT TCGCGCCGTT GGGAACGGTG TTGGTTTCTT TATTGGGCGT 

351 GGGGATTGCG GAAAAATCGG GCTTGATTTC CGCATTAATG CGCTTATTGC 

401 TCACAAAATC GCCACGCAAA CTCACTACTT TTATGGTTGT TTTTACAGGG 

451 ATTTTATCTA ATACCGCTTC TGAATTGGGC TATGTCGTCC TAATCCCTTT 

501 GTCCGCCATC ATCTTTCATT CCCTCGGCCG CCATCCGCTT GCCGGTCTGG 

551 CTGCGGCTTT CGCCGGCGTT TCGGGCGGTT ATTCGGCCAA TCTGTTCTTA 

601 AGCACAATCG ATCCGCTCTT GGCATGCATC ACCCATCAGG CGGCGGTCGT 

651 AGGCCCTGAA GCCAACTGGT TTTTTATGGT AG CCAGTACG TTTGTGATTG 

701 CTTTGATTGG TTATTTTGTT ACTGAAAAAA TCGTCGAACC GCAATTGGGC 

751 CCTTATCAAT CAGATTTGTC ACAAGAAGAA AAAGACATTC GGCATTCCAA 

801 TGAAATCACG CCTTTGGAAT ATAAAGGATT AATTTGGGCT GGCGTGGTGT 

851 TTGTTGCCTT ATCCGCCCTA TTGGCTTGGA GCATCGTCCC TGCCGACGGT 

901 ATTTTGCGTC AT CCTGAAAC AGGATTGGTT TCCGGTTCGC CGTTTTTAAA 

951 ATCGATTGTT GTTTTTATTT TCTTGTTGTT TGCACTGyCG GGCmTTGTTT 

1001 ATGGmCGGGT AAC CCGAAGT TTG CGCGGCG AACAGGAAGT CGTTAATGCG 
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1051 ATGGCCGAAT CGATGAGTAC TCTGGsGCTT TmTTTGswCA kcATCTTTTT 

1101 TGCCGCACAG TTTGTCGCAT TTTTTAATTG GACGAATATT GGGCAATATA 

1151 TTGCCGTTAA AGGGGCGACG TTCTTAAAAG AAGTCGGCTT GGGCGGCAGC 

1201 GTGTTGTTTA TCGGTTTTAT TTTAATTTGT GCTTTTATCA ATCTGATGAT 

1251 AGGCTCCGCC TCCGCGCAAT GGGCGGTAAC TGCGCCGATT TTCGTCCCTA 

1301 TGCTGATGTT GGCCGGCTAC GCGCCCGAAG TCATTCAAGC CGCTTACCGC 

1351 ATCGGTGATT CCGTTACCAA TATTATTACG CCGATGATGA GTTATTTCGG 

1401 GCTGATTATG GCGACGGTGA TCAAATACAA AAAAGATGCG GGCGTGGGTA 

14 51 CGCTGATTTC TATGATGTTG CCGTATTCCG CTTTCTTCTT GATTGCGTGG 

1501 ATTGCCTTAT TCTGCATTTG GGTATTTGTT TTGGGCCTGC CCX3TCGGTCC 

1551 CGGCGCGCCC ACATTCTATC CCGCACCTTA A 

This corresponds to the amino acid sequence <SEQ ED 1256; ORF 302>: 

m3 02 -pep 

1 MHSIYFFKEK QMSQTDTQRD GRFLRTVEWL GNMLPHP VTL FIIFIVLLLI 

51 ASAV GAYFGL SVPDPRPVGA KGRADD GLIY IVSLLNADGF IKIL THTVKN 

101 FT GFAPLGTV LVSLLGVGIA EKSGLISALM RLLLTKSPRK LTTFMWFTG 

151 ILSNTASE LG YWLIPLSAI IFHSL GRHPL AGLAAAFAGV SGGYSANLFL 

201 STIDPLLACI THQAAWGPE ANW FFMVAST FVIALIGYFV TEKIVEPQLG 

251 PYQSDLSQEE KDIRHSNEIT PLEYKGLIW A GWFVALSAL LAWSIV PADG 

301 ILRHPETGLV SG5PFLKS IV VFI FLLFALX GXVYG RVTRS LRGEQEWNA 

351 MAESMS TLXL XLXXIFFAAQ FVAF FNWTNI GQYIAVKGAT FLKEVGLGGS 

4 01 VLFIGFILIC AFINLMI GSA SAQ WAVTAPI FVPMLMLAGY A PEVIQAAYR 

451 IGDSVTN IIT PMMSYFGLIM ATVI KYKKDA GVGTLISMML PYS AFFLIAW 

501 IALFCIWVFV L GLPVGPGAP TFYPAP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 302 shows 94.0% identity over a 533 aa overlap with a predicted ORF (ORF 302.ng) 
from N. gonorrhoeae: 

m302/g302 



10 20 30 40 50 60 

m3 0 2 . pep MHS I Y F FKE KQMSQTDTQRJDGRFLRTVEWLGNML PHPVTL F 1 1 FIVLLLI ASAVGAYFGL 

I I I II I I I I I I I I I I h « I « I I I I I II I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I 
g3 0 2 MHSI YFFKEKQMSQTDARRSGRFLRTVEWLGNMLPHPVTLFI I FIVLLLI ASAVGAYFGL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m3 0 2 . pep SVPDPRPVGAKGRADDGLI YI VSLLNADGFI KI LTHTVKN FTG FAPLGTVLVS LLGVGI A 

III III III III II 1 1 ll-l Ml: IIHIIM M II I II Ml Ml llllllllllll 

g3 0 2 SVPD PRPVGAKGRADDGL IHWSLLDADGLI KI LTHTVKN FTGFAPLGTVLVSLLGVGIA 

70 80 90 100 110 120 



130 140 150 160 170 180 

m3 0 2 . pep EKSGLISALMRLLLTKSPRKLTTFMWFTGI LSNTAS ELGYWL I PLSAI IFHSLGRHPL 

M I M M 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 : 1 1 1 M 1 1 1 1 1 

g3 0 2 EKSGLISALMRLLLTKSPRKLTTFMWFTGI LSNTASELGYWLIPLSAVI FHSLGRHPL 

130 140 150 160 170 180 



190 200 210 220 230 

m302 .pep AGLAAAFAGVSGGYSANLFLSTIDPLLACITHQAA WGPEANWFFMVASTFVI 

M I MUM III MM MM M I III II I hi 1 1 1 1 1 1 1 1 1 1 1 M - 1 1 1 1 1 1 

g302 AGLAAAFAGVSGGYSANLFLGTIDPLLAGITQQAAQIIHPDYWGPEANWFFMAASTFVI 
190 200 210 220 230 240 



240 250 260 270 280 290 

m3 0 2 . pep ALIGYFVTEKI VEPQLGPYQSDLSOEEKDI RHSNEI TPLEYKGLI WAGWFVALSALLAW 

MUM III I IMM I IM II I III II MMII Mill MINIMI I Ml Ml III 1 1 

g302 ALIGYFVTEKIVEPQLGPYQSDLSQEEKDIRHSNEITPLEYKGLI WAGWFVALSALLAW 

250 260 270 280 290 300 



300 310 320 330 340 35C 
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ra3 02 . pep SIVPADGILRHPETGLVSGSPFLKSIWFIFLLFAIOCGXVYGRVTRSLRGEQEVVNAMAE 

I I I I I I I I I I I I I I I I I : I I . I II I I I I I I I II I I I I I I I h I I I I I I I : I I I I I I | I 
g302 SIVPADGILRHPETGLVAGSPFLKSIWFIFLLFALPGIVYGRITRSLRGEREWNAMAE 
310 320 330 340 350 360 

360 370 380 390 400 410 

m302.pep SMSTLXLXLXXIFFAAQFVAFFNWTNIGQYIAVKGATFLKEVGIiGGSVLFIGFILICAFI 

1 1 1 1 I I I i Mill ill I IIIIIMI IIHMhlllll II lllllll II III MM 

g302 SMSTLGLYLVIIFFAAOFVAFFNWTNIGQYIAVKGAVFLKEVGLGGSVLFIGFILICAFI 
370 380 390 400 410 420 



420 430 440 450 460 470 

m302.pep NLMIGSASAQWAVTAPI FVPMLMLAGYAPEVIQAAYRIGDSVTNI ITPMMS YFGLIMATV 

1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 II I 1 1 1 II I I 

g3 0 2 NLMI GS AS AQWAVTAP I FVPMLMLAGYAPQVI QAAYRIGDSVTNI ITPMMS YFGLIMATV 

430- 440 450 460 470 480 



480 490 500 510 520 

m3 02 . pep I KYKKDAGVGTL I SMML PYSAFFLI AW I ALFCI WVFVLGLPVGPGAPTFYPAPX 

i 1 1 1 II MM II I H IIIIMM MIIIMIH lllllll II llhll i Ihl I 

g302 I KYKKDAGVGTL I SMML PYSAFFLI AW I ALFC I WVFVLGLPVGPGTPTFYPVPX 

490 500 510 520 530 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1257>: 

a302 . seq 

1 ATGCACTCAA TATATTTTTT TAAGGAGAAG CAGATGAGTC AAACCGATAC 

51 GCAACGGGAC GGACGATTTT TACGCACAGT CGAATGGCTG GGCAATATGT 

101 TGCCGCACCC GGTTACGCTT TTTATTATTT TCATTGTGTT ATTGCTGATT 

151 GCCTCTGCCG CCGGTGCGTA TTTCGGACTA TCCGTCCCCG ATCCGCGCCC 

2 CI TGTTGGTGCG AAAGGACGTG CCGATGACGG TTTGATTCAC GTTGTCAGCC 

2 51 TGCTCGATGC TGACGGTTTG ATCAAAATCC TGACGCATAC CGTTAAAAAT 

301 TTCACCGGTT TCGCGCCGTT GGGAACGGTG TTGGTTTCTT TATTGGGCGT 

351 GGGGATTGCG GAAAAATCGG GCTTGATTTC CGCATTAATG CGCTTATTGC 

401 TCACAAAATC TCCACGCAAA CTCACTACTT TTATGGTTGT TTTTACAGGG 

4 51 ATTTTATCTA ATACCGCTTC TGAATTGGGC TATGTCGTCC TAATCCCTTT 

501 GTCCGCCATC ATCTTTCATT CCCTCGGCCG CCATCCGCTT GCCGGTCTGG 

551 CTGCGGCTTT CGCCGGCGTT TCGGGCGGTT ATTCGGCCAA TCTGTTCTTA 

601 GGCACAATCG ATCCGCTCTT GGCAGGCATC ACCCAACAGG CGGCGCAAAT 

651 CATCCATCCC GACTACGTCG TAGGCCCTGA AGCCAACTGG TTTTTTATGG 

7 Olo TAGCCAGTAC GTTTGTGATT GCTTTGATTG GTTATTTTGT TACTGAAAAA 

751 ATCGTCGAAC CGCAATTGGG CCCTTATCAA TCAGATTTGT CACAAGAAGA 

801 AAAAGACATT CGACATTCCA ATGAAATCAC GCCTTTGGAA TATAAAGGAT 

851 TAATTTGGGC TGGCGTGGTG TTTGTTGCCT TATCCGCCCT ATTGGCTTGG 

901 AGCATCGTCC CTGCCGACGG TATTTTGCGT CATCCTGAAA CAGGATTGGT 

951 TTCCGGTTCG CCGTTTTTAA AATCAATTGT TGTTTTTATT TTCTTGTTGT 

1001 TTGCACTGCC GGGCATTGTT TATGGCCGGG TAACCCGAAG TTTGCGCGGC 

1051 GAACAGGAAG TCGTTAATGC GATGGCCGAA TCGATGAGTA CTCTGGGGCT 

1101 TTATTTGGTC ATCATCTTTT TTGCCGCACA GTTTGTCGCA TTTTTTAATT 

1151 GGACGAATAT TGGGCAATAT ATTGCCGTTA AAGGGGCGAC GTTCTTAAAA 

1201 GAAGTCGGCT TGGGCGGCAG CGTGTTGTTT ATCGGTTTTA TTTTAATTTG 

1251 TGCTTTTATC AATCTGATGA TAGGCTCCGC CTCCGCGCAA TGGGCGGTAA 

1301 CTGCGCCGAT TTTCGTCCCT ATGCTGATGT TGGCCGGCTA CGCGCCCGAA 

1351 GTCATTCAAG CCGCTTACCG CATCGGTGAT TCCGTTACCA AT AT TAT T AC 

1401 GCCGATGATG AGTTATTTCG GGCTGATTAT GGCGACGGTG ATCAAATACA 

14 51 AAAAAGATGC GGGCGTGGGT ACGCTGATTT CTATGATGTT GCCGTATTCC 

1501 GCTTTCTTCT TGATTGCGTG GATTGCCTTA TTCTGCATTT GGGTATTTGT 

1551 TTTGGGCCTG CCCGTCGGTC CCGGCGCGCC CACATTCTAT CCCGCACCTT 

1601 AA 

This corresponds to the amino acid sequence <SEQ ID 1258; ORF 302.a>: 

a302.pep 

1 MHSIYFFKEK OMSQTDTQRD GRFLRTVEWL GNMLPHP VTL FIIFIVLLLI 

51 ASAA GAYFGL SVPDPRPVGA KGRADDG LIH VVSLLDADGL IKIL THTVKN 

101 FTG FAPLGTV LVSLLGVGIA EKSGLISALM RLLLTKSPRK LTTFMWFTG 



WO 99/57280 



PCT/US99/09346 



698 



151 ILSNTASE LG YWLIPLSAI IFHSL GRMPL AGLAAAFAGV SGGYSANLFL 

201 GTIDPLLAGI TQQAAQIIHP DYWGPEANW FFMVASTFVI ALIGYFV TEK 

2 51 IVEPQLGPYQ SDLSQEEKDI RHSNEITPLE YKGLIW AGVV FVALSALLAW ' 

301 SIVPADGILR HPETGLVSGS PFLKS IWFI FLLFALPGIV YG RVTRSLRG 

351 EQEWNAMAE SMST LGLYLV 1 1 FFAAQFVA F FNWTNIGQY IAVKGATFI.K 

4 01 EVGLGGS VLF IGFILICAFI KLMI GSASAQ W AVTAPIFVP MLMLAGYA PE 

4 51 VIQAAYRIGD SVTN IITPMM SYFGLIMATV I KYKKDAGVG TLISMMLPYS 

501 A FFLIAWIAL FCIWVFVL GL PVGPGAPTFY PAP* 

m302/a302 96. 1 % identity in 533 aa overlap 

10 20 30 40 50 60 

m3 02 . pep MHS IYFFKEKQMSQTDTQRDGRFLRTVEWLGbfMLPHPVTLFI IFI VLLLIASAVGAYFGL 

I I I I I I I I 1 1 ! I I I I i I I i I I I I I I I I I I I I I I I I ! I I I I I I I I I I I II I I I I : I I I I 1. 1 
a302 MHSIYFFKEKQMSQTDTQRDGRFLRTVEWLGNMLPHPVTLFIIFIVLLLIASAAGAYFGL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m302.pep SVPDPRPVGAKGRADDGLIYIVSLLNADGFIKILTHTVKNFTGFAPLGTVLVSLLGVGIA 
I I I I I I M I I I I I I II I II:: I It I: I I I: I I I I I I I I II I II I I li I M I I I I I I I I I I 
a302 SVPDPRPVGAKGRADDGLIHWSLLDADGLIKILTHTVKNFTGFAPLGTVLVSLLGVGIA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m302.pep EKSGLISALMRLLLTKSPRKLTTFMWFTGILSNTASELGYVVLIPLSAIIFHSLGRHPL 
I I I I I I I I I I I I I I li I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I 
a302 EKSGLISALMRLLLTKSPRKLTTFMWFTGILSNTASELGYWLIPLSAIIFHSLGRHPL 

130 140 150 160 170 180 

190 200 210 220 230 

m302 .pep AGLAAAFAGVSGGYSANLFLSTIDPLLACITHQAA WGPE ANWFFMVASTFV I 

I I I I I I I I I I I I I M I I II I : I I I I i I I 11:111 I I I I I I I I I I I | | I | | | I 

a302 AG L AAA FAGV S GG Y S AN L FLGT I D PLL AG I TQQAAQ 1 1 H P DYWGPEANW F FMVA ST FV I 

190 200 210 220 230 240 

240 250 260 270 280 290 

m302.pep ALIGYFVTEKIVEPQLGPYQSDLSQEEKDIRHSNEITPLEYKGLIWAGWFyALSALLAW 

I II I 1 1 I I 1 1 I II I 1 1 I 1 1 I I 1 1 I 1 1 I I I I 1 1 I 1 1 I I I I I I I I I I I I 1 1 I M I I I I 1 1 I I 
a302 ALIGYFVTEKIVEPQLGPYQSDLSQEEKDIRHSNEITPLEYKGLIWAGWFVALSALLAW 
250 260 270 280 290 300 

300 310 320 330 340 350 

ra302.pep SIVPADGILRHPETGLVSGSPFLKSIWFIFLLFALXGXVYGRVTRS LRGEQEWNAMAE 

I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a302 SIVPADGI LRHPETG1VSGSPFLKSIWFI FLLFALPGIVYGRVTRS LRGEQEWNAMAE 

310 320 330 340 350 360 

360 370 380 390 400 410 

m302 . pep SMSTLXLXLXXIFFAAQFVAFFNWTNIGQYIAVKGATFLKEVGLGGSVLFIGFILICAFI 
Mill I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I 11 I I 
a302 SMSTLGLYLVIIFFAAQFVAFFNWTNIGQYIAVKGATFLKEVGLGGSVLFIGFILICAFI 
370 380 390 400 410 420 

420 430 440 450 460 470 

m302 .pep NLMIGSASAQWAVTAPIFVPMLMLAGYAPEVIQAAYRIGDSVTNIITPMMSYFGLIMATV 
I I I I ! I I I I I 1 1 t I I I 1 1 I I 1 1 1 1 I I I 1 1 I 1 1 I II I I I I I I I t 1 11 I I I 1 1 I i I M I I I I 
a302 NLMIGSASAQWAVTAPIFVPMLMLAGYAPEVIQAAYRIGDSVTNIITPMMSYFGLIMATV 
430 440 450 460 470 480 

480 490 500 510 52C 

m302 . pep IKYKKDAGVGTLISMMLPYSAFFLIAWIALFCIWVFV1.GLPVGPGAPTFYPAPX 

1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 

a302 IKYKKDAGVGTLISMMLPYSAFFLIAWIALFCIWVFVLGLPVGPGAPTFYPAPX 
490 500 510 520 530 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1259>: 
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g305 . seq 

1 ATGGATTTTT TGATTGTCCT GAAAGCCCTG ATGATGGGCT TGGTAGAAGG 

51 TTTTACCGAA TTTTTACCGA TTTCCAGCAC CGGACATTTG . ATTGTGTTCG 

101 GCAATCTGAT TGGTTTTCAC AGCAATCACA AGGTTTTTGA AATTGCCATC 

151 CAGCTCGGTG CGGTTTTGGC GGTAGTGTTT GAATACCGGC AGCGTTTCAG 

201 CAATGTGTTG CATGGCGTGG GAAAAGACCG GAAAGCCAAC CGTTTCGTCC 

251 TCAATCTTGC CATTGCTTTT ATACCTGCCG CCGTGATGGG GCTGTTGTTC 

301 GACAAACAAA TCAAAGAGTA TCTGTTTAAC CCCTTGAGTG TTGCAGTCAT 

351 GCTGGTTTTG GGCGGTTTTT TTATTTTGTG GGTGGAGAAA CGCCAAAGCC 

401 GAGCAGAGCC TAAAATTGCC GATGTTGATG CATTGCGTCC GATTGATGCG 

451 TTGATGATCG GTGTTGCCCA AGTGTTTGCA CTGGTTCCGG GTACGTCCCG 

501 TTCGGGCAGT ACGGTTATGG GCGGGATGCT TTGGGGAATC GAGCGGAAAA 

551 CGGCAACGGA GTTTTCATTT TTCTTGGCCG TTCCGATGAT GGTTGCAGCA 

601 ACGGCTTATG ATGTCCTGAA ACATTACCGA TTTTTCACCC TGCATGATGT 

651 CGGTTTGATT TTGATAGGCT TTATTGCCGC TTTTGTTTCC GGTTTGGTAG 

701 CGGTTAAAGC ACTGCTGAAG TTTGTTTCCA AGAAAAACTA TATCCCGTTT 

751 GCCTATTACC GCATTGTTTT CGGCATTGTC ATCATAATAT TGTGGTTGTC 

801 GGG CTGG ATA AGTTGGGAAT GA 

This corresponds to the amino acid sequence <SEQ ID 1260; ORF 305 .ng>: 

g305.pep 

1 MD FLIVLKAL MMGLVEGFT E FLPISSTGHL IVFGNLIGFH SNH KVFEIAI 

51 QLGAVLAWF E YRQRFSNVL HGVGKDRKAN R FVLNLAIAF I PAAVMGLL F 

101 DKQIKEYLFN P LSVAVMLVL GGFFILWV EK RQSRAEPKIA DVDALRPIDA 

151 LMIGVAQVFA LVPG TSRSGS TVMGGMLWGI ERKTATEFS F FLAVPMMVAA 

201 TAYDVLKHYR FFTLHDVG LI LIGFIAAFVS GLVAV KALLK FVSKKNYIPF 

251 AYYR IVFGIV IIILWLSGWI SW E* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 126 1>: 

m305.seq (partial) 

1 AtGGATTTTC TGATTGTCCT GAAAGCCCTG ATGATGGGCT TGGTAGAAGG 

51 TTTTACCGAA TTTTTACCGA TTTCCAGCAC CGGACATTTG ATTGTGTTCG 

101 GCAATCTGAT TGGTTTTCAC AGCAATCACA AGGTTTTTGA AATTGCCATC 

151 CAGCTCGGTG CAGTTTTGGC GGTAGTGTTT GAATACCGGC AACGTTTCAG 

201 CAATGTGTTG CACGGCTTGG GAAAAGACCG GAAAGCCAAC CGCTTCGTCC 

251 TTAATCTTGC CATTGCTTTT ATACCTGCCG CCGTGATGGG GCTGTTGTTC 

301 GGCAwACAAA TCAAAGAGyA TCTGTTTAAC CCCTTGAGTG TTGCAGTCAT 

351 GCTGGTTyTG GrCGGTTTTT yTATTTTGTG GGTGGAGAAA CGCCAAAGCC 

401 GAGCAGAGCC TAAAATTGCC GATGTTGATG CATTGCGTCC GATTGATGCC 

451 TTGATGATCG GCGTTGCCCA AGTGTTTGCA CTGGTTCCGG GTACGTCCCG 

501 TTCGGGCAGT ACGATTATGG GCGGGATGCT TTGGGGCATC GAACGGAAAA 

551 CTGCGACAGA ATTCTCGTTT TTCTTGGCTG TGCCGATGAT GGTTGCCGCA 

601 ACGGCTTATG ATGTCCTGAA ACATTACCGA TTTTTCACCC TGCATGATGT 

651 CGGTTTGATT CTGATAGGCT TTATTGCTGC CTTTGTTTCA GGCTTGGTAG 

701 CGGTAAAAGC GTTGCTGAGG TTTGTTTCGG GTAC. . . 

This corresponds to the amino acid sequence <SEQ ED 1262; ORF 305>: 

m305.pep (partial) 

1 MD FLIVLKAL MMGLVEGFT E FLPISSTGHL IVFGNLIGFH SNH KVFEIAI 

51 QLGAVLAWF E YRQRFSNVL HGLGKDRKAN R FVLNLAIAF I PAAVMGLL F 

101 GXQIKEXLFN P LSVAVMLVL XGFXILWV EK RQSRAEPKIA DVDALRP IDA 

151 LMIGVAQVFA LVPG TSRSGS TIMGGMLWGI ERKTATEFS F FLAVPMMVAA 

201 TAYDVL KHYR FFTLHDVG LI LIGFIAAFVS GLVAV KALLR FVSG. . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 305 shows 96.7% identity over a 243 aa overlap with a predicted ORF (ORF 305 .ng) 
from N. gonorrhoeae: 

g305/m305 

10 20 30 40 50 60 

g305.pep MD FL I VLKALMMGLVEG FTEFLP I S S TGHL I VFGNL I GFHS NHKVFE I A I QLGAVLAWF 

IM fill Ml Mil Mill II MM II MINIMI I IN lllllll Mill Ml MM I 

m305 MDFLIVLKAIJ^IMGLVEGFTEFLPISSTGHLIVFGNLIGFHSNHKVFEIAIQLGAVLAVVF 
10 20 30 40 50 60 
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70 80 90 100 110 120 

g305 . pep EYRQRFSWLHGVGKDRKANRWI^LAIAFIPAAVMGLLFDKQIKEYLFNPLSVAVMLVL 

1 1 MM III 1 1 Mil Ml I II MIMMIIIMM IIMI MM I III II I MUM 

m3 05 EYRQRFSNVLHGLGKDRKANRFVLNLAIAFI PAAVMGLLFGXQI KEXLFNPLSVAVMLVL 

70 80 90 100 110 120 



130 140 150 160 170 180 

g305 - pep • GGF F I LWVE KRQS RAE P KI ADVDALRP I DALM I GVAQVFALVPGTS RS G S TVMGGML WG I 

II III Mill III MUM III MM MM III III Ml II III II I Ihll MM II 

m305 XGFXILWVEKRQSRAEPKIADVDALRPIDALMIGVAQVFALVPGTSRSGSTIMGGMLWGI 
130 140 150 160 170 180 



190 200 210 220 230 240 

g3 05 . pep ERKTATE FS F FLiAVPMMVAATA YDVLKH YRF FTLHDVGL I LI GFI AAFVSGLVAV KALLK 

MINIM Ml MM IN MINI II II IN III MUM MINI M 1 1! Ml MMM 

m305 ERKTATE FS FFLAVPMMVAATAYDVLKH YRFFTLHDVGLILI GFI AAFVSGLVAVKALLR 

190 200 210 220 230 240 



250 260 270 

g305 .pep FVSKKNYIPFAYYRIVFGIVIIILWLSGWISWEX 
III 

m3 05 FVSG 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1263>: 

a305.seq 

1 ATGGATTTTC TGATTGTCCT GAAAGCCCTG ATGATGGGCT TGGTAGAAGG 

51 TTTTACCGAA TTTTTACCGA TTTCCAGCAC CGGACATTTG ATTGTGTTCG 

101 GCAATCTGAT TGATTTTCAC AGCAATCACA AGGTTTTTGA AATTACCATC 

151 CAGCTCGGTG CGGTTTTGGC GGTAGTGTTT GAATACCGGC AGCGTTTCAG 

201 CAATGTGTTG CATGGCGTGG GAAAAGACCG GAAAGCCAAC CGTTTCGTCC 

251 TTAATCTTGC CATTGCTTTT ATACCTGCCG CCGTGATGGG GCTGTTGTTC 

301 GGCAAACAAA TCAAAGAGTA TCTGTTTAAC CCCTTGAGTG TTGCAGTCAT 

351 GCTGGTTTTG GGCGGTTTTT TTATTTTGTG GGTGGAGAAA CGCCAAAGCC 

401 GAGCAGAGCC TAAAATTGTC GATGTTGATG CATTGCGTCC GATTGATGCG 

451 TTGATGATCG GCGTTGCCCA AGTGTTTGCA CTGGTTCCAG GTACGTCCCG 

501 TTCGGGCAGT ACGATTATGG GCGGGATGCT TTGGGGAATC GAGCGGAAAA 

551 CGGCAACGGA GTTTTCATTT TTCTTGGCCG TTCCGATGAT GGTTGCAGCA 

601 ACGGCTTATG ATGTCCTGAA GCATTACCGG TTTTTCACCC TGCATGATGT 

651 CGGTTTGATT TTGATTGGCT TTGTTGCTGC CTTTGTTTCA GGCTTGGTGG 

701 CGGTCAAAGC GTTGCTGAGG TTTGTTTCCA AGAAAAATTA TATTCCTTTT 

751 GCCTATTACC GCATTGTTTT TGGTATTGCC AT C ATT AT AT TGTGGCTGTC 

801 AGGCTGGATA AGTTGGGAAT GA 



This corresponds to the amino acid sequence <SEQ ID 1264; ORF 305,a>: 

a305.pep 

1 MD FLIVLKAL MMGLVEGFT E FLPISSTGHL IVFGNLIDFH SNH KVFEITI 

51 QLGAVLAWF E YRQRFSNVL HGVGKDRKAN R FVLNLAIAF IPAAVMGLL F 

101 GKQIKEYLFN P LSVAVMLVL GGFFILWV EK RQSRAEPKIV DVDALRPIDA 

151 LMIGVAQVFA LVPG TSRSGS TIMGGMLWGI ERKTATE FS F FLAVPMMVAA 

201 TAYDVL KHYR FFTLHDVG LI LIGFVAAFVS GLVAV KALLR FVSKKNYIPF 

251 AYYRIVFGIA IIILWLSGWI SWE* 



m305/a305 96.3% identity in 243 aa overlap 

10 20 30 40 50 60 

m305 . pep MDFLIVLKALMMGLVEGFTEFLPISSTGHLIVFGNLIGFHSNHKVFEIAIQLGAVLAWF 

I I I I I I I M I I I I I I I I I I I I ! I I I I I I I I I I ! ! M I I I I I I I I I I t : I I I I I I I I I I i 
a305 MDFLIVLKALMMGLVEGFTEFLPISSTGHLIVFGNLIDFHSNHKVFEITIQLGAVLAWF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m305 . pep EYRQRFSNVLHGLGKDRKANRFVLNLAIAFIPAAVMGLLFGXQIKEXLFNPLSVAVMLVL 

I I 1 1 1 1 1 1 1 1 1 1 : II : 1 1 1 1 1 1 1 1 1 1 II M 1 1 I M II M M Mil 1 1 i I ; 1 1 I 1 1 1 1 1 

a305 E YRQR FSN V LHG VGKDRKAN R FVLNLA I AF I PAA VMG LL FG KQ I KE Y L FN PLS V A VKLV L 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m305 . pep XGFXI LWVEKRQSRAEPKIADVDALRPI DALMIGVAQVFALVPGTSRSGSTIMGGMLWGI 

II I I I I I I I I I I I I I I I : I I i I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I I I I I 
a305 GGFFILWVEKRQSRAEPKIVDVDALRPIDALMIGVAQVFALVPGTSRSGSTIMGGMLWGI 

130 140 150 160 170 180 

190 200 210 220 230 240 

m305 . pep ERKTATEFSFFLAVPMMVAATAYDVLKHYRFFTLHDVGLILIGFIAAFVSGLVAVKALLR 
I I I I ! I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 II I : I! I I I I I II I I I I I 1 
a305 ERKTATEFS FFLAVPMMVAATAYDVLKHYRFFTLHDVGLI LIGFVAAFVSGLVAVKALLR 

190 200 210 220 230 240 



m305.pep FVSG 
I I I 

a305 FVSKKNYIPFAYYRIVFGIAIIILWLSGWISWEX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1265>: 



g306 . seq 












1 


ATGTTTATGA 


ACAAATTTTC 


CCAATCCGGA 


AAAGGTCTGT 


CCGGTTTCTT 


51 


CTTCGGTTTG 


ATACTGGCAA 


CGGTCATTAT 


TGCCGGTATT 


TTGCTTTATC 


101 


TGAACCAGGG 


CGGTCAAAAT 


GCGTTCAAAA 


TCCCGGCTCC 


GTCGAAGCAG 


151 


CCTGCAGAAA 


CGGAAATCCT 


GAAACTGAAA 


AACCAGCCTA 


AGGAAGACAT 


201 


CCAACCTGAA 


CCGGCCGATC 


AAAACGCCTT 


GTCCGAACCG 


GATGTTGCGA 


251 


AAGAGGCAGA 


GCAGTCGGAT 


GCGGAAAAAG 


CTGCCGACAA 


GCAGCCCGTT 


301 


GCCGACAAAG 


CCGACGAGGT 


TGAAGAAAAG 


GCGGGCGAGC 


CGGAACGGGA 


351 


AGAGCCGGAC 


GGACAGGCAG 


TGCGCAAGAA 


AGCACTGACT 


GAAGAGCGTG 


401 


AACAAACCGT 


CAGGGAAAAA 


GCGCAGAAGA 


AAGATGCCGA 


AACGGTTAAA 


451 


AAAAAAGCGG 


TAAAACCGTC 


TAAAGAAACA 


GAGAAAAAAG 


CTTCAAAAGA 


501 


AGAGAAAAAG 


GCGGCGAAAG 


AAAAAGTTGC 


ACCCAAACCG 


ACCCCGGAAC 


551 


AAATCCTCAA 


CAGCCGCAGT 


ATCGAAAAAG 


CGCGTAGTGC 


CGCTGCCAAA 


601 


GAAGTGCAGA 


AAATGAAAAA 


CTTTGGGCAA 


GGCGGAAGCC 


AACGCATTAT 


651 


CTGCAAATGG 


GCGCGTATGC 


CGAACCCCGG 


AGCGCGGAAG 


GGCAGCGTGC 


701 


CAAACTGGCA 


ATCTTGGGCA 


TATCTTCCGA 


AGTGGTCGGC 


TATCAGGCGG 


751 


GACATAAAAC 


GCTTTACCGC 


GTGCAAAGCG 


GCAATATGTC 


CGC CGATGCG 


801 


GTGA 











This corresponds to the amino acid sequence <SEQ ID 1266; ORF 306.ng>: 

g3 06 .pep 

1 MFMNKFSQSG KGLSG FFFGL ILATVIIAGI LL YLNQGGQN AFKIPAPSKQ 

51 PAETEILKLK NQPKEDIQPE PADQNALSEP DVAKEAEQSD AEKAADKQPV 

101 ADKADEVEEK AGEPEREEPD GQAVRKKALT EEREQTVREK AQKKDAETVK 

151 KKAVKPSKET EKKASKEEKK AAXEKVAPKP TPEQILNSRS I EKARS AAAK 

201 EVQKMKNFGQ GGSQRIICKW ARMPNPGARK GSVPNWQSWA YLPKW3AIRR 

251 DIKRFTACKA AICPPMR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1267>: 



m306 .seq 


(partial) 










1 


. . GGTTTGTTCT 


TCGGTTTGAT 


ACTGGCGACG 


GTCATTATTG 


CCGGTATTTT 


51 


GTTTTATCTG 


AACCAGAGCG 


GTCAAAATGC 


GTTCAAAATC 


CCGGCTTCGT 


101 


CGAAGCAGCC 


TGCAGAAACG 


GAAATCCTGA 


AACCGmAwAA 


CCAG CyTAAG 


151 


GAAGACATCC 


AACCTGAwCC 


GGCCGATCAA 


AACGCCTTGT 


CCGAACCGGA 


201 


TGCTGCGACA 


GAGGCAGAGC 


AGTCGGATGC 


GGAAAAwGCT 


GCCGACAAGC 


251 


AGCCCGTTGC 


CGATAAAGCC 


GACGAGGTTG 


AAGAAAAGGC 


GGGCGAGCCG 


301 


GAACGGGAAG 


AGCCGGACGG 


ACAGGCAGTG 


CGTAAGAAAG 


CGCTGACGGA 


351 


AGAGCGTGAA 


CAAACCGTCA 


GGGAAAAAGC 


GCAGAAGAAA 


GATGCCGAAA 


401 


CGGTTAAAAw 


ACAAGCGGTA 


AAACCGTCTA 


AAGAAACAGA 


GAAAAAAGCT 


451 


TCAAAAGAAG 


AGAAAAAGGC 


GGCGAAGGAA 


AAAGTTGCAC 


CCAAACCAAC 


501 


CCCGGAACAA 


ATCCTCAACA 


GCGGCAGCAT 


CGAAAAAGCG 


CGCAGTGCCG 


551 


CCGCCAAAGA 


AGTG CAG AAA 


ATGAAAACGC 


CGACAAGGCG 


GAAGCAACGC 
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601 ATTATCTGCA AATGGGCGCG TATGCCGACC GTCAGAGCGC GGAAGGGCAG 
6 51 CGTGCCAAAC TGGCAATCTT GGGCATATCT TCCAAGGTGG TCGGTTATCA 
701 GGCGGGACAT AAAACGCTTT ACCGGGTGCA AAGCGGCAAT ATGTCTGCCG 
751 ATGCGGTGA 

This corresponds to the amino acid sequence <SEQ ID 1268; ORF 306>: 
m306 . pep (partial ) 

1 . . G LFFGLILAT VIIAGILF YL NQSGQNAFKI PASSKQPAET EILKPXNQXK 
51 EDIQPXPADQ NALSEPDAAT EAEQSDAEXA ADKQPVADKA DEVEEKAGEP 
101 EREEPDGQAV RKKALTEERE QTVREKAQKK DAETVKXQAV KPSKETEKKA 
151 SKEEKKAAKE KVAPKPTPEQ ILNSGSIEKA RSAAAKEVQK MKTPTRRKQR 
201 IICKWARMPT VRARKGSVPN WQSWAYLPRW SVIRRDIKRF TGCKAAICLP 
251 MR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 306 shows 88.9% identity over a 253 aa overlap with a predicted ORF (ORF 306.ng) 
from N. gonorrhoeae: 

ra306/g306 

10 20 30 40 

GLFFGLI LATVI I AG I LFYIiNQSGQNAFKI PASS KQPAETE I LKPX 

hllllllMIMIIIhllihlllllilll lilllllllll 

MFMNKFSQSGKGLSGFFFGLILATVIIAGILLYLNQGGQNAFKIPAPSKQPAETEILKLK 
10 20 30 40 50 60 

50 60 70 80 90 100 

NQX KED I Q PX P ADQNALS E PDAATE AE QSDAEXAADKQ PVADKADEVEEKAGEP EREE PD 

ii imi ii inn in ii hi run in iiiiiiiiiiiiiiiiiiiiiiiiiii 

NQPKED I QPEPADQNALS EPDVAKEAEQSDAEKAADKQPVADKADEVEEKAGEPEREEPD 
70 80 90 100 110 120 

110 120 130 140 150 160 

GQAVRKKALTEEREQTVREKAQKKDAETVKXQAVKPSKETEKKASKEEKKAAKEKVAPKP 

I II I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I I I I I I I I I I I I I I I I I I I | | | | | 
GQAVRKKAL7EEREQTWEKAQKKDAETVKKJCAVKPS 

130 140 150 160 170 180 

170 180 190 200 210 220 

T PEQ I LNSGS I E KARSAAAKEVQKMKTPTRR - KQRI I CKWARM PTVRARKGS V PNWQ SWA 

llllllll llllllllllllllllh : nilllllllll: lilllllllll II 
TPEQILNSRS IEKARSAAAKEVQKMKNFGQGGSQRI ICKWARMPNPGARKGSVPNWQSWA 
190 200 210 220 230 240 

230 240 250 

YLPRWSVIRRDIKRFTGCKAAICLPMRX 

llhlhllllllllhllllll III! 
YLPKWSAIRRDIKRFTACKAAICPPMRX 
250 260 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1269>: 

a306. seq 

1 ATGTTTATGA ACAAATTTTC CCAATCCGGA AAAGGTCTGT CCGGTTTTTT 

51 CTTCGGTTTG ATACTGGCGA CGGTCATTAT TGCCGGTATT TTGTTTTATC 

101 TGAACCAGAG CGGTCAAAAT SCGTTCAAAA TCCCGGTTCC GTCGAAGCAG 

151 CCTGCAGAAA CGGAAATCCT GAAACCGAAA AACCAGCCTA AG GAAGAC AT 

201 CCAACCTGAA CCGGCCGATC AAAACGCCTT GTCCGAACCG GATGCTGCGA 

251 AAGAGGCAGA GCAGTCGGAT GCGGAAAAAG CTGCCGACAA GCAGCCCGTT 

301 GCCGACAAAG CCGACGAGGT TGAGGAAAAG GCGGACGAGC CGGAGCGGGA 

351 AAAGTCGGAC GGACAGGCAG TGCGCAAGAA AGCACTGACG GAAGAGCGTG 

A 01 AACAAACCGT CGGGGAAAAA GCGCAGAAGA AAGATGCCGA AACGGTTAAA 

4 51 AAACAAGCGG TAAAACCATC TAAAGAAACA GAGAAAAAAG CTTCAAAAGA 

501 AGAGAAAAAG GCGGAGAAGG AAAAAGTTGC ACCCAAACCG ACCCCGGAAC 

551 AAATCCTCAA CAGCGGCAGC ATCGAAAAAG CGCGCAGTGC CGCTGCCAAA 



ra306 .pep 
9306 

ra306 .pep 
g306 

m3 0 6 . pep 
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601 GAAGTGCAGA AAATGAAAAC GCCGACAAGG CGGAAGCAAC GCATTATCTG 
651 CAAATGGGCG CGTATGCCGA CCGCCGGAGC GCGGAAGGGC AGCGTGCCAA 
701 ACTGGCAATC TTGGGCATAT CTTCCAAGGT GGTCGGTTAT CAGGCGGGAC 
751 ATAAAACGCT TTACCGGGTG CAAAGCGGCA ATATGTCTGC CGATGCGGTG 
801 A 

This corresponds to the amino acid sequence <SEQ ID 1270; ORF 306.a>: 

a306.pep 

1 MFMNKFSQSG KGLSG FFFGL ILATVIIAGI LF YLNQSGQN AFKIPVPSKQ 

51 PAETEILKPK NQPKEDIQPE PADQNALSEP DAAKEAEQSD AEKAADKQPV 

101 ADKADEVEEK ADEPEREKSD GQAVRKKALT EEREQTVGEK AQKKDAETVK 

151 KQAVKPSKET EKKASKEEKK AEKEKVAPKP TPEQILNSGS ZEKARSAAAK 

201 EVQKMKTPTR RKQRIICKWA RMPTAGARKG 5VPNWQSWAY LPRWSVIRRD 

251 IKRFTGCKAA ICLPMR* 

m306/a306 93.7% identity in 252 aa overlap 

10 20 30 40 

. m306.pep GLFFGLILATVT I AGI LFYLNQSGQNAFKI PASSKQPAETEI LKPX 

I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I : I I I I I I | } | | | | 
a306 MFMNKFSQSGKGLSGFFFGLILATVIIAGI LFYLNQSGQNAFKI PVPSKQ PAETEILKPK 

10 20 30 40 50 60 

50 60 70 80 90 100 

m306 . pep NQXKED I QPX PADQNALSE PDAATEAEQS DAEXAADKQPVADKADEVEEKAGE PEREEPD 

I! I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I M I I I I I I i I Mill: I 
a30 6 NQPKEDI QPE PADQNALSE PDAAKEAEQS DAEKAADKQPVADKADEVEEKADE PEREKS D 

70 8C 90 100 110 120 

110 120 130 140 150 160 

m3 0 6 . pep GQAVRKKALTEEREQTVREKAQKKDAETVKXQAVKPSKETEKKASKEEKKAAKEKVAPKP 
1 I I I M I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a3 0 6 GQAVRKKALTEEREQTVGEKAQKKDAETVKKQAVKPGKETEKKASKEEKKAEKEKVAPKP 
130 140 150 160 170 180 

170 180 190 200 210 220 

m30 6 . pep TPEQILNSGS IEKARSAAAKEVQKMKTPTRRKQRI I CKWARM PT VRARKGS V PNWQS WAY 

I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I : I I I I II I II I I I I I 
a306 TPEQI LNSGS IEKARSAAAKEVQKMKTPTRRKQRI I CKW ARM PTAGARKGSVPNWQSW AY 

190 200 210 220 230 240 

230 240 250 

m30 6 . pep LPRWSVIRRDIKRFTGCKAAICLPMRX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a306 LPRWSVIRRDIKRFTGCKAAICLPMRX 
250 260 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1271 >: 

g307 . seq 

1 atgaaaacct tcttcaaaac cctttcgacc gcgtcactcg cgctcatcct 

51 cgcagcctgc ggcggtcaaa aagacagcgc gcccgcagcc tctgccgccg 

101 ccccttctgc cgataacggc gcggcgaaaa aagaaatcgt cttcggcacg 

151 accgtgggcg acttcggcga tatggtcaaa gaacaaatcc aagccgagct 

201 ggagaaaaaa ggctacaccg tcaaattggt cgaatttacc gactatgtgc 

251 gcccgaatct ggcattggcg gagggcgagt tggacatcaa cgtcttccaa 

301 cacaaaccct atcttgacga tttcaaaaaa gaacacaacc tggacatcac 

351 cgaagccttc caagtgccga ccgcgccttt gggactgtat ccgggcaaac 

401 tgaaatcgct ggaagaagtc aaagacggca gcaccgtatc cgcgcccaac 

451 gacccgtcca acttcgcacg cgccttggtg atgctgaacg aactgggttg 

501 gatcaaactc aaagacggca tcaatccgct gaccgcatcc aaagccgaca 

551 tcgcggaaaa cctgaaaaac atcaaaatcg tcgagcttga agccgcacaa 

601 ccgccgcgca gccgcgccga cgtggatttt gccgtcgtca acggcaacta 

651 cgccataagc agcggcat:ga agctgaccga agccctgttc caagagccga 
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701 gctttgccta tgtcaactgg tctgccgtca aaaccgccga caaagacagc 
751 caatggctta aagacgtaac cgaggcccat aaccccgacg cgttcaaagc 
801 ctacgcgcac aaacgcttcg agggctacaa ataccctgcc gcatggaatg 
651 aaggcgcagc caaataa 

This corresponds to the amino acid sequence <SEQ ID 1272; ORF 307.ng>: 
g3 07.pep 

1 MKTFFKTLST ASLALILAAC GGQKDSAPAA SAAAPSADNG AAKKEIVFGT 

51 TVGDFGDMVK EQIQAELEKK GYTVKLVEFT DYVRPNLALA EGELDINVFQ 

101 HKPYLDDFKK EHNLDITEAF QVPTAPLGLY PGKLKSLEEV FCDGSTVSAPN 

151 DPSNFARALV MLNELGWIKL KDGINPLTAS KADIAENLKN IKIVELEAAQ 

201 LPRSRADVDF AWNGNYAIS SGMKLTEALF QEPSFAYVNW SAVKTADKDS 

251 OWLKDVTEAY NSDAFKAYAH KRFEGYKYPA AWNEGAAK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1273>: 

m307.seq (partial) 

1 . .CAATGGCTTA AAGACGTAAC CGAGGCCTAT AACTCCGACG CGTTCAAAGC 

51 CTACGCGCAC AAACGCTTCG AGGGCTACAA ATCCCCTGCC GCATGGAATG 

101 AAGGCGCAGC CAAATAA 

This corresponds to the amino acid sequence <SEQ ID 1274; ORF 307>: 
m307.pep {partial) 

1 ..QWLKDVTEAY NSDAFKAYAH KRFEGYKSPA AWNEGAAK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 307 shows 97.4% identity over a 38 aa overlap with a predicted ORF (ORF 307.ng) 
from K gonorrhoeae: 

m307/g307 



10 20 30 

m3 0 7 . pep QWLKDVTEAYNSDAFKAYAHKRFEGYKS PA 

llllimilllllllMIIMMIH II 

9307 SGMKLT EALFQEP S FAYVNWS AVKTAD KDS QWLKDVTE A YNS DAFKA YAHKR FEG Y KYPA 

230 240 250 260 270 280 



39 

m307.pep AWNEGAAKX 

Illllllll 
g3 07 AWNEGAAKX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1275>: 

a307.seq 

1 ATGAAAACCT TCTTCAAAAC CCTTTCCGCC GCCGCACTCG CGCTCATCCT 

51 CGCCGCCTGC GGCGGTCAAA AAGATAGCGC GCCCGCCGCA TCCGCTTCTG 

101 CCGCCCCCGA CAACGGCGCG GCGAAAAAAG NAATCGTCTT CGGCACGACC 

151 GTCGGCGACT TCGGCGATAT GGTCAAAGAA CAAATCCAAC CCGAGCTGGA 

201 GAAAAAAGGC TACACCGTCA AACTGGTCGA GTTTACCGAC TATGTGCGCC 

251 CGAATCTGGC ATTGGCTGAG GGCGAGTNGG ACATCAACGT CTTCCAACAC 

301 AAACCCTATC TTGACGACTT CAAAAAAGAA CACAATCTGG ACATCACCGA 

351 AGTCTTCCAA GTGCCGACCG CGCCTTTGGG ACTGTACCCG GGCAAGCTGA 

4 01 AATCGCTGGA AGAAGTCAAA GACGGCAGCA CCGTATCCGC GCCCAACGAC 

4 51 CCGTCCAACT TCGCCCGCGT CTTGGTGATG CTCGACGAAC TGGGTTGGAT 

501 CAAACTCAAA GANGGCATCA ATCCGCTGAC CGCATCCAAA GCGGACATTG 

551 CCGAAAACCT GAAAAACATC AAAATCGTCG AGCTTGAAGC CGCGCAACTG 

601 CCGCGTAGCC GCGCCGACGT GGATTTTGNC GTCGTCAACG GCAANTACGC 

651 CATAAGCAGC GGCATGA^GC TGACCGAAGC CCTGTTCCAA GAACCGAGCT 

701 TTGCCTATGT CAACTGGTCT GCCGTCAAAA CCGCCGACAA AGACAGCCAA 

751 TGGCTTAAAG ACGTAACCGA GGCCTATAAC TCCGACGCGT TCAAAGCCTA 

801 CGCGCACAAA CGCTTCGAGG GCTACAAATC CCCTGCCGCA TGG AATGAAG 

851 GCGCAGCCAA ATAA 

This corresponds to the amino acid sequence <SEQ ID 1 276; ORF 307.a>: 

a 307. pep 

1 MKTFFKTLSA AALALILAA C GGQKDSAPAA SASAAADNGA AKKXIVFGTT 
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51 VGDFGDMVKE QIQPELEKKG YTVKLVEFTD YVRPNLALAE 

101 XPYLDDFKKE HNLDITEVFQ VPTAPLGLYP GKLKSLEEVK 

151 PSNFARVLVM LDELGWIKLK XGINPLTASK ADIAENLKNI 

201 PRSRADVDFX VVNGXYAISS GMKLTEALFQ EPSFAYVNWS 

251 WLKDVTEAYN SDAFKAYAHK RFEGYKSPAA WNEGAAX+ 



GEXDINVFQH 
DGSTVSAPND 
KIVELEAAQL 
AVKTADKDSQ 



m307/a307 1 00.0% identity in 38 aa overlap 

10 20 30 

m307 . pep QWLKDVTEAYNSDAFKAYAHKRFEGYKSPA 

I I I I I I I I I I I I I I ! I I I I I I I I II I I I M 
a307 SGMKLTEALFQEPS FAYVNWSAVKTADKDSQWLKDVTEAYNS DAFKAYAHKRFEGYKSPA 

220 230 240 250 260 270 



m307 .pep 



a307 



39 

AWNEGAAKX 
I I I I I I I I I 
AWNEGAAKX 
280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1277>: 
g308.seq 

1 ATGTTAAATC GGGTATTTTA TCGGATATTG GGTGTTGCCG ACAATTTGTA 

51 TCCGTGTTTA TCGGATTTCT GTTTTTTCAC TATAATAGCC GGTTTGCCGT 

101 TGGAGGCGGT TTTATGGGAA AGGCGGATGA TGGTACGGCG TTTGATAATC 

151 GGCATCAGCG GGGCGAGCGG TTTCCAATAC GGCGTGAAGG CTTTGGAACT 

201 TTTGCGCGCG CAAGATGTCG AAACGCACCT TGTGGTATCG AAAGGCGCGG 

251 AGATGGCGCG CGCTTCGGAA ACGGATTATA CGAAAGACGA AGTATATGCC 

301 TTGGCTGATT TCGTCCATCC GATCGG CAAT ATCGGGGCGT GCATTGCCAG 

351 CGGTACGTTT AAAACGGACG GGATGCTGGT CGCACCCTGT TCGATGCGGA 

401 CGCTTGCCTC TGTCGCGCAC GGCTTCGGCG ACAACCTCTT GACGCGTGCG 

451 GCGGATGTGG TTTTGAAGGA AAGGCGGCGG CTGGTGCTGA TGGTGCGCGA 

501 AACGCCGCTG AACCTTGCCC ATTTGGACAA TATGAAGCGG GTAACGGAAA 

551 TGGGCGGCGT GGTGTTTCCC CCTGTTCCTG CGATGTACCG CAAGCCGCAG 

601 ACGGCGGACG ACATAGTGGC GCACAGTATC GCACACACGC TGTCGCTGTT 

651 CGGAATCGAT ACGCCGGATT 7GGCGGAATG GCAGGGAATG GCGGATTAA 

This corresponds to the amino acid sequence <SEQ ID 1278; ORF 308.ng>: 
g308.pep 

1 MLNRVFYRIL GVADNLYPCL S DFCFFTIIA GLPLQAVLW E RRMMVRRLII 

51 GISGASGFQY GVKALELLRA QDVETHLWS KGAEMARASE TDYTKDEVYA 

101 LADFVHPIGN IGACIASGTF KTDGMLVAPC SMRTLASVAH GFGDNLLTRA 

151 ADWLKERRR LVLMVRETPL NLAHLDNMKR VTEMGGWFP PVPAMYRKPQ 

201 TADDIVAHSI AHTLSLFGID TPDLAEWQGM AD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1279>: 
m308.seq (partial) 

1 ATGTTAAATC GGGTATTTTA TCGGATATTG GGTGTTGCCG ACAATTTGTA 

51 TCCGCGTTTA TCGGATTTCT GTTTTTTCAC TATAATAGCC GGTTTGCCGT 

101 TGCAGGCGGT TTTATGGGAA AGGCGGATGA TGGTACGGCG TTTGATAATC 

151 GGCATCAGCG GGGCGAGCGG TTTCCAATAC GGCGTGAAGG CTTTGGAACT 

201 TTTGCGCGCG CAAGATGTCG AAACGCACCT TGTGGTATCG AAAGGTGCGG 

251 AGATGGCGCG CGCTTCGGAA ACGGCTTATG CGAGAGACGA GGTATATGCC 

301 TTGGCGGACT TCGTGCATCC GATCGGCAAT ATCGGGGCGT GCATTGCCAG 

351 CGGTACGTTT AAAACGGATG GGATGCTGGT CGCCCCCTGT TCGATGCGGA 

401 CGCTTGCCTC TGTCGCGCAC GGCTTCGGCG ACAATCTGcT GACGCgTGCG 

4 51 GCGGATGTGG TTTTGAAGGA AAGGCGGCGG CTGGTGCTGA TGGTGCGCGA 

5 31 AACGCCGCTG AACCTTGCCC ATTTGGACAA TATGAAGCGG GwAACGGAAA 
551 TGGGCGGCGT GGTGTTTCCC CCTGTTCCTG CGATGTACCG CAAACCGCAG 
601 ACGGCGGACG ACATAGTGGC GCACAGTGTT GCACACGCtT TGTCGCTGTT 
651 CGGAATCGAT ACGCCGGATT CGGCGGAATG GCA rGGAATG gcG. . . 

This corresponds to the amino acid sequence <SEQ ID 1280; ORF 308>: 

m3 08.pep (partial) 
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1 MLNRVFYRIL GVADNLYPRL SDF CFFTIIA GLPLQAVL WE RRMMVRRLI I 
51 GISGASGFQY GVKALELLRA QDVETHLWS KGAEMARASE TAYARDEVYA 
101 LADFVHPIGN IGACIASGTF KTDGMLVAPC SMRTLASVAH GFGDNLLTRA 
151 ADWLKERRR LVLMVRETPL NLAHLDNMKR XTEMGGWFP PVPAMYRKPQ 
2 01 TADDIVAHSV AHALSLFGID TPDSAEWQGM A. . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 308 shows 96.5% identity over a 231 aa overlap with a predicted ORF (ORF 308.ng) 
from N. gonorrhoeae: 

m308/g308 

10 20 30 40 50 60 

MLNRVFYRI LGVADNL Y P RLS D F CF FTI IAGLPLQAVLWERRMMVRRLI IGI SGASGFQY 
I I I I I I I I I I I I I I I I I I J I I II I I I II I I I I I I I I I M | I I | I I I | | | | | | | | | | | | | 
MLNRVFYRI LGVADNL YPCLSDFCFFTI IAGLPLQAVLWERRMMVRRLI IGI SGASGFQY 
10 20 30 40 50 60 

70 80 90 100 110 120 

GVKALELLRAQDVETHLWSKGAEMARASET AY ARDEVYALADFVHP I GN IGACIASGTF 

IMIIIIIIMNIIIIIIMIIIIIIIIII l-IIIIIIIIIIIIMI lillllllll 

G V KALE LLRAQDVETHLWS KG AEMARASETDYTKDEVYALADFVHP I GN IGACIASGTF 
70 80 90 100 110 120 

130 140 150 160 170 180 

KTDGMLVAPCSMRTLASVAHGFGDNLLTRAADWLKERRRLVLMVRETPLNLAHLDNMKR 

I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M | 
KTDGMLVA P CS MRTLAS VAHGFGDNLLTRAADWL KE RRR LVLMVRE T PLNLAHLDNMKR 
130 140 150 160 170 180 

190 200 210 220 230 

X TEMGGW F PPV PAMYRKPQTADD I VAHSVAHALS L FG I DT PDS AE WQGMA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I | I | | | | II I N 
VTEMGGWF PPVP AMYRKPQTADD I VAHS I AHTLSLFGI DTPDLAEWQGMADX 
190 200 210 220 230 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1281>: 

a308 . seq 

1 ATGTTAAATC GGATATTTTA TCGGATATTG GGTGTTGCCG ACAATTTGTA 

51 TCCGTATTTA TCGGATTTCT GTTTTTTCAC TATAATAGCC GGTTTGCCGT 

101 TGCAGGCGGT TTTATGGGAA AGGCGGATGA TGGTACGGCG TTTGATAATC 

151 GGCATCAGTG GGGCGAGCGG TTTCCAATAC GGCGTGAAGG CTTTGGANCT 

201 TTTACGCGCG CAAGATATCG AAACGCACCT TGTGGTATCG AAAGGTGCGG 

251 AGATGGCGCG CGCTTCGGAA ACGGNTTATG CGAGAGACGA NGTATATGCC 

301 TTGGCGGACT TNGTGCATCC GATCGGCAAT ATCGGGGCGT GCATTGCCAG 

351 CGGTACGTTT AAAACGGACG GGATGCTGGT CGCCCCCTGT TCGATGCGGA 

401 CGCTTGCCTC GGTCGTGCAC GGCTTCGGCG ACAACCTCTT GACGCGTGCG 

4 51 GCGGATGTGG TTTTGAAGGA AAGGCGGCGG CTGGTGCTGA TGGTGCGCGA 

501 AACGCCGCTG AACCTTGCCC ATTTGGACAA TATGAANCGG GTAACGGAAA 

551 TGGGCGGCGT GGTGTTTCCC CCTGTTCCTG CGATGTACCG CAAACCGCAG 

601 ACGGCGGACG ACATAGTGGC GCACAGTGTT GCACACGCTT TGTCGCTGTT 

651 CGGAATCGAT ACGCCGGATT CGGCGGAATG GCAGGGAATG GCGGATTAA 

This corresponds to the amino acid sequence <SEQ ID 1282; ORF 308. a>: 
a308.pep 

1 MLNRIFYRIL GVADNLYPYL SD FCFFTIIA GLPLQAVLW E RRMMVRRLII 

51 GISGASGFQY GVKALXLLRA QDIETHLVVS KGAEMARASE TXYARDXVYA 

101 LADXVHPIGN IGACIASGTF KTDGMLVAPC SMRTLASVVH GFGDNLLTRA 

151 ADWLKERRR LVLMVRETPL NLAHLDNMXR VTEMGGVVFP PVPAMYRKPQ 

201 TADDIVAHSV AHALSLFGID TPDSAEWQGM AD* ' 



m3 0 8 . pep 
g308 

m308 .pep 
g308 

m308.pep 
g308 

m308.pep 
9308 
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m308/a308 95.7% identity in 231 aa overlap 

10 20 30 40 50 60 

MLNRVFYRILGVADNLYPRLSDFCFFTIIAGLPLQAVLWERRM1WRRLIIGISGA5GFQY 
M I I : II i I I I I I I I I I I I I I I I I I I I I II I I I I i I I I I I I I I I I I ! I I I I I I II I I I I 
MLNRIFYRILGVAD^YPYLSDFCFFTIIAGLPLQAVLWERRMMVRRLIIGISGASGFQY 
10 20 30 40 50 60 

70 80 90 100 110 120 

GVKALELLRAQDVETHLVVSKGAEMARASETAYARDEVYALADFVHPIGNIGACIASGTF 
I I I I I I I 1111:1111 Mill I II II II II llll MINI I I I I I I I I I I I I I I I I 
GVKALXLLRAQDIETHLVVSKGAEMARASETXYARDXVYALADXVHPIGNIGACIASGTF 
70 80 90 100 110 120 

130 140 150 160 170 180 

KT DGMLVA PCSMRTLAS V AH GFGDNLLTRAADVVLKERRRLVLMVRET PLNLAHL DNMKR 
I I I I I I I I I I I I I I I I I I: ! I I i I I I I I I I I I I I I I I I I I I I I I ( I I I I I ill I I I I I I 
KT DGMLVA PC SMRT LAS VVHGFGDNLLTRAADVVLKERRRLVLMVRETPLNLAHLDNMXR 
130 140 150 160 170 180 

190 200 210 220 230 

XT EMGG W FP P V P AMYRK PQTA DD I V AH S VAH AL S L FG I DTP DS AEWQGHA 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I III I II I I 
VTEMGG W FPPV PAMYRKPQTADD I VAH SVAH ALSLFG I DTPDS AEWQGMADX 
190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1283>: 

g308-l.seq 

1 ATGTTAAATC GGGTATTTTA 

51 TCCGTGTTTA TCGGATTTCT 

101 TGCAGGCGGT TTTATGGGAA 

151 GGCATCAGCG GGGCGAGCGG 

201 TTTGCGCGCG CAAGATGTCG 

251 AGATGGCGCG CGCTTCGGAA 

301 TTGGCTGATT TCGTCCATCC 

351 CGGTACGTTT AAAACGGACG 

401 CGCTTGCCTC TGTCGCGCAC 

451 GCGGATGTGG TTTTGAAGGA 

501 AACGCCGCTG AACCTTGCCC 

551 TGGGCGGCGT GGTGTTTCCC 

601 ACGGCGGACG ACATAGTGGC 

651 CGGAATCGAT ACGCCGGATT 

This corresponds to the amino acid sequence <SEQ ID 1284; ORF 308-1. ng>: 

g3 08-1. pep 

1 MLNRVFYRIL GVADNLYPCL SD FCFFTIIA GLPLQAVLWE RRMMVRRL1I 

51 GISGASGFQY GVKALELLRA QDVETHLVVS KGAEMARASE TDYTKDEVYA 

101 LADFVHPIGN IGACIASGTF KTDGMLVAPC SMRTLASVAH GFGDNLLTRA 

151 ADWLKERRR LVLMVRETPL NLAHLDNMKR VTEMGGVVFP PVPAMYRKPQ 

201 TADDIVAHSI AHTLSLFGID TPDLAEWQGM AD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1285>: 

m30B-l.seq 

1 ATGTTAAATC GGGTATTTTA TCGGATATTG GGTGTTGCCG ACAATTTGTA 

51 TCCGCGTTTA TCGGATTTCT GTTTTTTCAC TATAATAGCC GGTTTGCCGT 

101 TGCAGGCGGT TTTATGGGAA AGGCGGATGA TGGTACGGCG TTTGATAATC 

151 GGCATCAGCG GGGCGAGCGG TTTCCAATAC GGCGTGAAGG CTTTGGAACT 

201 TTTGCGCGCG CAAGATGTCG AAACC-CACCT TGTGGTATCG AAAGGTGCGG 

251 AGATGGCGCG CGCTTCGGAA ACGGCTTATG CGAGAGACGA GGTATATGCC 

301 TTGGCGGACT TCGTGCATCC GATCGGCAAT ATCGGGGCGT GCATTGCCAG 

351 CGGTACGTTT AAAACGGATG GGATGCTGGT CGCCCCCTGT TCGATGCGGA 

401 CGCTTGCCTC TGTCGCGCAC GGCTTCGGCG ACAATCTGCT GACGCGTGCG 

451 GCGGATGTGG TTTTGAAGGA AAGGCGGCGG CTGGTGCTGA TGGTGCGCGA 

501 AACGCCGCTG AACCTTGCCC ATTTGGACAA TATGAAGCGG GTAACGGAAA 

551 TGGGCGGCGT GGTGTTTCCC CCTGTTCCTG CGATGTACCG CAAACCGCAG 

601 ACGGCGGACG ACATAGTGGC GCACAGTGTT GCACACGCTT TGTCGCTGTT 

651 CGGAATCGAT ACGCCGGATT CGGCGGAATG GCAGGGAATG GCGGATTAA 

This corresponds to the amino acid sequence <SEQ ID 1286; ORF 308- 1>: 

m308-l .pep 

1 MLNRVFYRIL GVADNLYPRL SD FCFFTIIA GLPLQAVLW E RRMMVRRLII 



m308 .pep 
a308 

m300 .pep 
a308 

m308 .pap 
a308 

m308 -pep 
a308 



TCGGATATTG GGTGTTGCCG ACAATTTGTA 
GTTTTTTCAC TATAATAGCC GGTTTGCCGT 
AGGCGGATGA TGGTACGGCG TTTGATAATC 
TTTCCAATAC GGCGTGAAGG CTTTGGAACT 
AAACGCACCT TGTGGTATCG AAAGGCGCGG 
ACGGATTATA CGAAAGACGA AGTATATGCC 
GATCGGCAAT ATCGGGGCGT GCATTGCCAG 
GGATGCTGGT CGCACCCTGT TCGATGCGGA 
GGCTTCGGCG ACAACCTCTT GACGCGTGCG 
AAGGCGGCGG CTGGTGCTGA TGGTGCGCGA 
ATTTGGACAA TATGAAGCGG GTAACGGAAA 
CCTGTTCCTG CGATGTACCG CAAGCCGCAG 
GCACAGTATC GCACACACGC TGTCGCTGTT 
TGGCGGAATG GCAGGGAATG GCGGATTAA 
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51 GISGASGFQY GVKALELLRA QDVETHLVVS KGAEMARASE TAYARDEVYA 

1Q 1 LADFVHPIGN IGACIASGTF KTDGMLVAPC SMRTLASVAH GFGDNLLTRA 

151 ADVVLKERRR LVLMVRETPL NLAHLDNMKR VTEMGGVVFP PVPAMYRKPQ 

201 TADDIVAHSV AHALSLFGID TPDSAEWQGK AD* 

m308-l/g308-l 97.0% identity in 232 aa overlap 

10 20 30 40 50 60 

MLNRVFY R I LGV ADNL Y PRLS D FC F FT 1 1 AG L P LQA VL W E RRMMVRRLI IGISGASGFQY 
I I I I I i I I I I I I I II I II I I I I I 1 I I I I I I I I I I I I I I I I I I I I ! t I I I I I I I I I I I I 
MLNRVFYRILGVADNLYPCLSDFCFFTI IAGLPLQAVLWERRMMVRRLI IGISGASGFQY 
10 20 30 40 50 60 

70 80 90 100 110 120 

G VKALELLRAQD VETH LVV S KG AEMARA SET AY ARDE V Y AL A D FVH P I GN I G AC I A SGT F 
I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I :: I I I I I I I I I I I I I I I I I I I I I II I I 
GVKALELLRAQDVETHLVVSKGAEMARASETDYTKDEVYALADFVHP I GN IGACIASGTF 
70 80 90 100 110 120 

130 140 150 160 170 180 

KTDGMLVAPC SMRTLASVAHGFGDNLLTRAADVVLKERRRLVLMVRETPLNLAHLDNMKR 
I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I | I I | I I 
KTDGHLVAPCSMRTIASVAHGFGDNLLTRAADVVLKERRRLVLMVRETPLNLAHLDNMKR 
130 140 150 160 170 180 

190 200 210 220 230 

VTEMGGVVFP PVPAMYRKPQTADDIVAHSVAHALSLFGIDTPDSAEWQGMADX 
I II I I I I I I I I I I I I I I I I I I I I I t I I I I : I I : I I I I I I I I I I I I i I I I II I 
VTEMGGWFPPVPAMYRKPQrADDIVAHSIAHTLSLFGIDTPDLAEWQGMADX 
190 200 210 220 230 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1287>: 

a308-l.seq 

1 AT5TTAAATC GGATATTTTA TCGGATATTG GGTGTTGCCG ACAATTTGTA 

51 TCCGTATTTA TCGGATTTCT GTTTTTTCAC TATAATAGCC GGTTTGCCGT 

101 TGCAGGCGGT TTTATGGGAA AGGCGGATGA TGGTACGGCG TTTGATAATC 

151 GGCATCAGTG GGGCGAGCGG TTTCCAATAC GGCGTGAAGG CTTTGGANCT 

201 TTTACGCGCG CAAGATATCG AAACGCACCT TGTGGTATCG AAAGGTGCGG 

251 AGATGGCGCG CGCTTCGGAA ACGGNTTATG CGAGAGACGA NGTATATGCC 

301 TT3GCGGACT TNGTGCATCC GATCGGCAAT ATCGGGGCGT GCATTGCCAG 

351 CGSTACGTTT AAAACGGACG GGATGCTGGT CGCCCCCTGT TCGATGCGGA 

401 CGCTTGCCTC GGTCGTGCAC GGCTTCGGCG ACAACCTCTT GACGCGTGCG 

451 GCGGATGTGG TTTTGAAGGA AAGGCGGCGG CTGGTGCTGA TGGTGCGCGA 

501 AACGCCCCTG AACCTTGCCC ATTTGGACAA TATGAANCGG GTAACGGAAA 

551 TGGGCGGCGT GGTGTTTCCC CCTGTTCCTG CGATGTACCG CAAACCGCAG 

601 ACSGCGGACG ACATAGTGGC GCACAGTGTT GCACACGCTT TGTCGCTGTT 

651 CGGAATCGAT ACGCCGGATT CGGCGGAATG GCAGGGAATG GCGGATTAA 

This corresponds to the amino acid sequence <SEQ ED 1288; ORF 308-1. a>: 

a3 08-1. pep 

1 MLNRIFYRIL GVADNLYPYL SD FCFFTIIA GLPLQAVLW E RRMMVRRLI I 
51 GISGASGFQY GVKALXLLRA QDIETHLWS KGAEMARASE TXYARDXVYA 
101 LADXVHPIGN IGACIASGTF KTDGMLVAPC SMRTLASWH GFGDNLLTRA 
151 ADVVLKERRR LVLMVRETPL NLAHLDNMXR VTEMGGVVFP PVPAMYRKPQ 
201 TADDIVAHSV AHALSLFGID TPDSAEWQGM AD* 

a308-l/m308-l 96.1% identity in 232 aa overlap 

10 20 30 40 50 60 

a308-l MLNRI FYRILGVADNLYPYLSDFCFFTI IAGLPLQAVLWERRMMVRRLI IGISGASGFQY 

I I I I : II I I I I I I I I I I t ! I I I I I I I I I I I I i I I ! M ! I I I I I I I I I I I I I I I I i I I I I 
m308-l MLNRVFYRILGVADNLYPRLSDFCFFTI IAGLPLQAVLWERRMMVRRLI I GISGASGFQY 

10 20 30 40 50 60 

70 80 90 100 110 120 

a308-l G V K ALX LL RAQD I ETH LVV S KGAEMARA SETX Y ARDX V Y ALA DX VH PI GN I G AC I ASGT F 

I I t I I I J I 1 1 I - 1 1 I I J * 1 1 I I 1 I I I 1 I i t MM HUM I II 1 1 I M I I i M I I I 
m308-l GVKALELLRAQDVETHLVVSKGAEMARASETAYARDEVYALADFVHPIGNIGACIASGTF 

70 80 90 100 110 120 

130 140 150 160 170 180 
a308-l KTDGMLVAPCSMRTLASVVHGFGDNLLTRAADWLKERRRLVLMVRETPLNLAHLDNMXR 
IMIIIIIMIIt M I I I I M I I I I M i I I I I I I I J I 1 II f i M Ml I 



m308-l.pep 
g308-l 

m308-l.pep 
g308-l 

m308-l.pep 
g308-l 

m308-l.pep 
g308-l 
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m308-l KTDGMLVAPCSMRTLASVAHGFGDNLLTRAADVVLKERRRLVLMVRETPLNLAHLDNMKR 
130 140 150 160 170 180 

190 200 210 220 230 

a308-l VTEMGGVVFPPVPAMYRKPQTADDIVAHSVAHALSLFGIDTPDSAEWQGMADX 
I I tl I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IMI 
m308-l VTEMGGVVFPPVPAMYRKPQTADDIVAHSVAHALSLFGIDTPDSAEWQGMADX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1289>: 

g311 . Geq 

atgttcagtt tcggctgggc gtttgaccgc ccgcagtatg agttgggttc 

gctgtcgcct gttgcggcac ttgcgtgccg gcgcgctttg gggtgtttgg 

gtttggaaac gcaaatcaag tggccaaacg atttggtcgt cggacgcgac 

aaattgggcg gcattctgat tgaaacagtc agggcgggcg gtaaaacggt 

tgccgtggtc ggtatcggca tcaatttcgt gctgcccaag gaagtggaaa 

acgccgcttc cgtgcagtcg ctgtttcaga cggcatcgcg gcggggcaat 

gccgacgccg ccgtattgct ggaaacattg cttgcggaac tgggcgcggt 

gttggaacaa tatgcggaag aagggttcgc gccattttta aatgagtatg 

aaacggccaa ccgcgaccac ggcaaggcgg tattgctgtt gcgcgacggc 

gaaaccgtgt gcgaaggcac ggttaaaggc gtggacggac gaggcgttct 

gcacttggaa acggcagaag gcgaacagac ggtcgtcagc ggcgaaatca 

^cggtttccg 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 



gcctgcggcc cgacaacagg 
gaacgttttt tgctgttgga aggcgggaac 



tgccgaagcg gccggattcg 
agccggctca agtgggcgtg 



ggtggaaaac ggcacgttcg caaccgtggg cagcgcgccg -accgcgatt 
tgtcgccttt gggcgcggag tgggcggaaa aggcggatgg aaatgtccgc 
atcgtcggtt gcgccgtgtg cggagaatcc aaaaaggcac aagtgaagga 
acagctcgcc cgaaaaatcg agtggctgcc gtcttccgca caggctttgg 
gcatacgcaa ccactaccgc caccccgaag aacacggttc cgaccgttgg 
ttcaacgcct tgggcagccg ccgcttcagc cgcaacgcct gcgtcgtcgt 
cagttgcggc acggcggtaa cggttgacgc gctcaccgat gacggacatt 
atctcggcgg aaccatcatg cccggcttcc acctgatgaa agaatcgctc 

caaccgcccc gccggcaaac gttacccttt 
ccgtcgcaag cggcatgatg gacgcggttt 
gcggctcgat aatgatgatg cacggccgtt tgaaagaaaa aaacggcgcg 

cattaccggc ggcggcgcgg cgaaagtcgc 
ttttggcgga aaataccgtg cgcgtggcgg 
acaacctcgt catccacggg ctgctgaacc tgattgccgc cgaaggcggg 



gccgtccgaa ccgccaacct 
cccgaccaca acgggcaacg 



ggcaagcctg tcgatgtcat 
cgaagccctg ccgcctgcat 



gaatcggaac acgcttaa 

This corresponds to the amino acid sequence <SEQ ID 1290; ORF 31 l.ng>: 

g311.pep 

1 MFSFGWAFDR PQYELGSLSP VAALACRRAL GCLGLETQIK WPNDLWGRD 

51 KLGGILIETV RAGGKTVAW GIGINFVLPK EVENAASVQS LFQTASRRGN 

101 ADAAVLLETL LAELGAVLEQ YAEEGFAPFL NEYETANRDH GKAVLLLRDG 

151 ETVCEGTVKG VDGRGVLHLE TAEGEQTWS GEISLRPDNR SVSVPKRPDS 

201 ERFLLLEGGN SRLKWAWVEN GTFATVGSAP YRDLSPLGAE WAEKADGNVR 

251 IVGCAVCGES KKAQVKEQLA RKIEWLPSSA QALGIRNHYR HPEEHGSDRW 

301 FNALGSRRFS RNACVWSCG TAVTVDALT D DGHYLGGTIM PGFHLMKESL 

351 AVRTANLNRP AGKRYPFPTT TGNAVASGMM DAVCGSIMMM HGRLKEKNGA 

401 GKPVDVIITG GGAAKVAEAL PPAF1AENTV RVADNLVIHG LLNLIAAEGG 

451 ESEHA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1291 >: 

m311.seq {partial) 

1 ATGTTCAGTT TTGGCTGGGT GTTTGACCGG CCGCAGTATG AGTTGGGTTC 

51 GCTGTCGCCT GTTGCGGCAG TGGCGTGTCG GCGCGCCTTG TCGCGTTTAG 

101 GTTTGGATGT GCArATTAAG TGGCCCAATG ATTTGGTTGT CGGACGCGAC 

151 AAATTGGGCG GCATTCTGAT TGAAACGGTC AGGACGGGCG GCAAAACGGT 

201 TGCCGTGGTC GGTATCGGCA TCAATTTTGT CCTGCCCAAn GAAGTAGAAA 

251 ATGCCGCTTC CGTGC AATCG CTGTTTCAGA CGGCATCGCG GCGGGGCAAT 

301 GCCGATGCCG CCGTGCTGCT nnnnnnnnnn nnnnnnnnnn nnnnGGAAAT 

351 CAGCCTGCGG TCCGACnACA GGCCGGTTTC CGTGnCGAAG CGGCGGGATT 
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4 01 CGGAACGTTT TCTGCTGTTG GACGGCGGCA ACAGCCGGCT CAAGTGGgCG 

451 TGGGTGGAAA ACGGCACGTT CGCAACCGTC GGTAGCGCGC CGTACCgCGA 

501 TTTGTCGCCT TTGGGCGCGG AGTGGG CGGA AAAGGCGGAT GGAAATGTCC \ 

551 GCATCGTCGG TTGCGCTGTG TGCGGAGAAT TCAAAAAGGC ACAAGTGCAG 

601 GAACAGCTCG CCCGAAAAAT CGAGTGGCTG CCGTCTTCCG CACAGG CTTT 

6 51 GTTTGGCATA CGCAACCACT ACCGCCACCC CGAAGAACAC GGTTCCGACC 

701 GCTGGTTCAA CGCCTTGGGC AGCCGCCGCT TCAGCCGCAA CGCyTGCGTC 

751 GTCGTCAGTT GCGGCACGGC GGTAACGGTT GACGCGCTCA CCGATGACGG 

801 ACATTATCTC GGrgGAACCA TCATGCCCGG TTTCCACCTG ATGAAAGAAT 

851 CGCTCGCCGT CCGAACCGCC AACCTCAACC GGCACGCCGG TAAGCGTTAT 

901 CCTTTCCCGA CCACAACGGG CAATGCCGTC GCCAGCGGCA TGATGGATGC 

951 GGTTTGCGGC TCGGTTATGA TGATGCACGG GCGTTTGAAA GAAAAAACCG 

1001 GGGCGGGCAA GCCTGTCGAT GTCATCATTA CCGGCGGCGG CGCGGCAAAA 

1051 GTTGCCGAAG CCCTGCCGCC TGCATTTTTG GCGGAAAATA CCGTGCGCGT 

1101 GGCGGACAAC CTCGTCATTT ACGGGTTGTT GAACATGATT GCCGCCGAAG 

1151 GCAGGGAATA TGAACAT 

This corresponds to the amino acid sequence <SEQ ID 1292; ORF 31 1>: 

m311.pep (partial) 

1 MFSFGWVFDR PQYELGSLSP VAAVACRRAL SRLGLDVQIK WPNDLWGRD 

51 KXiGGILIETV RTGGKTVAW GIGINFVLPX EVENAASVQS LFQTASRRGN 

101 ADAAVLLXXX XXXXXEISLR SDXRPVSVXK RRDSERFLLL DGGNSRLKWA 

151 V7VENGT FATV GSAPYRDLSP LGAEWAEKAD GNVRIVGCAV CGEFKKAQVQ 

201 EQLARKIEWL PSSAQALFGI RNHYRHPEEH GSDRWFNALG SRRFSRNACT 

251 WSCGTAVrV DALT DDGHYL GGTIMPGFHL MKESLAVRTA NLNRHAGKRY 

301 PFPTTTGNAV ASGMMDAVCG SVMMMHGRLK EKTGAGKPVD VIITGGGAAK 

351 VAEALPPAFL AENTVRVADN LVIYGLLNMI AAEGREYEH . . . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 3 1 1 shows 78.5% identity over a 455 aa overlap with a predicted ORF (ORF 3 1 1 .ng) 
from N. gonorrhoeae: 
ra311/g311 

10 20 30 40 50 60 

m3 11 . pep M FS FGWVFDRPQYE LGS LS PVAAVACRRALS RLGLDVQ I KW PNDL WGRDKLGG I L I E TV 

lllllhlllllllllllllllhlllllh I ll-l : III 1 1 Ml !l II I II MM 

9 3 1 1 M FS FGW AFDRPQYE LGS LS P VAALACRRALGCLGLETQ I KW PNDLWGRD KLGG I L I ETV 

10 20 30 40 50 60 

70 80 90 100 110 
Itl3 11 . pep RTGGKTVAWGIGINFVLPXEVENAASVQSLFQTASRRGNADAAVLLXXX 

hlllllMIIIIIIIIII IMMmiMlillllllllllllll 

g 3 1 1 RAGGKTVAVVGIGINFVLPKEVENAASVQSLFQTASRRGNADAAVLLETLLAELGAVLEQ 
70 80 90 100 110 120 



m311.pep XXXX 

93 1 1 YAEEGFAPFLNEYETANRDHGKAVLLLRDGETVCEGTVKGVDGRGVLHLETAEGEQTWS 
130 140 150 160 170 180 

120 130 140 150 160 170 

m3 11 . pep X E I S LRS DXR P VSVXKRRDS ERFLLLDGGNSRL KWAWVENGT F ATVGS AP YRDLS PLGAE 

Mill I I III II I III III M IIIIMIIIIIII I III III MINIMUM 

9311 GEISLRPDNRSVSVPKRPDSERFLLLEGGNSRLKWAWVENGTFATVGSAPYRDLSPLGAE 
190 200 210 220 230 240 

180 190 200 210 220 230 

m311 . pep WAEKADGNVRIVGCAVCGEFKKAQVQEQLARKI EWL PSSAQALFGI RNHYRHPEEHGSDR 

IMINMIMIIMMM 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II I 

9 3 1 1 WAE KADGNVR rVGCAVCGES KKAQVKEQLARKI EWLPSSAQAL - GI RNHYRHPEEHGSDR 

250 260 270 280 290 
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240 250 260 270 280 290 

m3 11 . pep W FNALGS RR FSRNACWVS CGTAVTVDALTDDGHYLGGT I M P G FH LMKE S LA VRT ANLNR 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

g311 W FN RR FS RNACWV S CGTAVTVDALTDDGHYLGGT I MPGFHLMKE S LAVRTANLNR 

300 310 320 330 340 350 

300 310 320 330 340 350 

m3 1 1 . pep HAGKRYP FPTTTGNAVASGMMDAVCGSVMMMHGRLKEKTGAGKPVDVI ITGGGAAKVAEA 

II I M I II M ill III III IN ! II I :| 1 1 II i II ' hi llllll 1 1 1 1 1 II 111 I II I 

g311 PAG KR Y P FPTTTGNAVASGMMDAVCGS 1 MMMHGRLKEKNGAGKPVBVI ITGGGAAKVAEA 

360 370 380 390 400 410 



360 370 380 389 

m311 - pep LPPAFLAENTVRVADNLVI YGLLNMIAAEGREYEH 

iiiiiiiiiiimmihiiihiiiii i n 

g 3 1 1 LP P AFLAENTVRVADNLV I HGLLNL I AAEGGESEHAX 

420 430 440 450 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1293>: 

a311.seq 

1 ATGTTCAGTT TTGGCTGGGT GTTTGACCGG CCGCAGTATG AGTTGGGTTC 

51 GCTGTCGCCT GTTGCGGCAG TGGCGTGCCG GCGCGCCTTG TCGCGTTTGG 

101 GTTTGAAAAC GCAAATCAAG TGGCCAAACG ATTTGGTCGT CGGACGCGAC 

151 AAATTGGGCG GCATTCTGAT TGAAACGGTC AGGACGGGCG GCAAAACGGT 

201 TGCCGTGGTC GGTATCGGCA TCAATTTCGT GCTGCCCAAG GAAGTGGAAA 

251 ACGCCGCTTC CGTGCAATCG CTGTTTCAGA CGGCATCGCG GCGGGGAAAT 

301 GCCGATGCCG CCGTGTTGCT GGAAACGCTG TTGGCGGAAC TTGATGCGGT 

351 GTTGTTGCAA TATGCGCGGG ACGGATTTGC GCCTTTTGTG GCGGAATATC 

401 AGGCTGCCAA CCGCGACCAC GGCAAGGCGG TATTGCTGTT GCGCGACGGC 

451 GAAACCGTGT TCGAAGGCAC GGTTAAAGGC GTGGACGGAC AAGGCGTTCT 

501 GCACTTGGAA ACGGCAGAGG GCAAACAGAC GGTCGTCAGC GGCGAAATCA 

551 GCCTGCGGTC CGACGACAGG CCGGTTTCCG TGCCGAAGCG GCGGGATTCG 

601 GAACGTTTTC TGCTGTTGGA CGGCGGCAAC AGCCGGCTCA AGTGGGCGTG 

651 GGTGGAAAAC GGCACGTTCG CAACCGTCGG TAGCGCGCCG TACCGCGATT 

701 TGTCGCCTTT GGGCGCC-GAG TGGGCGGAAA AGGTGGATGG AAATGTCCGC 

751 ATCGTCGGTT GCGCCGTGTG CGGAGAATTC AAAAAGGCAC AAGTGCAGGA 

801 ACAGCTCGCC CGAAAAATCG AGTGGCTGCC GTCTTCCGCA CAGGCTTTGG 

851 GCATACGCAA CCACTACCGC CACCCCGAAG AACACGGTTC CGACCGCTGG 

901 TTCAACGCCT TGGGCAGCCG CCGCTTCAGC CGCAACGCCT GCGTCGTCGT 

951 CAGTTGCGGC ACGGCGGTAA CGGTTGACGC GCTCACCGAT GACGGACATT 

1001 ATCTCGGGGG AACCATCATG CCCGGTTTCC ACCTGATGAA AGAATCGCTC 

1051 GCCGTCCGAA CCGCCAACCT CAACCGGCAC GCCGGTAAGC GTTATCCTTT 

1101 CCCGACCACA ACGGGCAATG CCGTCGCCAG CGGCATGATG GATGCGGTTT 

1151 GCGGCTCGGT TATGATGATG CACGGGCGTT TGAAAGAAAA AACCGGGGCG 

1201 GGCAAGCCTG TCGATGTCAT CATTACCGGC GGCGGCGCGG CAAAAGTTGC 

1251 CGAAGCCCTG CC3CCTGCAT TTTTGGCGGA AAATACCGTG CGCGTGGCGG 

1301 ACAACCTCGT CATTCACGGG CTGCTGAACC TGATTGCCGC CGAAGGCGGG 

1351 GAATCGGAAC ATACTTAA 

This corresponds to the amino acid sequence <SEQ ID 1294; ORF 3 1 1 .a>: 

a311.pep 

1 MFSFGWVFDR PQYELGSLSP VAAVACRRAL SRLGLKTQIK WPNDLWGRD 

51 KLGGILIETV RTGGKTVAW GIGINFVLPK EVENAASVQS LFQTASRRGN 

101 AD AAVLLETL LAELDAVLL Q YARDGFAPFV AEYQAANRDH GKAVLLLRDG 

151 ETVFEGTVKG VDGQGVLHLE TAEGKQTVVS GEISLRSDDR PVSVPXRRDS 

201 ERFLLLDGGN SRLKWAWVEN GTFATVGSAP YRDLSPLGAE WAEKVDGNVR 

251 IVGCAVCGEF KKAQVQEQLA RKIEWLPSSA QALGIRKHYR HPEEHGSDRW 

301 FNALGS RRFS RN ACVWSCG TAVTVDALT D DGHYLGGTIM PGFHLMKESL 

351 AVRTANLNRH AGKRYPFPTT TGNAVASGMM DAVCGSVMMM HGRLKEKTGA 

401 GKPVDVIITG GGAAKVAEAL PPAFLAENTV RVADNLVIHG LLNLIAAEGG 

451 ESEHT* 



m31 l/a31 1 81.3% identity in 455 aa overlap 
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10 20 30 40 50 60 

m311 . pep MFSFGWVFDRPQYELGSLSPVAAVACRRALSRLGLDVQIKWPNDLWGRDKLGGILIETV 
I i I > I I I I I t I I I I N i I I M I I I I I I I I I I I i i I : I | I | | | | | | j | | | M I I I I I I II 
a311 MFSFGWVFDRPQYELGSLSPVAAVACRRALSRLGLKTQIKWPNDLVVGRDKLGGILIETV 

10 20 30 40 50 60 

7C 80 90 100 110 

m311 . pep RTGGKTVAWGIGINFVLPXEVENAASVQSLFQTASRRGNADAAVLLXXXXXXXX 

! I I I II I I I I I I I I i I I I I M I I I I I I I I II I I I I I I | | | | | | M | : 
a311 RTGGKTVAVVGIGINFVLPKEVENAASVQSLFQTASRRGNADAAVLLETLLAELDAVLLQ 

70 83 90 100 110 120 



m311 .pep 

a311 YARDGFAPFVAEYQAANRDHGKAVLLLRDGETVFEGTVKGVDGQGVLHLETAEGKQTWS 
130 140 150 160 170 180 

120 130 140 150 160 170 

m311 . pep -EISLRSDXRPVSVXKRRDSERFLLLDGGNSRLKWAWVENGTFATVGSAPYRDLSPLGAE 
I I I I I I I Mill I I I I I I I I I I I I I I I I I I I | | | | || | | | M | J | | | | | | | || | | | 
a311 GEISLRSDDRPVSVPKRRDSERFLLLDGGNSRLKWAWVENGTFATVGSAPYRDLSPLGAE 
190 200 210 220 230 240 

180 190 200 210 220 230 

m311.pep WAEKADGNVRIVGCAVCGE?KKAQVQEQLARKIEWLPSSAQALFGIRNHYRHPEEHGSDR 
I I I I : I I I I I I i I I I I I 1 I M I I I I I I I I ! I II I I II I II I I I I I I I I | | | | | | | | | | | 
a311 WAEKVDGNVRIVGCAVCGEFKKAQVQEQLARKIEWLPSSAQAL-GIRNHYRHPEEHGSDR 
250 260 270 280 290 

240 250 260 270 280 290 

m311.pep W FNALG SRR FS RN AC VW S CGT AVT VDALTD DGH YLGGT IM PG FH LMKES LAVRT AN LNR ' 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I | I | | | | | | | | | | 
a311 W FNALG SRR FS RN AC W V S CGT AVT V D ALT D DGH Y LG GT I M PG FH LMKES LAVRT AN LNR 

300 310 320 330 340 350 

300 310 320 330 340 350 

m311 . pep HAGKRYPFPTTTGNAVASGI4MDAVCGSVMMMHGRLECEKTGAGKPVDVIITGGGAAKVAEA 
I M I I I I I I I I I I I I I I I I I I I I I I I I I'l I I I I I I I I I I I I I I I I I || I I I I I I I I | | | | 
a 31 1 HAGKRYPFPTTTGNAVASGr^lMDAVCGSVMMMHGRLKEKTGAGKPVDVI ITGGGAAKVAEA 

360 370 380 390 400 410 

360 370 380 389 

m311 .pep LPPAFLAENTVRVADNLVIYGLLNMIAAEGREYEH 
i I M I I I I 1 I I I I I I I I I I : I I I I: I I I I I I || 
a311 LPPAFLAENTVRVADNLVIHGLLNLIAAEGGESEHTX 
420 430 440 450 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 1295>: 

g311-l.seq 

1 ATGACGGTTT TGAAGCCTTC GCATTGGCGG GTGTTGGCGG AGCTTGCCGA 

51 CGGTTTGCCG CAACACGTAT CGCAATTGGC GCGTGAGGCG GACATGAAGC 

101 CGCAGCAGCT CAACGGTTTT TGGCAGCAGA TGCCGGCGCA TATACGCGGG 

151 CTGTTGCGCC AACACGACGG CTATTGGCGG CTGGTGCGCC CCTTGGCGGT 

201 TTTCGATGCC GAAGGTTTGC GCGATCTGGG GGAAAGGTCG GGTTTTCAGA 

251 CGGCATTGAA GCACGAGTGC GCGTCCAGCA ACGACGAGAT ACTGGAATTG 

301 GCGCGGATTG CGCCGGACAA GGCGCACAAA ACCATATGCG TGACCCACCT 

351 GCAAAGTAAG GGCAGGGGGC GGCAGGGGCG GAAGTGGTCG CACCGTTTGG 

401 GCGAGTGCCT GATGTTCAGT TTCGCCTGGG CGTTTGACCG GCCGCAGTAT 

451 GAGTTGGGTT CGCTGTCGCC TGTTGCGGCA CTTGCGTGCC GGCGCGCTTT 

501 GGGGTGTTTG GGTTTGGAAA CGCAAATCAA GTGGCCAAAC GATTTGGTCG 

551 TCGGACGCGA CAAATTGGGC GGCATTCTGA TTGAAACAGT CAGGGCGGGC 

60 1 GGTAAAACGG TTGCCGTGGT CGGTATCGGC ATCAATTTCG TGCTGCCCAA 

651 GGAAGTGGAA AACGCCGCTT CCGTGCAGTC GCTGTTTCAG ACGGCATCGC 

701 GGCGGGGCAA TGCCGATGCC GCCGTATTGC TGGAAACATT GCTTGCGGAA 

751 CTGGGCGCGG TGTTGGAACA ATATGCGGAA GAAGGGTTCG CGCCATTTTT 
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801 AAATGAGTAT GAAACGGCCA ACCGCGACCA CGGCAAGGCG 

851 TGCGCGACGG CGAAACCGTG TGCGAAGGCA CGGTTAAAGG 

901 CGAGGCGTTC TGCACTTGGA AACGGCAGaa ggCGAACAGa 

951 cggcGaaaTC AGccTGCGGc CCGacaacag gtcggtttcc 

1001 gGccggatTC GgaacgttTT tTGCTgttgg aaggcgggaa 

1051 aAGTGGgcgt gGGTggAAAA Cggcacgttc gcaaccgtgg 

1101 gtaCCGCGAT TTGTCGCCTT TGGGCGCGGA GTGGGCGGAA 

1151 GAAATGTCCG CATCGTCGGT TGCGCCGTGT GCGGAGAATC 

1201 CAAGTGAAGG AACAGCTCGC CCGAAAAATC GAGTGGCTGC 

1251 ACAGGCTTTG GGCATACGCA ACCACTACCG CCACCCCGAA 

1301 CCGACCGTTG GTTCAACGCC TTGGGCAGCC GCCGCTTCAG 

1351 TGCGTCGTCG TCAGTTGCGG CACGGCGGTA ACGGTTGACG 

1401 TGACGGACAT TATCTCGGCG GAACCATCAT GCCCGGCTTC 

1451 AAGAATCGCT CGCCGTCCGA ACCGCCAACC TCAACCGCCC 

1501 CGTTACCCTT TCCCGACCAC AACGGGCAAC GCCGTCGCAA 

1551 GGACGCGGTT TGCGGCTCGA TAATGATGAT GCACGGCCGT 

1601 AAAACGGCGC GGGCAAGCCT GTCGATGTCA TCATTACCGG 

1651 GCGAAAGTCG CCGAAGCCCT GCCGCCTGCA TTTTTGGCGG 

1701 GCGCGTGGCG GACAACCTCG TCATCCACGG GCTGCTGAAC 

1751 CCGAAGGCGG GGAATCGGAA CACGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 1296; ORF 31 1-1 .ng>: 

g311-l.pep 

1 MTVLKPSHWR VLAELADGLP QHVS0LAREA DMKPQQLNGF WQQMPAHIRG 

51 LLRQHDGYWR LVRPLAVFDA EGLRDLGERS GFQTALKHEC ASSNDEILEL 

101 ARIAPDKAHK TICVTHLQSK GRGRC'GRKWS HRLGECLMFS FGWAFDRPQY 

151 ELGSLSPVAA LACRRALGCL GLETQIKWPN DLVVGRDKLG GILIETVRAG 

201 GKTVAWGIG INFVLPKEVE NAASVQSLFQ TASRRGNAD A AVLLETLLAE 

251 LGAVLE QYAE EGFAPFLNEY ETANRDHGKA VLLLRDGETV CEGTVKGVDG 

301 RGVLHLETAE GEQTVVSGEI SLRPDNRSVS VPKRPDSERF LLLEGGNSRL 

351 KWAWVENGTF ATVGSAPYRD LSPLGAEWAE KADGNVRIVG CAVCGESKKA 

401 QVKEQLARKI EWLPSSAQAL GIRNHYRHPE EH GS DRW FN A LGSRRFSRNA 

451 CVYVSCGTAV TVDALT DDGH YLGGTIMPGF HLMKESLAVR TANLNRPAGK 

501 RYPFPTTTGN AVASGMMDAV CGSIMMMHGR LKEKNGAGKP VDVIITGGGA 

551 AKVAEALPPA FLAENTVRVA DNLVIHGLLN LIAAEGGESE HA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1297>: 

m311-l . seq 

1 ATGACGGTTT TGAAGCTTTC GCACTGGCGG GTGTTGGCGG AGCTTGCCGA 

51 CGGTTTGCCG CAACACGTCT CGCAACTGGC GCGTATGGCG GATATGAAGC 

101 CGCAGCAGCT CAACGGTTTT TGGCAGCAGA TGCCGGCGCA CATACGCGGG 

151 CTGTTGCGCC AACACGACGG CTATTGGCGG CTGGTGCGCC CATTGGCGGT 

201 TTTCGATGCC GAAGGTTTGC GCGAGCTGGG GGAAAGGTCG GGTTTTCAGA 

251 CGGCATTGAA GCACGAGTGC GCGTCCAGCA ACGACGAGAT ACTGGAATTG 

301 GCGCGGATTG CGCCGGACAA GGCGCACAAA ACCATATGCG TGACCCACCT 

351 GCAAAGTAAG GGCAGGGGGC GGCAGGGGCG GAAGTGGTCG CACCGTTTGG 

401 GCGAGTGTCT GATGTTCAGT TTTGGCTGGG TGTTTGACCG GCCGCAGTAT 

451 GAGTTGGGTT CGCTGTCGCC TGTTGCGGCA GTGGCGTGTC GGCGCGCCTT 

501 GTCGCGTTTA GGTTTGGATG TGCAGATTAA GTGGCCCAAT GATTTGGTTG 

551 TCGGACGCGA CAAATTGGGC GGCATTCTGA TTGAAACGGT CAGGACGGGC 

601 GGCAAAACGG TTGCCGTGGT CGGTATCGGC ATCAATTTTG TCCTGCCCAA 

651 GGAAGTAGAA AATGCCGCTT CCGTGCAATC GCTGTTTCAG ACGGCATCGC 

701 GGCGGGGCAA TGCCGATGCC GCCGTGCTGC TGGAAACGCT GTTGGTGGAA 

751 CTGGACGCGG TGTTGTTGCA ATATGCGCGG GACGGATTTG CGCCTTTTGT 

801 GGCGGAATAT CAGGCTGCCA ACCGCGACCA CGGCAAGGCG GTATTGCTGT 

851 TGCGCGACGG CGAAACCGTG TTCGAAGGCA CGGTTAAAGG CGTGGACGGA 

901 CAAGGCGTTT TGCACTTGGA AACGGCAGAG GGCAAACAGA CGGTCGTCAG 

951 CGGCGAAATC AGCCTGCGGT CCGACGACAG GCCGGTTTCC GTGCCGAAGC 

1001 GGCGGGATTC GGAACGTTTT CTGCTGTTGG ACGGCGGCAA CAGCCGGCTC 

1051 AAGTGGGCGT GGGTGGAAAA CGGCACGTTC GCAACCGTCG GTAGCGCGCC 

1101 GTACCGCGAT TTGTCGCCTT TGGGCGCGGA GTGGGCGGAA AAGGCGGATG 

1151 GAAATGTCCG CATCGTCGGT TGCGCTGTGT GCGGAGAATT CAAAAAGGCA 

1201 CAAGTGCAGG AACAGCTCGC CCGAAAAATC GAGTGGCTGC CGTCTTCCGC 

1251 ACAGGCTTTG GGCATACGCA ACCACTACCG CCACCCCGAA GAACACGGTT 

1301 CCGACCGCTG GTTCAACGCC TTGGGCAGCC GCCGCTTCAG CCGCAACGCC 

1351 TGCGTCGTCG TCAGTTGCGG CACGGCGGTA ACGGTTGACG CGCTCACCGA 

1401 TGACGGACAT TATCTCGGGG GAACCATCAT GCCCGGTTTC CACCTGATGA 

1451 AAGAATCGCT CGCCGTCCGA ACCGCCAACC TCAACCGGCA CGCCGGTAAG 

1501 CG7TATCCTT TCCCGACCAC AACGGGCAAT GCCGTCGCCA GCGGCATGAT 

1551 GGATGCGGTT TGCGGCTCGG TTATGATGAT GCACGGCCGT TTGAAAGAAA 

1601 AAACCGGGGC GGGCAAGCCT GTCGATGTCA TCATTACCGG CGGCGGCGCG 

1651 GCAAAAGTTG CCGAAGCCCT GCCGCCTGCA TTTTTGGCGG AAAATACCGT 

1701 GCGCGTGGCG GACAACCTCG TCATTTACGG GTTGTTGAAC ATGATTGCCG 



GTATTGCTGT 
CGTGGACGGA 
cggtcGtcag 
GTgccgaagc 
cagccggctc 
gcAGCGCgCC 
AAGGCGGATG 
CAAAAAGGCA 
CGTCTTCCGC 
GAACACGGTT 
CCGCAACGCC 
CGCTCACCGA 
CACCTGATGA 
CGCCGGCAAA 
GCGGCATGAT 
TTGAAAGAAA 
CGGCGGCGCG 
AAAATACCGT 
CTGATTGCCG 
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1751 CCGAAGGCAG GGAATATGAA CATATTTAA 



This corresponds to the amino acid sequence <SEQ ID 1298; ORF 31 1-1>: 

m311-l.pep 

1 MTVLKLSHWR VLAELADGLP QHVSQLARMA DMKPQQLNGF WQQMPAHIRG 

51 LLRQHDGYWR LVRPLAVFDA EGLRELGERS GFQTALKHEC ASSNDEILEL 

101 ARIAPDKAHK TICVTHLQSK GRGRQ3RKWS HRLGECLMFS FGWVFDRPQY 

151 ELGSLSPVAA VACRRALSRL GLDVQIKWPN DLWGRDKLG GILIETVRTG 

201 GKTVAVVGIG INFVLPKEVE NAASVQSLFQ TASRRGNAD A AVLLETLLVE 

251 LDAVLLQYAR DGFAPFVAEY QAANRDHGKA VLLLRDGETV FEGTVKGVDG 

301 QGVLHLETAE GKQTVVSGEI SLRSDDRPVS VPKRRDSERF LLLDGGNSRL 

351 KWAWVENGTF ATVGSAPYRD LSPLGAEWAE KADGNVRIVG CAVCGEFKKA 

401 QVQEQLARKI EWLPSSAQAL GIRNHYRHPE EHGSDRWFNA LGSRRFSRNA 

451 CWVSCGTAV TVDALT DDGH YLGGTIMPGF HLMKESLAVR TANLNRHAGK 

501 RYPFPTTTGN AVASGNMDAV CGSVMMMHGR LKEKTGAGKP VDVIITGGGA 

551 AKVAEALPPA FLAENTVRVA DNLVIYGLLN MIAAEGREYE HI* 



m311-l/g311-l 93.9% identity in 591 aa overlap 



10 20 30 40 50 60 

ra3 1 1 - 1 . pep MT VL K LS H WRVLAELADG LP QH VS QLARMADMKPGXDLNG FWQQM PAH I RGLLRQHDGYWR 

mi i 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 limit 

g 3 1 1 - 1 MTVLKPSHWRVLAELADGLPQHVSQLAREADMKPQQLNGFWQQMPAHI RGLLRQHDGYWR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m3Il-l . pep LVRPLAVFDAEGLRELGERSGFQTALKHECASSNDEILELARIAPDKAHKTICVTHLQSK 
Mil t I M I I I I i 1:11 II I UNI Mil III tl II II III II II MMMIIIIII II I 
g311-l LVRPLAVFDAEGLRDLGERSGFQTALKHECASSNDEILELARIAPDKAHKTICVTHLQSK 

70 80 90 100 110 120 

130 140 15C 160 170 180 

m31 1-1 . pep G RG RQGRKWS H R LGECLM FS FGWV FDRPQ YELGS LS PV AAVAC RRAL S RLGL DVQ I KW PN 
I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I | I | : | | | | | | : I I I : : | | I I I 
g311-l GRGRQGRKWSHRLGECLMFSFGWAFDRPQYELGSLSPVAALACRRALGCLGLETQIKWPK 

130 140 150 160 170 18C 

190 200 210 220 230 240 

ni311-l . pep DLVVGRDKLGGILIETVRTGGKTVAVVGIGINFVLPKEVENAASVQSLFQTASRRGNADA 
I I I i I I I I I I I I I I I I I I : I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | j | 
g311-l DL VVG RDKLGG I L I ET VRAGGKT VAVVGI GIN FVL PKE VEN AA SVQS L FQTAS RRGN ADA 

190 200 210 220 230 240 

250 260 270 280 290 300 

m31 1-1 . pep AVLLETLLVELDAVLLQYARDGFAPFVAEYQAANRDHGKAVLLLRDGETVFEGTVKGVDG 
11111111:11 Ml M : M II I 1 1 I I II I I I II II I I MINIM 

g3 1 1- 1 AVLLETLLAELGAVXEQYAEEGFAPFLNEYETANRDHGKAVLLLRDGETVCEGTVKGVDG 

250 260 270 280 290 300 

310 320 330 340 350 360 

ra3 11-1. pep QGVLHLETAEGKQTWSGEISLRSDDRPVSVPKRRDSERFLLLDGGNSRLKWAWVENGTF 
= 1 1 1 1 s 1 1 1 1 I = 1 1 1 1 I I 1 1 1 1 I 1:1 MIMI MlillllrlllMIIIIIIIIIII 
g31 1-1 RGVLH LET AEGEQTWS GE I S LRPDNRSV SV PKRP DS ERFLLL EGGN S RLKWA WVENGT F 

310 320 330 340 350 360 

370 380 390 400 410 420 

m31 1- 1 . pep ATVGS A P YRDLS PLG AE W AEKADGN VR I VGC A VCG EFKK AQVQEQLARK I EWLPS S AQAL 

M I 1 1 I M I M M I I I I 1 1 1 I II M I 1 1 M II 1 1 1 1 I 1 1 II : I II M I M I 1 1 1 1 I M i 
g3 1 1- 1 ATVGS APY RDL S PLG AEWAEKADGN VRI VGC A VCGES K K AQVK EQLARK I E WL PS S AQAL 

370 380 390 400 410 420 



430 440 450 460 470 480 

m311-l.pep GIRNHYRHPEEHGSDRWFNALGSRRFSRNACVVVSCGTAVTVDALTDDGH YLGGTIMPGF 
M I I I I I I I I II I M I I I I I M I II I M I M I II I I I I I I I II I I I II I IMIIIMM 
g311-l GIRNHYRHPEEHGSDRWFNALGSRRFSRNACVVVSCGTAVTVDALTDDGH YLGGTIMPGF 

430 440 450 460 470 480 

490 500 510 520 530 540 

m311-l . pep HLMKESLAVRTANLNRHAGKRYPFFTTTGNAVASGMMDAVCGSVMMMHGRLKEKTGAGKP 
MIIIIIIIMIIIIi M I I I I I M I 1 | I | | | | | | | | | | | | | : | j | | | | | | | j : | | j | j 
g311-l HLMKESLAVRTANLNRPAGKRYPFPTTTGNAVASGMMDAVCGSIMMMHGRLKEKNGAGKP 

490 500 510 520 530 540 



550 560 570 580 590 
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m311-l .pep VDVIITGGGAAKVAEALPPAFLAENTVRVADNLVIYGLLIIMIAAEGREYEHIX 

I I I I M I I I I I I I I I I I I I I I I I I I 1 I I I II I I I I : I I I f : | | | | | | ( | 
g 31 1 - 1 VDVII TGGGAAKVAEALPPAFLAKNTVRVADNLVrHGLLNLI AAEGGESEHAX 

550 560 570 580 590 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1299>: 

a311-l.seq 

1 ATGACGGTTT TGAAGCCTTC GCACTGGCGG GTGTTGGCGG AGCTTGCCGA 

51 CGGTTTGCCG CAACACGTCT CGCAACTGGC GCGTATGGCG GATATGAAGC 

101 CGCAGCAGCT CAACGGTTTT TGGCAGCAGA TGCCGGCGCA CATACGCGGG 

151 CTGTTGCGCC AACACGACGG CTATTGGCGG CTGGTGCGCC CATTGGCGGT 

201 TTTCGATGCC GAAGGTTTGC GCGAGCTGGG GGAAAGGTCG GGTTTTCAGA 

251 CGGCATTGAA GCACGAGTGC GCGTCCAGCA ACGACGAGAT ACTGGAATTG 

301 GCGCGGATTG CGCCGGACAA GGCGCACAAA ACCATATGTG TGACCCACCT 

351 GCAAAGTAAG GGCAGGGGGC GGCAGGGGCG GAAGTGGTCG CACCGTTTGG 

401 GCGAGTGTCT GATGTTCAGT TTTGGCTGGG TGTTTGACCG GCCGCAGTAT 

451 GAGTTGGGTT CGCTGTCGCC TGTTGCGGCA GTGGCGTGCC GGCGCGCCTT 

501 GTCGCGTTTG GGTTTGAAAA CGCAAATCAA GTGGCCAAAC GATTTGGTCG 

551 TCGGACGCGA CAAATTGGGC GGCATTCTGA TTGAAACGGT CAGGACGGGC 

601 GGCAAAACGG TTGCCGTGGT CGGTATCGGC ATCAATTTCG TGCTGCCCAA 

651 GGAAGTCGAA AACGCCCCTT CCGTGCAATC GCTGTTTCAG ACGGCATCGC 

701 GGCGGGGAAA TGCCGATGCC GCCGTGTTGC TGGAAACGCT GTTGGCGGAA 

751 CTTGATGCGG TGTTGTTGCA ATATGCGCGG GACGGATTTG CGCCTTTTGT 

801 GGCGGAATAT CAGGCTGCCA ACCGCGACCA CGGCAAGGCG GTATTGCTGT 

851 TGCGCGACGG CGAAACCGTG TTCGAAGGCA CGGTTAAAGG CGTGGACGGA 

901 CAAGGCGTTC TGCACTTGGA AACGGCAGAG GGCAAACAGA CGGTCGTCAG 

951 CGGCGAAATC AGCCTGCGGT CCGACGACAG GCCGGTTTCC GTGCCGAAGC 

1001 GGCGGGATTC GGAACGTTTT CTGCTGTTGG ACGGCGGCAA CAGCCGGCTC 

1051 AAGTGGGCGT GGGTGGAAAA CGGGACGTTC GCAACCGTCG GTAGCGCGCC 

1101 GTACCGCGAT TTGTCGCCTT TGGGCGCGGA GTGGGCGGAA AAGGTGGATG 

1151 GAAATGTCCG CATCGTCGGT TGCGCCGTGT GCGGAGAATT CAAAAAGGCA 

1201 CAAGTGCAGG AACAGCTCGC CCGAAAAATC GAGTGGCTGC CGTCTTCCGC 

1251 ACAGGCTTTG GGCATACGCA ACCACTACCG CCACCCCGAA GAACACGGTT 

1301 CCGACCGCTG GTTCAACGCC TTGGGCAGCC GCCGCTTCAG CCGCAACGCC 

1351 TGCGTCGTCG TCAGTTGCGG CACGGCGGTA ACGGTTGACG CGCTCACCGA 

1401 TGACGGACAT TATCTCGGGG GAACCATCAT GCCCGGTTTC CACCTGATGA 

1451 AAGAATCGCT CGCCGTCCGA ACCGCCAACC TCAACCGGCA CGCCGGTAAG 

1501 CGTTATCCTT TCCCGACCAC AACGGGCAAT GCCGTCGCCA GCGGCATGAT 

1551 GGATGCGGTT TGCGGCTCGG TTATGATGAT GCACGGGCGT TTGAAAGAAA 

1601 AAACCGGGGC GGGCAAGCCT GTCGATGTCA TCATTACCGG CGGCGGCGCG 

1651 GCAAAAGTTG CCGAAGCCCT GCCGCCTGCA TTTTTGGCGG AAAATACCGT 

1701 GCGCGTGGCG GACAACCTCG TCATTCACGG GCTGCTGAAC CTGATTGCCG 

1751 CCGAAGGCGG GGAATCGGAA CATACTTAA 

This corresponds to the amino acid sequence <SEQ ID 1300; ORF 31 l-l.a>: 

a311-l.pep 

1 MTVLKPSHWR VLAELADGLP QHVSQLARMA DMKPQQLNGF WQQMPAHIRG 

51 LLRQHDGYWR LVRPLAVFDA EGLRELGERS GFQTALKHEC ASSNDEILEL 

101 ARIAPDKAHK TICVTHLQSK GRGRQGRKWS HRLGECLMFS FGWVFDRFQY 

151 ELGSLSPVAA VACRRALSRL GLKTQIKWPN DLWGRDKLG GILIETVRTG 

201 GKTVAWGIG INFVLPKEVE NAASVQSLFQ TASRRGNA DA AVLLETLLAE 

251 LDAVLL QYAR DGFAPFVAEY QAANRDHGKA VLLLRDGETV FEGTVKGVDG 

301 QGVLHLETAE GKQTWSGEI SLRSDDRPVS VPKRRDSERF LLLDGGNSRL 

351 KWAWVENGTF ATVGSAPYRD LSPLGAEWAE KVDGNVRIVG CAVCGEFKKA 

401 QVQEQLARKI EWLPSSAOAL GIRNHYRHPE EHGSDRWFNA LGSRRFSRNA 

A 51 CWVSCGTAV TVDALT DDGH YLGGTIMPGF HLMKESLAVR TANLNRHAGK 

501 RYPFPTTTGN AVASGMMDAV CGSVMMMHGR LKEKTGAGKP VDVIITGGGA 

551 AKVAEALPPA FLAENTVRVA DNLVIHGLLN LIAAEGGESE HT* 

a311-l/m311-l 98.5% identity in 591 aa overlap 

10 20 30 40 50 60 

MTVLKPSHWRVLAELADGLPQHVSQLARMADMKPQQLNGFWQQMPAHIRGLLRQHDGYWR 
I'"" I I I I I I I I I I I I I I M I !! II I I I I I I I I I I II I | I | | | f | | | | I | | | | | | | | | 
MTV L K LS H W RVL AELA DG LPQH V S QLARMA DMKP QQLN G FWQQM PAH I RG LLRQHDGYWR 
10 20 3C 40 50 60 

*?0 80 90 100 110 120 

LVRPLAVFDAEGLRELGERSGFQTALKHECASSNDEILELAR1APDKAHKTICV7HLQSK 
llllllllllllliltlillltilltllillllllillllllllllilllllll | | | | | 
LVRPLAVFDAEGLRELGERSGFQ1ALKHECASSNDEILELARIAPDKAHKTICVTHLQSK 
7 0 80 90 100 110 120 



a311-l.pep 
m311-l 

a311-l.pep 
m311-l 
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130 140 150 160 HO 180 

a311-l .pep GRGRQGRKWSHRLGECLMFSFGWVFDRPQYELGSLSPVAAVACRRALSRLGLKTQIKWPN 
I i I I i I I I I I I I t I I I I I i i I 11 I I I I I t I I t I I ! I I I I I I I I I I I ] I I I | | : | | | | | | 
m311- 1 GRGRQGRKWSHRLGECIMFSFGWVFDRPQYELGSLSPVAAVACRRAIiSRLGLDVQIKWPN 
130 140 15D 160 170 180 

190 200 210 220 230 240 

a 31 1-1. pep DLVVGRDKLGGILIETVRTGGKTVAWGIGINFVLPKEVENAASVQSLrQTASRRGNADA 

M II III MINIMI II 111! II I! II tl II II llllllllllll II Mllllllll | j 
m311-l DL WG RD KLGG I L I ETVRTGG KT VA WGI GIN FVLP KEV EN AAS VQS LFQTA S RRGN A DA 

190 200 210 220 230 240 

250 260 270 280 290 300 

a31 1- 1 . pep A VLL ETLLAEL DA VLLQY ARDG F AP FV AE "f QAANRDHGKA VLLLRDGET V FEGTVKG V DG 
N I ! M I I : I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I i I I I 
m31 1- 1 AVLLETLLVELDAVLLQYARDGFAPFVAEYQAANRDHGKAVLLLRDGETVFEGWKGVDG 

250 260 270 280 290 300 

310 320 330 340 350 360 

a311-l.pep QGVLHLETAEGKQTVVSGEISLRSDDRPVSVPKRRDSERFLLLDGGNSRLKWAWENGTr 
MIMMII II Ml II II lllll It I I III II II II II I II I I I II II II Mil I II I || 
m311-l QGVLHLETAEGKQTVVSGEISLRSDDRPVSVPKRRDSERFLLLDGGNSRLKWAWENGTF 

310 320 330 340 350 360 

370 380 390 400 410 420 

a3 11-1. pep ATVGSAPYRDLSPLGAEWAEKVDGNVRIVGCAVCGEFKKAQVQEQLARKIEWLPSSAQAL 
I I M I I I I I I I I II I I I I I I I :| I I I I I I I I I I I I I I I I I I | | | | I | | M M | | | | | | | | 
m3 1 1- 1 ATVGSAPYRDLSPLGAEWAEKADG-^VRIVGCAVCGEFKKAQVQEQLARKIEWLPSSAQAL 

370 380 390 400 410 420 

430 440 450 460 470 480 

a3 11-1. pep GIRNHYRHPEEHGSDRWFNALGSRRFSRNACVWSCGTAVTVDALTDDGHYLGGTIMPGF 
I II I I I I M I I I I I I I I I I I I I I I I I I I 1 I I I I I I I | I | | | | | I | | | j | | | | | | | | | | | j 
m311-l GIRNHYRHPEEHGSDRWFNALGSRRFSRNACVWSCGTAVTVDALTDDGHYLGGTIHPGF 
430 440 450 460 470 480 

490 500 510 520 530 540 

a3 1 1-1 . pep H1^KESLA\HRTANLNRHAGKRYPF?TT7GNAVASGMMDAVCGSVMMMHGRLKEKTGAGKP 

MIIN1IMIIIIIIIIIIIIIIIIIII lillllllll IIIIIIIIIIIIMIIIIII I 

m3 1 1 - 1 HLMKESLAVRTANLNRHAGKRYPFPTTTGtf AVASGMMDAVCGSVMMMHGRLKEKTGAGKP 
490 500 510 520 530 540 

550 560 570 580 590 

a311-l . pep VDVIITGGGAAKVAEALPPAFLAENTVRVADNLVIHGLIiNLIAAEGGESEHTX 

I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I | ! | | | : | | | |: | | | I I I I i 
m311-l VDVIITGGGAAKVAEALPPAFLAENTVRVADNLVIYGLLNMIAAEGREYEHIX 
550 560 570 580 590 



The following partial DNA sequence 



g312.aeq 






1 


atgaGtatCc 


aatCcGgcga 


51 


ccggaATttt 


gAtgtccgCA 


101 


gcatcagcac 


cgacatcgac 


151 


accacggccg 


gcaaagactT 


201 


ATACGGCGTG 


CCGATTGTGA 


251 


AaatcGCGGC 


GGcgaccaAa 


301 


tTGGACAAGG 


CAGCCAAAGC 


351 


CGCGCTGGTG 


CAAAAAGGTA 


401 


CCGTTCCCGA 


AG CGATG AAA 


451 


ATCGGCAGCA 


CGCGTGCCGG 


501 


CGAAACCATC 


AAACGCACGG 


551 


CCAAAATCGT 


CGTGTTCTGC 


601 


GGTGCGTTCC 


ACGGCTCGGG 


651 


ATCCGGTCCA 


GGCGTGGTCA 


701 


GCCTGACCGA 


GGTCGCCGAA 


751 


CGCGTGGGCG 


AACTCATCGG 


801 


GTTCGGCATT 


CTCGATTTGT 


851 


CGGTGGCGCG 


CATTCTTGAA 



i identified in N. gonorrhoeae <SEQ ID 1301>: 

AATTTtagaa accgtCAAAA TGGTTGCCGA 
CCATTAccat cggcaTTgaT ttgcacgact 
ctgttaAACC AAAACATt t a caaGAaaaTc 
GGTGGCAacg Gcgaaacacc tTTccgcCAA 
ATCAGCGCAT TTCCGTTACG CCGAttgccc 
gccgaCAGTT AtgtcAGCgt cgcgcAGact 
CATCGGCGTG TCCTTTATCG GcggCTTTTC 
TGTCGCCTTC GGATGAGGTG TTGATCCGTT 
ACTACCGATA TCGTGTGCAG CTCCATCAAT 
TATCAATATG GATGCGGTCA AGCTGGCAGG 
CTGAAATCAC ACCCGAAGGT TTCGGCTGCG 
AACGCGGTGG AAGACAATCC C-TTTATGGCG 
CGAAGCGGAT GCTGTGATTA ATGTCGGCGT 
AAGCCGCGCT GGAAAATTCG GACGCGGTCA 
GTCGTGAAGA AAACCGCTTT CAAAATCACC 
TCGCGAAGCC TCAAAAATGC TGAATATCCC 
CGCTGGCACC GACCCCCGCC GTCGGCGACT 
GAAATGGGCT TGAGCGTCTG CGGTACGCAC 
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901 GGCACAACAG CAGCTTTGGC ATTGCTGAAC GATGCCGTGA AAAAGGGCGG 
951 CATGATGGCT TCCAGCGCGG TCGGCGGTTT GAGCGGCGCG TTTATCCCCG 
1001 TTTCCGAAGA CGAAGGTATG ATTGCCGCCG CCGAGGCAGG CGTGTTGACG 
1051 CTGGACAAAC TCGAAGCCAT GACCGCCGTC TGCTCCGTTG GTTTGGACAT 
1101 GATTGCCGTT CCCGGCGACA CGCCCGCGCA CACCATTTCC GGCATCATCG 
1151 CCGACGAAGC CGCCATCGGC ATGATCAACA GCAAAACCAC CGCCGTGCGC 
1201 ATTATTCCGG TAACGGGCAA AACCGTCGGC GACAGCGTCG AGTTCGGCGG 
1251 TCTGTTGGGC TACGCGCCTG TAATGCCGGC AAAAGAAGGT TCGTGCGAAG 
1301 TGTTCGTCAA CCGGGGCGGC AGGATTCCCG CACCGGTTCA ATCGATGAAA 
1351 AACTGA 

This corresponds to the amino acid sequence <SEQ ID 1302; ORF 312.ng>: 



g312.pep 












1 


MSIQSGEILE 


TVKMVADRNF 


DVRTITIGID 


LHDCISTDID 


VLNQNIYNKI 


51 


TTVGKDLVAT 


AKHLSAKYGV 


PIVNQRISVT 


PIAQIAAATK ADSYVSVAQT 


101 


LDKAAKAIGV 


SFIGGFSALV 


0KGMSPSDEV 


LIRSVPEAMK 


TTDIVCSSIN 


151 


IGSTRAGINM 


DAVKLAGETI 


KRTAEITPEG 


FGCAKIWFC 


NAVEDNP FMA 


201 


GAFHGSGEAD 


AVINVGVSGP 


GWKAALENS 


DAVSLTEVAE 


WKKTAFKIT 


251 


RVGELIGREA 


SKMLNIPFGI 


LDLSLAPTPA 


VGDSVARILE 


3MGLSVCGTH 


301 


GTTAALALLN 


DAVKKGGMMA 


SSAVGGLSGA 


FIPVSEDEGM 


I AAAEAG VLT 


351 


LDKLEAMTAV 


CSVGLDMIAV 


PGDTPAHTIS 


GIIADEAAIG 


MINSKTTAVR 


401 


T I PVTGKTVG 


DSVEFGGLLG 


YAPVMPAKEG 


SCEVFVNRGG 


RIPAPVQSMK 


451 


N* 











The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1303>: 

m312.seq 

1 ATGAGTATCC AATCCGGCGA AATTTTAGAA. ACCGTCAAAA TGGTTGCCGA 

51 CCAGAATTTT GATGTCCGCA CCATTACCAT CGGCATTGAT TTGCACGACT 

101 GCATCAGCAG CGATATCAAT GTGTTGAACC AAAATATTTA CAATAAAATT 

151 ACCACAGTCG GCAAAGACTT GGTCACTACG GCAAAATATC TGTCTGCCAA 

201 ATACGGCGTA CCGATTGTGA ATCAGCGCAT TTCCGTTACG CCGATTGCCC 

251 AAATCGCGGC GGCCACCCAT GCTGATTCTT ACX3TCAGCGT GGCGCAAACT 

301 TTGGATAAAG CTGCCAAAGC CATCGGTGTG TCTTTTATCG GCGGTTTTTC 

351 CGCGTTGGTG CAAAAAGGGA TGTCGCcTTC GGATGAGGTG TTAATC CGCT 

401 CCATTCCCGA AGCGATGAAG ACTACCGATA TTGTGTGCwG CTCCATCAAT 

451 ATCGGCAGTA CGCGTGCCGG TATCAATATG GATGCGGTCA AGCTG3CGGG 

501 CGAAACCGTC AAACGCACGG CGGAAATCAC GCCCGAAGGT TTCGGCTGCG 

551 CTAAAATTGT CGTGTTCTGC AACGCGGTGG AAGACAACCC GTTTwTGGCG 

601 GGCGCGTTTC ATGGTT CGGG CGATGCCGTT ATCAATGTCG GCGTATCCGG 

651 CCCAGGTGTC GTAAAAGCCG CGTTGGAAAA TTCAGATGCA ACGACATTGA 

701 CCGAAGTTGC GGAAGTAGTG AAGAAAACTG CTTTCAAAAT TACCCGCGTG 

751 GGCGAACTCA TCGGCCGCGA AGCcTCAAAA ATGCTGAATA TCCCGTTTGG 

801 TATTCTCGAC TTGTCGCCGA CCCCGCCCGT CGGCGACTCA GTGGCACGCA 

851 TTCTTGAAGA AATGGGCTTG AGCGTCTGCG GTACGCACGG CACAACAGCA 

901 GCTTTGGCAT TGCTGAACGA TGCCGTGAAA AAAGGCGGCA 7GATGGCTTC 

951 CAGCGCGGTC GGGGGTTTGA GTGGCGCGTT TATCCCCGTT TCCGAAGACG 

1001 AAGGTATGAT yGmCgCcGCC GAAGCAGGCG TGCTGACGCT GGACAAACTC 

1051 GAAGCCATGA CCGCCGTTTG TTCGGTCGGC TTGGATATGA TTGCCGTTCC 

1101 CGGCGACACG CCCGCGCACA CCATTTCCGG CATCATTGCC GACGAAGCCG 

1151 CCATCGGCAt GATCAACAGC AAAACCACTG CCGTGCGCAT TATTCCGGTA 

1201 ACCGGTAAAA CCGTCGGCGA CAcGGTCGAG TTCGGCGGCT TGTTGGgCTA 

1251 CGCGCCTGTG ATGCCGGTCA AAGAAGGTTC GTGCGAAGTA TTCGTCAACC 

1301 GAGGCGGCAG AATTCCGGCT CCGGTTCAAT CGATGAAAAA CTGA 

This corresponds to the amino acid sequence <SEQ ID 1304; ORF 312>: 

m312 .pep 

1 MSIQSGEILE TVKMVADQNF DVRTITIGID LHDCISSDIN VLNQNIYNKI 

51 TTVGKDLVTT AKYLSAKYGV PIVNQRISVT PIAQIAAATH ADSYVSVAQT 

101 LD KAAKAIGV SFIGGFSALV QKGMSPSDEV LIRSIPEAMK TTDIVCXSIN 

151 IGSTRAGINM DAVKLAGETV KRTAEITPEG FGCAKIWFC NAVEDNPFXA 

201 GAFHGS GDAV INVGVSGPGV VKAALENS DA TTLTEVAEW KKTAFKITRV 

251 GELIGREASK MLNIPFGILD LSPTPPVGDS VARILEEMGL SVCGTHGTTA 

301 ALALLNDAVK KGGMMASSAV GGLSGAFIPV SEDEGMIXAA EAGVLTLDKL 

351 EAMTAVCSVG LDMIAV PGDT PAHTISGIIA DEAAIGMINS KTTAVRIIPV 

401 TGKTVGDTVE FGGLLGYAPV MPVKEGSCEV FVNRGGRI PA PVQSMJCN * 
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